We left off, in part one of this series, at the point where we had Hadoop running with the Cloudera distribution, version 0.20.1+152. That’s Apache Hadoop release 0.20.1, plus 152 patches that Cloudera’s copious experience tells them they need for work in the real world. But perhaps we’re using, say, Hadoop streaming, and we read […]
INTRO Here at StyleFeeder, we spend a lot of time figuring out what our users are doing, and trying to figure out what they want. One of the tools we have brought to bear on these questions is Hadoop. Among the technical tools these days, Hadoop is like the prettiest girl in school, and it’s […]
A friend asked me about StyleFeeder‘s experience using CDNs, so I sent him the list of criteria that we use to evaluate the various content delivery networks that we have tried. We’re currently using Akamai, Cloudfront and Panther for various types of content. I’ve talked to pretty much everybody in the CDN space over the […]
Here at StyleFeeder, we do a lot of things for the sake of performance. We recently decided to take a set of processes that we had running on a few large EC2 instances over at Amazon Web Services, and consolidate them into a couple of clusters. First, you may ask, why use AWS at all? […]
When you have smashed your head into the table trying to get the included .pac file to work for Cloudera‘s EC2 Hadoop setup and want something that works properly in FoxyProxy, simply use the following URL patterns (available in text below the graphic for your cut/paste pleasure): As promised, *://10* *ec2*.amazonaws.com* *ec2.internal*
We are in the process of migrating one of our backend dataprocessing servers from a legacy hosting company in NYC to Contegix. What’s unusual about this transition is that we’re moving the machine onto Contegix’s new cloud platform rather to a traditional server. We’ve noticed a few things already. When we were copying over a […]