hadoopblog.blogspot.com
HDFS: Facebook has the world's largest Hadoop cluster!
http://hadoopblog.blogspot.com/2010/05/facebook-has-worlds-largest-hadoop.html
This are my musings related to the Apache Hadoop Project. Sunday, May 9, 2010. Facebook has the world's largest Hadoop cluster! It is not a secret anymore! The Datawarehouse Hadoop cluster at Facebook has become the largest known Hadoop storage cluster in the world. Here are some of the details about this single HDFS cluster:. 21 PB of storage in a single. 12 TB per machine (a few machines have 24 TB each). 1200 machines with 8 cores each 800 machines with 16 cores each. 32 GB of RAM per machine. 800 TB ...
whymustmybloghaveaname.blogspot.com
Why Must My Blog Have a Name?: Integrating Nutch's Language Identifier Into Your Own Java App
http://whymustmybloghaveaname.blogspot.com/2010/09/integrating-nutchs-language-identifier.html
Why Must My Blog Have a Name? Tuesday, September 7, 2010. Integrating Nutch's Language Identifier Into Your Own Java App. I've been doing some analysis of twitter data lately, and one of the features that I've needed is a quick method for determining the language of a tweet. Twitter's API does contain a language field for each tweet, but as far as I can tell the value of this field must be set by the user (maybe when they configure their account? Google's Compact Language Detection. Is an open source n-g...
SOCIAL ENGAGEMENT