sqlandsiva.blogspot.com
Data, Data Science and QA Learning's : Hadoop Ecosystem Internals
http://sqlandsiva.blogspot.com/2014/10/hadoop-ecosystem-internals.html
Data, Data Science and QA Learning's. Started Primarily for SQL Learning's later extended to accommodate Data Science, QA, Domain, Tech, Work, Life related learning’s. No one is harder on a talented person than the person themselves" - Linda Wilkinson. Trust your guts and don't follow the herd" ; "Validate direction not destination" ;. October 06, 2014. This post is quick summary from learning session. Data Copy Basics (Writing data to HDFS). Data Storage size in 64MB Blocks. DataNode - Actual Data.
shmsoft.com
SHMsoft.com - Home
http://shmsoft.com/shmsoft-hadoop-illuminated-pr-media.html
SHMsoft, Inc. Adds Review Capabilities to its Open Source eDiscovery Software FreeEed. September 29, 2013, Houston. A leader in open source software for eDiscovery, is pleased to announce the latest additions to its FreeEed. FreeEed is completely open source under business-friendly Apache 2.0 license. It is based on Hadoop, the de-facto standard for Big Data back-end technologies, and works with all major Hadoop distributions. The work is underway to have it certified on HortonWorks HDP. Being that FreeE...
blog.cloudera.com
How-to: Create a CDH Cluster on Amazon EC2 via Cloudera Manager - Cloudera Engineering Blog
http://blog.cloudera.com/blog/2013/03/how-to-create-a-cdh-cluster-on-amazon-ec2-via-cloudera-manager
Skip to main content. Best practices, how-tos, use cases, and internals from Cloudera Engineering and the community. How-to: Create a CDH Cluster on Amazon EC2 via Cloudera Manager. March 26, 2013. Editor’s Note (added Feb. 25, 2015): For releases beyond 4.5, Cloudera recommends the use of Cloudera Director. For deploying CDH in cloud environments. Since Cloudera Manager and the nodes running CDH use internal hostnames to communicate, the Cloudera Manager server must run on EC2 as well. In fact, the ...
blog.yantrajaal.com
blog@Yantrajaal: Demystify Map Reduce and Hadoop with this DIY tutorial
http://blog.yantrajaal.com/2014/05/getting-started-with-mapreduce-and.html
May 31, 2014. Demystify Map Reduce and Hadoop with this DIY tutorial. Hadoop, IMHO, is history. Rather than waste time with all this, suggest you check up my blog post on Spark with Python. In an earlier post on Data Science - A DIY approach. I had explained how one can initiate a career in data science, or data analytics, by using free resources available on the web. Since Hadoop and Map Reduce is a tool and a technique that is very popular in data science, this post will get you started and help you.
synerzip.com
From Hadoop to Spark - Webinar - August 19, 2015 - Synerzip
http://synerzip.com/from-hadoop-to-spark-webinar
Opt-in to our Future Interests Lists *. This will become your site username. How do we work. Must See Agile TV. From Hadoop to Spark – Webinar – August 19, 2015. Download webinar PPT – From Hadoop to Spark Webinar PPT. Spark is an up-and-coming distributed computing platform in Big Data and it has a lot of momentum going for it. The venerable platform for Big Data is Apache Hadoop — deployed widely and scalable to Petabyte data sizes. So can the new-comer Spark be a viable replacement for Hadoop? Synerzi...
datafoam.com
Hadoop - Datafoam
http://datafoam.com/hadoop
Filling the gaps in big data. Hadoop Adoption Where is your organization? If you are just digging in, many resources are available to help you. I suggest you start with these:. Http:/ hadoop.apache.org. Http:/ www.cloudera.com/. Apache is the owner of the Hadoop project. Hortonworks and Cloudera are currently the top 2 vendors who provide a distribution of the Hadoop software. These books have been very helpful in getting up to speed with Hadoop application, concepts and support:.
SOCIAL ENGAGEMENT