Apache HADOOP is a framework used to develop data processing applications which are executed in a distributed computing environment.
In this tutorial we will learn,
Components of Hadoop…
Source for picture: click here
Here's the list (new additions, more than 30 articles marke…
Regularly crunching large amounts of public & proprietary data to make pinpointed predictions is a challenging task. Hadoop data processing is a very useful infrastructure layer to help in that process. However, Python is a programm…
Online (single pass) algorithm have been studied extensively. For variance Welford's method is the most popular http://jonisalonen.com/2013/deriving-welfords-method-for-computing-... higher stati…"
However, as we predicted we are seeing that t…
I think it depends on what one wants to do. Is the goal to become a Hadoop Administrator or just use the Hadoop platform to execute for instance some PIG scripts? If the goal is to execute…"
LIBLINEAR: fast algorithm for big data:
Another great comment, from Rahul Singh:
This describes common problems of applying machine learning on…"
Dr. Vincent Granville is a visionary data scientist with 15 years of big data, predictive modeling, digital and business analytics experience. Vincent is widely recognized as the leading expert in scoring technology, fraud detection and web traffic optimization and growth. Over the last ten years, he has worked in real-time credit card fraud detection with Visa, advertising mix optimization with CNET, change point detection with Microsoft, online user experience with Wells Fargo, search intelligence with InfoSpace, automated bidding with eBay, click fraud detection with major search engines, ad networks and large advertising clients. Most recently, Vincent launched Data Science Central, the leading social network for big data, business analytics and data science practitioners. Vincent is a former post-doctorate of Cambridge University and the National Institute of Statistical Sciences. He was among the finalists at the Wharton School Business Plan Competition and at the Belgian Mathematical Olympiads. Vincent has published 40 papers in statistical journals and is an invited speaker at international conferences. He also developed a new data mining technology known as hidden decision trees, owns multiple patents, published the first data science book, and raised $6MM in start-up funding. Vincent is a top 20 big data influencers according to Forbes, was featured on CNN,
My Web Site Or LinkedIn Profile
Field of Expertise
Big Data, Data Science, Analytics, Visualization, BI, Other
C-Level, Executive Management
Data Science Central
Networking, New venture