Subscribe to our Newsletter

Vincent Granville

Issaquah, WA


Short Bio Dr. Vincent Granville is a visionary data scientist with 15 years of big data, predictive modeling, digital and business analytics experience. Vincent is widely recognized as the leading expert in scoring technology, fraud detection and web traffic optimization and growth. Over the last ten years, he has worked in real-time credit card fraud detection with Visa, advertising mix optimization with CNET, change point detection with Microsoft, online user experience with Wells Fargo, search intelligence with InfoSpace, automated bidding with eBay, click fraud detection with major search engines, ad networks and large advertising clients. Most recently, Vincent launched Data Science Central, the leading social network for big data, business analytics and data science practitioners. Vincent is a former post-doctorate of Cambridge University and the National Institute of Statistical Sciences. He was among the finalists at the Wharton School Business Plan Competition and at the Belgian Mathematical Olympiads. Vincent has published 40 papers in statistical journals and is an invited speaker at international conferences. He also developed a new data mining technology known as hidden decision trees, owns multiple patents, published the first data science book, and raised $6MM in start-up funding. Vincent is a top 20 big data influencers according to Forbes, was featured on CNN,

My Web Site Or LinkedIn Profile

Field of Expertise Big Data, Data Science, Analytics, Visualization, BI, Other

Professional Status C-Level, Executive Management

Your Company: Data Science Central

Interests: Networking, New venture

Activity Feed
Blog Posts
This tutorial is provided by Guru99. Originally posted here. 
Apache HADOOP is a framework used to develop data processing applications which are executed in a distributed computing environment.
In this tutorial we will learn,

Components of Hadoop…
"Below is an update for San Francisco, in October 2015. The salary increase was substantial, like 30%, over the last two years. But there are fewer jobs, though it could be seasonal, or the fact that some of these jobs don't show up on any…"
Including NoSQL, Map-Reduce, Spark, big data, and more. This resource includes technical articles, books, training and general reading. Enjoy the reading!

Source for picture: click here
Here's the list (new additions, more than 30 articles marked w…
Guest blog post.
Regularly crunching large amounts of public & proprietary data to make pinpointed predictions is a challenging task. Hadoop data processing is a very useful infrastructure layer to help in that process. However, Python is a programm…
"Here's one recent comment from one of our readers:
Online (single pass) algorithm have been studied extensively. For variance Welford's method is the most popular higher stati…"
Here's a selection of Hadoop-related articles worth checking out. Enjoy the reading!

16 resources to learn and understand hadoop 
A synthetic variance designed for hadoop and big data 
Salaries for hadoop professionals 
Ebook getting started with…
Hadoop has been the foundation for data programmes since Big Data hit the big time. It has been the launching point for data programmes for almost every company who is serious about their data offerings.
However, as we predicted we are seeing that t…
-- (BOOK) "Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2" by Arun C. Murthy, Vinod Kumar Vavilapalli, Doug Eadline, Joseph Niemiec, Jeff Markham (Pearson/Addison-Wesley Professional, March 2014, ISBN 978032193…
"It's written in Java. There is a wiki for Python. Anyone interested in creating a Perl repository? If yes contact me at [email protected]"
"Here's an very interesting comment from one of our readers:
I think it depends on what one wants to do. Is the goal to become a Hadoop Administrator or just use the Hadoop platform to execute for instance some PIG scripts? If the goal is to execute…"
"Also read How to Use Excel 2013 to Access Hadoop Data."
"Here are a few answers from members:

LIBLINEAR: fast algorithm for big data:
Another great comment, from Rahul Singh:
This describes common problems of applying machine learning on…"
As a Perl, R and Python guy, what is the easiest way to get started with Hadoop? A few specific questions:

Could you install Hadoop on Windows (on my laptop)? The proceduredescribed here is a bit complicated. Some say you can even run Hadoop from y…