Monday, November 9
 

8:00am

Registration & Breakfast
Food and introduction

Monday November 9, 2015 8:00am - 9:00am
Computer Science History Museum

9:00am

Welcome to H2O & What's new!
Speakers

SriSatish Ambati

CEO and co-founder, H2O.ai
Sri is co-founder and CEO of H2O (@h2oai), the builders of H2O. H2O democratizes big data science and makes Hadoop do math for better predictions. Before H2O, Sri spent time scaling R over big data with researchers at Purdue and Stanford. Prior to that, Sri co-founded Platfora and was the Director of Engineering at DataStax. Before that, Sri was Partner & Performance Engineer at the Java multi-core startup Azul Systems, tinkering with the...

Arno Candel

Chief Architect, H2O.ai
Arno is the Chief Architect of H2O, a distributed and scalable open-source machine learning platform. He is also the main author of H2O’s Deep Learning. Before joining H2O, Arno was a founding Senior MTS at Skytree where he designed and implemented high-performance machine learning algorithms. He has over a decade of experience in HPC with C++/MPI and had access to the world’s largest supercomputers as a Staff Scientist at SLAC...

Hank Roark

Data Scientist and Hacker, H2O.ai
Hank is a Data Scientist & Hacker at H2O. Hank comes to H2O with a background turning data into products and system solutions and loves helping others find value in their data. He has a deep background in the application domains of telematics, remote sensing, logistics, manufacturing, agriculture, and the Internet of Things. Before becoming passionate about machine intelligence, Hank managed international software teams and worked as IT...


Monday November 9, 2015 9:00am - 9:15am
Ramanujan Stage

9:00am

Welcome to H2O & What's new!
Speakers

SriSatish Ambati

CEO and co-founder, H2O.ai
Sri is co-founder and CEO of H2O (@h2oai), the builders of H2O. H2O democratizes big data science and makes Hadoop do math for better predictions. Before H2O, Sri spent time scaling R over big data with researchers at Purdue and Stanford. Prior to that, Sri co-founded Platfora and was the Director of Engineering at DataStax. Before that, Sri was Partner & Performance Engineer at the Java multi-core startup Azul Systems, tinkering with the...

Arno Candel

Chief Architect, H2O.ai
Arno is the Chief Architect of H2O, a distributed and scalable open-source machine learning platform. He is also the main author of H2O’s Deep Learning. Before joining H2O, Arno was a founding Senior MTS at Skytree where he designed and implemented high-performance machine learning algorithms. He has over a decade of experience in HPC with C++/MPI and had access to the world’s largest supercomputers as a Staff Scientist at SLAC...

Hank Roark

Data Scientist and Hacker, H2O.ai
Hank is a Data Scientist & Hacker at H2O. Hank comes to H2O with a background turning data into products and system solutions and loves helping others find value in their data. He has a deep background in the application domains of telematics, remote sensing, logistics, manufacturing, agriculture, and the Internet of Things. Before becoming passionate about machine intelligence, Hank managed international software teams and worked as IT...


Monday November 9, 2015 9:00am - 9:15am
Erdos Stage

9:00am

Hackathon
Speakers

Nidhi Mehta

Hacker, H2O.ai
Nidhi is a Hacker at H2O. She has a background in Physics. Nidhi is an avid H2O and R user with a keen interest in large scale data analysis and machine learning algorithms.


Monday November 9, 2015 9:00am - 12:15pm
Boole

9:15am

Install & Upgrade H2O
Monday November 9, 2015 9:15am - 9:30am
Erdos Stage

9:15am

Install & Upgrade H2O
Monday November 9, 2015 9:15am - 9:30am
Ramanujan Stage

9:30am

Introduction to Data Science
Speakers

Erin LeDell

Data Scientist & Hacker, H2O.ai
Erin is a Statistician and Machine Learning Scientist at H2O.ai. She is the main author of H2O Ensemble. Before joining H2O, she was the Principal Data Scientist at Wise.io and Marvin Mobile Security (acquired by Veracode in 2012) and the founder of DataScientific, Inc. Erin received her Ph.D. in Biostatistics with a Designated Emphasis in Computational Science and Engineering from the University of California, Berkeley. Her research focuses on...


Monday November 9, 2015 9:30am - 10:00am
Erdos Stage

9:30am

Top 10 Data Science Pitfalls
Speakers

Mark Landry

Competition Data Scientist & Product Manager, H2O.ai
Mark Landry is a competition data scientist and product manager at H2O. He enjoys testing ideas in Kaggle competitions, where he is ranked in the top 100 in the world (top 0.03%) and well-trained in getting quick solutions to iterate over. Most at home in SQL, he found H2O through hacking in R. Interests are multi-model architectures and helping the world make fewer models that perform worse than the mean.


Monday November 9, 2015 9:30am - 10:00am
Ramanujan Stage

10:00am

What's new in R, Python, Flow
Speakers

Spencer Aiello

Hacker, H2O.ai
Spencer is a tall drink of H2O: mysterious and furry. He enjoys long bike rides in the scorching sun. He has no life beyond H2O.

Cliff Click

CTO and co-founder, H2O.ai
Cliff Click is the CTO and Co-Founder of H2O, makers of H2O, the open-source math and machine learning engine for Big Data. Cliff wrote his first compiler when he was 15 (Pascal to TRS Z-80!), although Cliff’s most famous compiler is the HotSpot Server Compiler (the Sea of Nodes IR). Cliff helped Azul Systems build an 864-core pure-Java mainframe that keeps GC pauses on 500 GB heaps to under 10 ms, and worked on all aspects of that...


Monday November 9, 2015 10:00am - 10:30am
Ramanujan Stage

10:00am

Intro to R, Python, Flow
Speakers

Amy Wang

Math Hacker, H2O.ai
Amy is a math hacker at H2O, as well as the Sales Engineering Lead. She graduated from Hunter College in NYC with a Master's in Applied Mathematics and Statistics with a heavy concentration on numerical analysis and financial mathematics. Her interest in applicable math eventually led her to big data and finding the appropriate mediums for data analysis.


Monday November 9, 2015 10:00am - 10:30am
Erdos Stage

10:30am

Coffee
Monday November 9, 2015 10:30am - 10:45am
Ramanujan Stage

10:30am

Coffee
Monday November 9, 2015 10:30am - 10:45am
Computer Science History Museum

10:45am

GLM - Tomas
Speakers

Tomas Nykodym

Engineer & Hacker, H2O.ai
Tomas is our resident Software Engineer. He received his Master's degree from the Czech Technical University. Tomas has worked at IBM Research and Agent-Technology Group. He has participated in several projects related to malware detection/protection funded by the US Air Force. Specifically, he developed a system for modeling software behavior using compressed graphs of the system calls made on the system. Tomas also created a sandbox with simulated...


Monday November 9, 2015 10:45am - 11:30am
Ramanujan Stage

10:45am

Panel - Practical Data Science - Challenges, pitfalls
Moderators

SriSatish Ambati

CEO and co-founder, H2O.ai
Sri is co-founder and CEO of H2O (@h2oai), the builders of H2O. H2O democratizes big data science and makes Hadoop do math for better predictions. Before H2O, Sri spent time scaling R over big data with researchers at Purdue and Stanford. Prior to that, Sri co-founded Platfora and was the Director of Engineering at DataStax. Before that, Sri was Partner & Performance Engineer at the Java multi-core startup Azul Systems, tinkering with the...

Speakers

Prasanta Behara

SVP Engineering, ScoreData
Technical leader with a deep background in building enterprise and internet software and Big Data products. Proven record of building products from the concept stage and launching them successfully in the marketplace. Experience: startup as well as large-company experience; built the ShareThis real-time advertising team of data scientists and engineers and scaled it to handle billions of bids per day with few 10s...

Brendan Herger

Data Scientist, Capital One Financial Corporation
Brendan Herger is a Data Scientist, helping to bring large scale machine learning to Capital One. 

Fonda Ingram

Performance Hacker and Technical Marketing, H2O.ai
Fonda Ingram works for H2O as a Performance Hacker and Technical Manager. She received her Ph.D. in Computer Engineering from North Carolina State University in 1998. In addition to receiving her Ph.D. at North Carolina State she also received a Master of Science degree in Computer Engineering and a Bachelor of Science degree in Computer Science. While in graduate school, she participated in the NASA graduate student research fellowship program...

Mark Landry

Competition Data Scientist & Product Manager, H2O.ai
Mark Landry is a competition data scientist and product manager at H2O. He enjoys testing ideas in Kaggle competitions, where he is ranked in the top 100 in the world (top 0.03%) and well-trained in getting quick solutions to iterate over. Most at home in SQL, he found H2O through hacking in R. Interests are multi-model architectures and helping the world make fewer models that perform worse than the mean.

Nick Martin

Data Scientist & Hacker, H2O
Nick is a Data Scientist at H2O and spends most of his time working alongside customers to develop data products. Prior to H2O he spent several years building Hadoop-backed data science capabilities in healthcare and finance and worked on some interesting projects with DoD and affiliated agencies. When he’s not neck-deep in customer data he’s having adventures with his wife and two kids. He basically only reads Foucault and...


Monday November 9, 2015 10:45am - 11:45am
Erdos Stage

11:30am

Python Pipelines
Speakers

Spencer Aiello

Hacker, H2O.ai
Spencer is a tall drink of H2O: mysterious and furry. He enjoys long bike rides in the scorching sun. He has no life beyond H2O.


Monday November 9, 2015 11:30am - 12:15pm
Ramanujan Stage

11:30am

Lunch - Food Trucks
Please head out to the food trucks downstairs

Monday November 9, 2015 11:30am - 1:30pm
Computer Science History Museum

11:45am

Gradient Boosting Method and Random Forest
Speakers

Mark Landry

Competition Data Scientist & Product Manager, H2O.ai
Mark Landry is a competition data scientist and product manager at H2O. He enjoys testing ideas in Kaggle competitions, where he is ranked in the top 100 in the world (top 0.03%) and well-trained in getting quick solutions to iterate over. Most at home in SQL, he found H2O through hacking in R. Interests are multi-model architectures and helping the world make fewer models that perform worse than the mean.


Monday November 9, 2015 11:45am - 12:30pm
Erdos Stage

12:30pm

Lunch
Monday November 9, 2015 12:30pm - 1:30pm
Erdos Stage

12:30pm

Lunch - Food Trucks
Monday November 9, 2015 12:30pm - 1:30pm
Ramanujan Stage

12:30pm

Lunch N' Learn with Arno Candel
Speakers

Arno Candel

Chief Architect, H2O.ai
Arno is the Chief Architect of H2O, a distributed and scalable open-source machine learning platform. He is also the main author of H2O’s Deep Learning. Before joining H2O, Arno was a founding Senior MTS at Skytree where he designed and implemented high-performance machine learning algorithms. He has over a decade of experience in HPC with C++/MPI and had access to the world’s largest supercomputers as a Staff Scientist at SLAC...


Monday November 9, 2015 12:30pm - 1:30pm
TBA

12:30pm

Lunch N' Learn with Mark Landry
Speakers

Mark Landry

Competition Data Scientist & Product Manager, H2O.ai
Mark Landry is a competition data scientist and product manager at H2O. He enjoys testing ideas in Kaggle competitions, where he is ranked in the top 100 in the world (top 0.03%) and well-trained in getting quick solutions to iterate over. Most at home in SQL, he found H2O through hacking in R. Interests are multi-model architectures and helping the world make fewer models that perform worse than the mean.


Monday November 9, 2015 12:30pm - 1:30pm
TBA

1:00pm

Hack H2O with Cliff Click
An hour on core H2O internals with Cliff, followed by an interactive session on hacking an algorithm into H2O.

Speakers

Cliff Click

CTO and co-founder, H2O.ai
Cliff Click is the CTO and Co-Founder of H2O, makers of H2O, the open-source math and machine learning engine for Big Data. Cliff wrote his first compiler when he was 15 (Pascal to TRS Z-80!), although Cliff’s most famous compiler is the HotSpot Server Compiler (the Sea of Nodes IR). Cliff helped Azul Systems build an 864-core pure-Java mainframe that keeps GC pauses on 500 GB heaps to under 10 ms, and worked on all aspects of that...


Monday November 9, 2015 1:00pm - 4:00pm
TBA

1:30pm

Calling Your Shots: How to ask Smarter Questions to Make Better Business Decisions - Chen & Jessica
Virtually everyone today wants to make “data-driven decisions”. But what does this really mean? How can you use data to call your shots? It begins by asking smarter questions so that you can make better business decisions. Sometimes, this starts with the question: “what do you want to know?” This isn’t a Jedi-mind trick; it's about translating business questions into data science concepts like regression, classification, clustering, and machine learning—to name a few.

Join Chen Huang, Data Strategist from Silicon Valley Data Science, and Jessica Lanford, Technical Writer from H2O.ai, as they bridge the gap between the worlds of business and data science. Chen will provide her business perspective, explaining data science concepts with relevant examples from her experience in the consulting, industry, and analytics fields. Jessica will provide a technical perspective by explaining the data science terminologies in an understandable manner.

Speakers

Chen Huang

Data Strategist, Silicon Valley Data Science
Chen Huang is a Data Strategist at Silicon Valley Data Science. Drawing on her background in consulting, various industries, and analytics product companies, Chen brings a unique perspective on all aspects of data strategy, data transformation, analytics, and business decision making. Chen has knowledge and experience in global strategy rollouts, analytics, visualizations, and product management. Her specialty is bridging the gap between...

Jessica Lanford

Documentation Hacker, H2O.ai
Jessica is a word hacker and seasoned technical communicator at H2O.ai. She brings our product to life by documenting the many features and functionality of H2O. Having worked for some of the top companies in technology including Dell, AT&T, and Lam Research, she is an expert at translating complex ideas to digestible articles.


Monday November 9, 2015 1:30pm - 2:15pm
Boole

1:30pm

H2O Rains with Databricks Cloud for Spark
Speakers

Richard Garris

Big Data Solutions, Databricks
I am an experienced manager and Senior Solutions Architect who has led business and technology projects for both small startups and Fortune 500 enterprises. My most recent experience is delivering data science solutions using Apache Spark at Databricks. My technical background includes enterprise Java (J2EE), Big Data (Hadoop and Spark), relational databases (Oracle, MySQL, DB2, SQL Server), BI (Cognos, SQL Server BI, Tableau), Machine...

Michal Malohlava

Hacker & Developer, H2O.ai
Michal is a geek, developer, and Java, Linux, and programming languages enthusiast who has been developing software for over 10 years. He obtained his PhD from Charles University in Prague in 2012 and completed a post-doc at Purdue University. During his studies he was interested in the construction of not only distributed but also embedded and real-time component-based systems using model-driven methods and domain-specific languages. He participated in design and...


Monday November 9, 2015 1:30pm - 2:15pm
Ramanujan Stage

1:30pm

Deep Learning - Arno
Speakers

Arno Candel

Chief Architect, H2O.ai
Arno is the Chief Architect of H2O, a distributed and scalable open-source machine learning platform. He is also the main author of H2O’s Deep Learning. Before joining H2O, Arno was a founding Senior MTS at Skytree where he designed and implemented high-performance machine learning algorithms. He has over a decade of experience in HPC with C++/MPI and had access to the world’s largest supercomputers as a Staff Scientist at SLAC...


Monday November 9, 2015 1:30pm - 2:15pm
Erdos Stage

2:15pm

Sparkling Water
Speakers

Michal Malohlava

Hacker & Developer, H2O.ai
Michal is a geek, developer, and Java, Linux, and programming languages enthusiast who has been developing software for over 10 years. He obtained his PhD from Charles University in Prague in 2012 and completed a post-doc at Purdue University. During his studies he was interested in the construction of not only distributed but also embedded and real-time component-based systems using model-driven methods and domain-specific languages. He participated in design and...


Monday November 9, 2015 2:15pm - 3:00pm
Boole

2:15pm

GBM and Random Forest - Mark
Speakers

Mark Landry

Competition Data Scientist & Product Manager, H2O.ai
Mark Landry is a competition data scientist and product manager at H2O. He enjoys testing ideas in Kaggle competitions, where he is ranked in the top 100 in the world (top 0.03%) and well-trained in getting quick solutions to iterate over. Most at home in SQL, he found H2O through hacking in R. Interests are multi-model architectures and helping the world make fewer models that perform worse than the mean.


Monday November 9, 2015 2:15pm - 3:00pm
Ramanujan Stage

2:15pm

GLM - Tomas
Speakers

Tomas Nykodym

Engineer & Hacker, H2O.ai
Tomas is our resident Software Engineer. He received his Master's degree from the Czech Technical University. Tomas has worked at IBM Research and Agent-Technology Group. He has participated in several projects related to malware detection/protection funded by the US Air Force. Specifically, he developed a system for modeling software behavior using compressed graphs of the system calls made on the system. Tomas also created a sandbox with simulated...


Monday November 9, 2015 2:15pm - 3:00pm
Erdos Stage

3:00pm

Py Sparkling Water - Nidhi
Speakers

Nidhi Mehta

Hacker, H2O.ai
Nidhi is a Hacker at H2O. She has a background in Physics. Nidhi is an avid H2O and R user with a keen interest in large scale data analysis and machine learning algorithms.


Monday November 9, 2015 3:00pm - 3:45pm
Boole

3:00pm

Ensembles (Erin)
Speakers

Erin LeDell

Data Scientist & Hacker, H2O.ai
Erin is a Statistician and Machine Learning Scientist at H2O.ai. She is the main author of H2O Ensemble. Before joining H2O, she was the Principal Data Scientist at Wise.io and Marvin Mobile Security (acquired by Veracode in 2012) and the founder of DataScientific, Inc. Erin received her Ph.D. in Biostatistics with a Designated Emphasis in Computational Science and Engineering from the University of California, Berkeley. Her research focuses on...


Monday November 9, 2015 3:00pm - 3:45pm
Erdos Stage

3:00pm

GLRM
Speakers

Anqi Fu

Math Hacker, H2O.ai
Anqi is a math hacker at H2O, where she implements and tests distributed machine learning algorithms. Anqi worked on network security algorithms with the founder of RioRey, and spent summers conducting physics research at the Naval Research Laboratory and NIST. Anqi holds Master's degrees in Statistics and Economics from Stanford University, and a Bachelor's in Electrical Engineering from the University of Maryland, College Park.


Monday November 9, 2015 3:00pm - 3:45pm
Ramanujan Stage

3:45pm

Coffee
Monday November 9, 2015 3:45pm - 4:00pm
Erdos Stage

3:45pm

Coffee
Monday November 9, 2015 3:45pm - 4:00pm
Ramanujan Stage

3:45pm

Coffee Break
Monday November 9, 2015 3:45pm - 4:00pm
Computer Science History Museum

4:00pm

Sparkling Water - Michal
Speakers

Michal Malohlava

Hacker & Developer, H2O.ai
Michal is a geek, developer, and Java, Linux, and programming languages enthusiast who has been developing software for over 10 years. He obtained his PhD from Charles University in Prague in 2012 and completed a post-doc at Purdue University. During his studies he was interested in the construction of not only distributed but also embedded and real-time component-based systems using model-driven methods and domain-specific languages. He participated in design and...


Monday November 9, 2015 4:00pm - 4:45pm
Erdos Stage

4:00pm

Panel - Smart Applications
Smart Applications, Data Products and Algorithms - an example-led discussion.
Discuss the components and internals of a Smart Application.

Moderators

Vinod Iyengar

Director of Marketing & Growth
Vinod comes with over 7 years of Marketing & Data Science experience in multiple startups. He was the founding employee for his previous startup, Activehours, where he helped build the product and bootstrap the user acquisition with growth hacking. He has seen the user base for his companies grow from scratch to millions of customers. He’s built models to score leads, reduce churn, increase conversion, prevent fraud and many...

Speakers

Kanishk

VP of Customer Applications, H2O.ai
Kanishk Priyadarshi is a High-tech and Business Analytics executive based out of San Francisco and Silicon Valley. He is currently focused on Machine Learning and AI Apps for the "Intelligent Enterprise" and the "Internet of Things". He has advised companies such as eBay, Google, Zynga, HP, Cisco, and Intel on how to make better use of their data.

Tom Kraljevic

VP of Customer & Sales Engineering, H2O.ai
Tom is VP of Engineering of Customer and Pre-Sales Engineering at H2O & key to the magic of engineering & customer happiness. A quintessential team member - Tom is an extraordinary blend of technical genius, customer focus and management by example. Before joining H2O, Tom was Co-founder & CTO at Luminix, where he and the team developed a cutting-edge offline mobile application for Salesforce users. This involved a healthy...

Mark Masterson

Application Engineer, Kenandy Inc.
Mark is an Application Engineer at Kenandy Inc. building the next generation of ERP in the Cloud. In his spare time he likes to abuse his Safari Books Online membership and jam on his Telecaster (which may be considered abuse to everyone else's ears). Mark is also a very proud Dog Father.

Merritte Stidston

Director, Chief Information Architect, McKesson Corporation

Venkatesh Yadav

‎Sr. Manager, Software Engineering, Adobe
Venkatesh Yadav (@venky7) is an innovative entrepreneurial thought leader, who is passionate about technology and entrepreneurship. He is a disciplined and driven technologist with a focus on value creation and team building. His areas of interest include Big Data, Cloud Technologies, Internet of Things and Mobile Application Development. As a technology leader at Adobe, he is primarily focused on Master Data Management and Data Science in...


Monday November 9, 2015 4:00pm - 5:00pm
Ramanujan Stage

4:45pm

Panel - Competitive Data Science
Moderators

Arno Candel

Chief Architect, H2O.ai
Arno is the Chief Architect of H2O, a distributed and scalable open-source machine learning platform. He is also the main author of H2O’s Deep Learning. Before joining H2O, Arno was a founding Senior MTS at Skytree where he designed and implemented high-performance machine learning algorithms. He has over a decade of experience in HPC with C++/MPI and had access to the world’s largest supercomputers as a Staff Scientist at SLAC...

Speakers

Phillip Adkins

Data Scientist, Banjo
Kaggle Master, with peak Kaggle ranking of 99th. Data scientist at Banjo. First place in epileptic seizure competition, as covered by NPR and the Washington Post.

Nick Kridler

Data Scientist, Stitch Fix
Nick Kridler is a Data Scientist on the Merch Algorithms team at Stitch Fix. He develops decision aids for the merchandising team and authored Pyxley, a Python package for simple web applications. Before working in fashion, he developed remote sensing and computer vision algorithms for the defense industry. Nick has competed in several data science competitions on Kaggle. As a member of team SluiceBox, Nick won both Cornell Right Whale...

Mark Landry

Competition Data Scientist & Product Manager, H2O.ai
Mark Landry is a competition data scientist and product manager at H2O. He enjoys testing ideas in Kaggle competitions, where he is ranked in the top 100 in the world (top 0.03%) and well-trained in getting quick solutions to iterate over. Most at home in SQL, he found H2O through hacking in R. Interests are multi-model architectures and helping the world make fewer models that perform worse than the mean.

John Park

Data Scientist, HPE Security Research
Kaggle Master, with peak Kaggle ranking of 24th. John is a Principal Data Scientist at Hewlett Packard Enterprise, Security Research Division. John spent his youth as a high school mathlete, and majored in EECS and Cognitive Science at UC Berkeley. After graduation, he worked in the cybersecurity industry, reverse-engineering binary files and classifying them as malware. It was a grueling and complex task that never ends, yet does not have a slack...

Lauren Savage

Data Scientist, AT&T
Data Scientist at AT&T Big Data: I have experience building predictive models and working with large datasets on distributed computing systems. I use R, Python, Hadoop, and Pig. I have a B.S. from Caltech and will graduate with an M.S. in Statistics this summer.

Guocong Song

Data Scientist / Software Engineer, Playground.Global
Kaggle Master, with peak Kaggle ranking of 8th. Guocong has achieved 5 wins out of 10 data science competitions up to 2014. Before moving to the field of machine learning and Internet, he had been dedicated to wireless communications for a decade. He received IEEE Stephen O. Rice Prize for the best original paper in 2010, and co-authored a book titled “Energy and Spectrum Efficient Wireless Network Design” that will be published by...


Monday November 9, 2015 4:45pm - 5:45pm
Erdos Stage

5:00pm

Migrating from Proprietary Analytics Stacks to Open Source H2O
Speakers

Fonda Ingram

Performance Hacker and Technical Marketing, H2O.ai
Fonda Ingram works for H2O as a Performance Hacker and Technical Manager. She received her Ph.D. in Computer Engineering from North Carolina State University in 1998. In addition to receiving her Ph.D. at North Carolina State she also received a Master of Science degree in Computer Engineering and a Bachelor of Science degree in Computer Science. While in graduate school, she participated in the NASA graduate student research fellowship program...


Monday November 9, 2015 5:00pm - 5:45pm
Ramanujan Stage

5:45pm

Building a smart application - Hands on - Tom
Speakers

Tom Kraljevic

VP of Customer & Sales Engineering, H2O.ai
Tom is VP of Engineering of Customer and Pre-Sales Engineering at H2O & key to the magic of engineering & customer happiness. A quintessential team member - Tom is an extraordinary blend of technical genius, customer focus and management by example. Before joining H2O, Tom was Co-founder & CTO at Luminix, where he and the team developed a cutting-edge offline mobile application for Salesforce users. This involved a healthy...


Monday November 9, 2015 5:45pm - 6:30pm
Erdos Stage

5:45pm

Deep Learning
Speakers

Arno Candel

Chief Architect, H2O.ai
Arno is the Chief Architect of H2O, a distributed and scalable open-source machine learning platform. He is also the main author of H2O’s Deep Learning. Before joining H2O, Arno was a founding Senior MTS at Skytree where he designed and implemented high-performance machine learning algorithms. He has over a decade of experience in HPC with C++/MPI and had access to the world’s largest supercomputers as a Staff Scientist at SLAC...


Monday November 9, 2015 5:45pm - 6:30pm
Ramanujan Stage
 
Tuesday, November 10
 

8:00am

Registration & Breakfast
Food and introduction

Tuesday November 10, 2015 8:00am - 9:00am
Computer Science History Museum

9:00am

Sri and Team
Speakers

SriSatish Ambati

CEO and co-founder, H2O.ai
Sri is co-founder and CEO of H2O (@h2oai), the builders of H2O. H2O democratizes big data science and makes Hadoop do math for better predictions. Before H2O, Sri spent time scaling R over big data with researchers at Purdue and Stanford. Prior to that, Sri co-founded Platfora and was the Director of Engineering at DataStax. Before that, Sri was Partner & Performance Engineer at the Java multi-core startup Azul Systems, tinkering with the...


Tuesday November 10, 2015 9:00am - 9:30am
Erdos Stage

9:00am

Sri and Team
Speakers

SriSatish Ambati

CEO and co-founder, H2O.ai
Sri is co-founder and CEO of H2O (@h2oai), the builders of H2O. H2O democratizes big data science and makes Hadoop do math for better predictions. Before H2O, Sri spent time scaling R over big data with researchers at Purdue and Stanford. Prior to that, Sri co-founded Platfora and was the Director of Engineering at DataStax. Before that, Sri was Partner & Performance Engineer at the Java multi-core startup Azul Systems, tinkering with the...


Tuesday November 10, 2015 9:00am - 9:30am
Ramanujan Stage

9:30am

Machine learning platform for Smart Applications - Future of H2O
TBD

Speakers

SriSatish Ambati

CEO and co-founder, H2O.ai
Sri is co-founder and CEO of H2O (@h2oai), the builders of H2O. H2O democratizes big data science and makes Hadoop do math for better predictions. Before H2O, Sri spent time scaling R over big data with researchers at Purdue and Stanford. Prior to that, Sri co-founded Platfora and was the Director of Engineering at DataStax. Before that, Sri was Partner & Performance Engineer at the Java multi-core startup Azul Systems, tinkering with the...


Tuesday November 10, 2015 9:30am - 10:00am
Erdos Stage

9:30am

Machine learning platform for Smart Applications - Future of H2O
TBD

Speakers

SriSatish Ambati

CEO and co-founder, H2O.ai
Sri is co-founder and CEO of H2O (@h2oai), the builders of H2O. H2O democratizes big data science and makes Hadoop do math for better predictions. Before H2O, Sri spent time scaling R over big data with researchers at Purdue and Stanford. Prior to that, Sri co-founded Platfora and was the Director of Engineering at DataStax. Before that, Sri was Partner & Performance Engineer at the Java multi-core startup Azul Systems, tinkering with the...


Tuesday November 10, 2015 9:30am - 10:00am
Ramanujan Stage

10:00am

Solving customer churn with Machine learning - Julian Bharadwaj
Consumer churn is a critical KPI for many organizations, both subscription-based and e-commerce. This presentation discusses a framework for the quantitative understanding of churn, compares the performance of various machine learning algorithms, and lists technical challenges and solutions. It begins with a set of simple probability distributions, explains the tasks performed during the exploratory data analysis phase and, finally, compares the results from different machine learning algorithms: Random Forests, Gradient Boosting Machine, Support Vector Machines and Deep Learning, among others. It also details who the different consumers of the data product are and how it's presented to each persona.

Speakers

Julian Bharadwaj

Data Scientist, PayPal
Julian Bharadwaj is a Data Scientist at PayPal in the Analytics Centre of Excellence. He has been at PayPal for 3 years and, before that, spent over a decade in the travel industry doing everything from software development to consulting, but always looking to the data to solve problems. His current interest is in customer lifecycle, specifically customer churn as applied in the eCommerce world. He holds a Bachelor's degree in Mechanical... Read More →


Tuesday November 10, 2015 10:00am - 10:30am
Ramanujan Stage

10:00am

Building Data Products for Data Natives - Monica Rogati
We increasingly expect the world around us to be “smart” and seamlessly adapt to our taste and habits. Building this world is a difficult task — and it requires new ways of thinking about data. We'll talk about what it takes to build data products — from analytics, exploration and technical challenges to the role of user feedback and machine learning.

Speakers

Monica Rogati

Data Science Advisor, Equity Partner at Data Collective, former VP of Data at Jawbone
Monica Rogati (@mrogati) is a leader in the field of data science. She built key data products and teams at Jawbone and LinkedIn; she is now a widely recognized advisor and speaker. | | As the VP of Data, Monica built Jawbone’s data science and engineering team, focusing on developing data products that helped millions lead healthier lives. Her team also analyzed Jawbone’s data to derive novel insights about sleep, movement and food... Read More →


Tuesday November 10, 2015 10:00am - 11:00am
Erdos Stage

10:30am

Machine Learning at Comcast - Andrew Leamon & Chushi Ren
With 10s of Millions of Customers and 100s of Millions of Devices, it’s not hard to reach Big Data scale. Comcast has Big Data; there is no question about that! Until recently, applying machine learning to these datasets required sampling or very expensive solutions. Comcast is now using H2O.ai as part of its Advanced Analytics toolbox. In this talk we will walk through some of the applications of Machine Learning at Comcast. Hear how we are driving rich features in our X1 Products, improving Customer Service, increasing the efficiency and reliability of our network and helping our products to self-heal.

Speakers

Andrew Leamon

Director - Engineering Analysis, Comcast
Drew Leamon started his career at Microsoft while studying Computer Science at Princeton University.   In his studies, he delved into Computer Graphics, Artificial Intelligence and Computational Neurobiology.  At Microsoft, he collaborated with Microsoft Research on one of the first commercial implementations of collaborative filtering for e-commerce.  This was released as Microsoft Site Server: Commerce Edition. |  Graduating... Read More →

Chushi Ren

Engineer 3, Comcast NBCUniversal
Chushi Ren is an Engineer 3 at Engineering Analysis at Comcast NBCUniversal. He received his MSE in Robotics from University of Pennsylvania. In school, Chushi’s research interest was on applying Machine Learning models to solve Computer Vision problems such as building a gesture recognition model using Kinect with one training sample. Chushi joined Comcast NBCUniversal as a CORE Technology Rotation Program associate before he took his... Read More →


Tuesday November 10, 2015 10:30am - 11:00am
Ramanujan Stage

11:00am

Top 10 Deep Learning Tips & Tricks - Arno Candel
This talk will present how to get the most out of H2O Deep Learning in practical examples. Topics include parameter tuning, data preprocessing, model comparison, exploiting data sparsity, ensembling techniques and many more.

Speakers

Arno Candel

Chief Architect, H2O.ai
Arno is the Chief Architect of H2O, a distributed and scalable open-source machine learning platform. He is also the main author of H2O’s Deep Learning. Before joining H2O, Arno was a founding Senior MTS at Skytree where he designed and implemented high-performance machine learning algorithms. He has over a decade of experience in HPC with C++/MPI and had access to the world’s largest supercomputers as a Staff Scientist at SLAC... Read More →


Tuesday November 10, 2015 11:00am - 11:30am
Erdos Stage

11:00am

Consensus Lasso - Stephen Boyd
TBD

Speakers

Prof. Stephen Boyd

Professor of Information Systems, Stanford University
Stephen P. Boyd is the Samsung Professor of Engineering, and Professor of Electrical Engineering in the Information Systems Laboratory at Stanford University. He has courtesy appointments in the Department of Management Science and Engineering and the Department of Computer Science, and is a member of the Institute for Computational and Mathematical Engineering. His current research focus is on convex optimization applications in control... Read More →


Tuesday November 10, 2015 11:00am - 12:00pm
Ramanujan Stage

11:30am

Quora: Machine learning algorithms to grow the world's knowledge - Xavier Amatriain
At Quora our mission is to “share and grow the world's knowledge”. We want to do this by getting the right questions to the right people to answer them, but also by getting the existing answers to people who are interested in them. In order to accomplish this we need to build a complex ecosystem where we value issues such as content quality, engagement, demand, interests, or reputation. It is not possible to build a system like this unless most of the processes are highly automated and scalable. We are fortunate, though, to have lots of very good quality data on which to build machine learning solutions that can help address all of the previous requirements.

In this talk I will describe some interesting uses of machine learning at Quora that range from different recommendation approaches such as personalized ranking to classifiers built to detect duplicate questions or spam. I will describe some of the modeling and feature engineering approaches that go into building these systems. I will also share some of the challenges faced when building such a large-scale knowledge base of human-generated knowledge.

Speakers

Xavier Amatriain

VP of Engineering, Quora
Xavier Amatriain is VP of Engineering at Quora, where he leads the team building the best source of knowledge on the Internet. With over 50 publications in different fields, Xavier is best known for his work on Machine Learning in general, and Recommender Systems in particular. Before Quora, he was Research/Engineering Director at Netflix, where he led the team building the famous Netflix Recommendation algorithms. Previously, Xavier was also... Read More →


Tuesday November 10, 2015 11:30am - 12:00pm
Erdos Stage

11:30am

Lunch - Food Trucks
Please head out to the food trucks downstairs

Tuesday November 10, 2015 11:30am - 1:30pm
Computer Science History Museum

12:00pm

Fireside Chat with Monica Rogati
Speakers

Monica Rogati

Data Science Advisor, Equity Partner at Data Collective, former VP of Data at Jawbone
Monica Rogati (@mrogati) is a leader in the field of data science. She built key data products and teams at Jawbone and LinkedIn; she is now a widely recognized advisor and speaker. | | As the VP of Data, Monica built Jawbone’s data science and engineering team, focusing on developing data products that helped millions lead healthier lives. Her team also analyzed Jawbone’s data to derive novel insights about sleep, movement and food... Read More →


Tuesday November 10, 2015 12:00pm - 1:00pm
Math Lounge

1:00pm

Machine Learning to Save Lives - Taposh Dutta Roy
Abstract: 
Clinical data science has been around since the advent of modern medicine. However, with new technology and data, we can now better predict outcomes. In this presentation we will discuss machine learning in health care in general and focus on Early Detection of Impending Physiological Disorder (EDIP). EDIP helps us predict an unplanned transfer to the ICU 12 hours before it happens.

Description: 
The ultimate goal of any knowledge is to impact human life. Machine learning and data mining have been at the forefront of this effort. A lot of effort has gone into explaining the benefits of data science in general. However, the ultimate goal is to save lives. In this presentation we will present the current state of predictive analytics used in health care and focus on one use case. That use case is Early Detection of Impending Physiological Disorder. In this use case we will discuss deployment of a predictive model for in-hospital deterioration.

Speakers

Taposh Dutta Roy

Health Data Project Lead/Scientist, Kaiser Permanente
Technical executive and advisor with a passion for turning data into actionable insights, meaningful stories and awesome products. I have a unique combination of product, technology and strategy consulting, data science and start-up experience. I am a consumer focused, machine learning and data science geek. | | https://www.kaggle.com/duttaroy | https://github.com/taposh/ | | Experienced in diverse domains :  | Biotech Start-ups... Read More →


Tuesday November 10, 2015 1:00pm - 1:30pm
Erdos Stage

1:00pm

Survey of available Machine Learning Frameworks - Brendan Herger
Learning a new modeling framework is time-consuming and doesn’t always pay off. However, as more feature engineering and modeling frameworks become available, it’s difficult not to leverage their abilities. Only interested in frameworks available in R? Need a large selection of clustering and regression algorithms? Limiting your training data set because your framework is bursting at the seams?

We’ll give an in-depth overview of the strengths, weaknesses and design logic of the top feature engineering and modeling frameworks available, and which of these frameworks justify pushing through the learning curve.

Speakers

Brendan Herger

Data Scientist, Capital One Financial Corporation
Brendan Herger is a Data Scientist, helping to bring large scale machine learning to Capital One. 


Tuesday November 10, 2015 1:00pm - 1:30pm
Ramanujan Stage

1:30pm

Advanced analytics at AT&T
TBD

Speakers

Martin McEnroe

Big Data Visualization Evangelist, AT&T
I've always wanted to know everything about everything. I've designed CMOS, I know how adders work, I know BIOS code, timing diagrams, start-up testing, disk head physics, software design, caching, visual basic, that's just PCs. I have picked apart so many things. But LinkedIn, I love. I love understanding what they are doing, the algorithms behind the matches, the maps, the recommendations. I love the sociological aspect of how we interact... Read More →


Tuesday November 10, 2015 1:30pm - 2:00pm
Ramanujan Stage

1:30pm

Big data machine learning with R - Nachum Shacham
TBD

Speakers

Nachum Shacham

Principal Data Scientist, PayPal
Nachum Shacham is a Principal Data Scientist at PayPal where he is working on modeling and extracting business value from large transactional, behavioral, and system performance datasets. Before, he was with eBay, analyzing performance of large data platforms. Prior, he was with SRI, leading research in internet technologies, generation of wireless internet and real-time voice and video communications over mobile networks. As co-founded CTO of... Read More →


Tuesday November 10, 2015 1:30pm - 2:00pm
Erdos Stage

2:00pm

Building a Better Forecast with H2O & Salesforce - Mark Masterson
The native Salesforce implementation of the Collaborative Forecast is excellent, and uses your existing Opportunity Pipeline data to determine your future revenue. However, it still relies on humans to take this information and interpret it, and plan for the future accordingly. What if you could harness the power of machine learning to assist you in this difficult decision making process? Join us as we explore the world of machine learning, and how it can integrate into the Salesforce Platform.

Speakers

Mark Masterson

Application Engineer, Kenandy Inc.
Mark is an Application Engineer at Kenandy Inc. building the next generation of ERP in the Cloud. In his spare time he likes to abuse his Safari Books Online membership and jam on his Telecaster (which may be considered abuse to everyone else's ears). Mark is also a very proud Dog Father.


Tuesday November 10, 2015 2:00pm - 2:30pm
Ramanujan Stage

2:00pm

Panel - Industrial Data Science - Practitioners' Perspective
Moderators

SriSatish Ambati

CEO and co-founder, H2O.ai
Sri is co-founder and CEO of H2O (@h2oai), the builders of H2O. H2O democratizes big data science and makes Hadoop do math for better predictions. Before H2O, Sri spent time scaling R over big data with researchers at Purdue and Stanford. Prior to that, Sri co-founded Platfora and was the Director of Engineering at DataStax. Before that, Sri was Partner & Performance Engineer at the Java multi-core startup Azul Systems, tinkering with the... Read More →

Speakers

Xavier Amatriain

VP of Engineering, Quora
Xavier Amatriain is VP of Engineering at Quora, where he leads the team building the best source of knowledge on the Internet. With over 50 publications in different fields, Xavier is best known for his work on Machine Learning in general, and Recommender Systems in particular. Before Quora, he was Research/Engineering Director at Netflix, where he led the team building the famous Netflix Recommendation algorithms. Previously, Xavier was also... Read More →

Scott Marsh

Research and Development Analyst, Progressive
Research and Development Analyst at Progressive Insurance

Taposh Dutta Roy

Health Data Project Lead/Scientist, Kaiser Permanente
Technical executive and advisor with a passion for turning data into actionable insights, meaningful stories and awesome products. I have a unique combination of product, technology and strategy consulting, data science and start-up experience. I am a consumer focused, machine learning and data science geek. | | https://www.kaggle.com/duttaroy | https://github.com/taposh/ | | Experienced in diverse domains :  | Biotech Start-ups... Read More →

Nachum Shacham

Principal Data Scientist, PayPal
Nachum Shacham is a Principal Data Scientist at PayPal where he is working on modeling and extracting business value from large transactional, behavioral, and system performance datasets. Before, he was with eBay, analyzing performance of large data platforms. Prior, he was with SRI, leading research in internet technologies, generation of wireless internet and real-time voice and video communications over mobile networks. As co-founded CTO of... Read More →

Daqing Zhao

Director of Advanced Analytics, Macy's
Daqing Zhao has over 20 years of experience in advanced data analytics. Trained in physical sciences, he gained extensive expertise in customer-centric marketing and in optimization of all stages of customer acquisition, conversion and retention. He worked on predictive modeling for banner ads, web behavior, search keywords, 1:1 emails, customer lifetime values, real-time recommendation engines, and others. Daqing is Director of Advanced... Read More →


Tuesday November 10, 2015 2:00pm - 3:00pm
Erdos Stage

2:30pm

Self Guiding Applications - Venkatesh Yadav
In this talk we will share the idea of developing self-guiding applications that provide the most engaging user experience possible using crowdsourced knowledge. We will discuss how historical usage data can be mined using machine learning to identify application usage patterns and generate probable next actions.

Speakers

Venkatesh Yadav

Sr. Manager, Software Engineering, Adobe
Venkatesh Yadav (@venky7) is an innovative entrepreneurial thought leader, who is passionate about technology and entrepreneurship. He is a disciplined and driven technologist with a focus on value creation and team building. His areas of interest include Big Data, Cloud Technologies, Internet of Things and Mobile Application Development. | As a technology leader @ Adobe, Primarily focussed on Master Data Management and Data Science in... Read More →


Tuesday November 10, 2015 2:30pm - 3:00pm
Ramanujan Stage

3:00pm

From Insight to Action with H2O and TIBCO's Fast Data Platform - Michael O'Connell
Learn how H2O's machine learning engine seamlessly integrates with TIBCO Spotfire - a premier data discovery, visualization and analytics platform; and with TIBCO Enterprise Runtime for R (TERR) - an enterprise-class environment for running R scripts and packages, to help organizations move from insights to action on big data. The presentation will include demos of Spotfire, TERR and H2O working together. 

Speakers

Anna Maria Nowakowska

Data Scientist, TIBCO Software, Inc.

Michael O'Connell

Chief Data Scientist, TIBCO Software Inc.
Michael O'Connell is Chief Data Scientist at TIBCO Software, developing analytic solutions across a number of industries including Financial Services, Energy, Life Sciences, Consumer Goods & Retail, and Telco, Media & Networks. He has been working on statistical software applications for the past 20 years, and has published more than 50 papers and several software packages on statistical methods. Michael did his... Read More →


Tuesday November 10, 2015 3:00pm - 3:30pm
Erdos Stage

3:00pm

Using Google Analytics for data-driven marketing - Dan Stone
Machine learning in Google Analytics to provide automatic insights about which users are most likely to convert. 

Speakers

Dan Stone

Product Manager, Google Analytics
Dan is the lead product manager for user centric analysis and audience marketing at Google Analytics. Prior to joining Google, Dan's experience ranges from venture-funded consumer technology startups, where he has led product and user acquisition to working with top 100 companies in the telecommunications, media, and technology industries, where he has led advanced analytics, sentiment analysis, and marketing optimization. Dan holds a B.S. in... Read More →


Tuesday November 10, 2015 3:00pm - 3:30pm
Ramanujan Stage

3:30pm

Coffee
Tuesday November 10, 2015 3:30pm - 4:00pm
Computer Science History Museum

4:00pm

Munging, modeling and pipelines using Python - Hank
H2O version 3 introduces a new Python API. Designed for data scientists accustomed to the PyData ecosystem, this new H2O interface brings the power of distributed and parallel processing to the Pythonic data scientist. This new API will be presented using a motivating example of predicting jet turbo-fan remaining useful life based on sensor measurements.

Speakers

Hank Roark

Data Scientist and Hacker, H2O.ai
Hank is a Data Scientist & Hacker at H2O. Hank comes to H2O with a background turning data into products and system solutions and loves helping others find value in their data. He has a deep background in the application domains of telematics, remote sensing, logistics, manufacturing, agriculture, and the Internet of Things. Before becoming passionate about machine intelligence, Hank managed international software teams and worked as IT... Read More →


Tuesday November 10, 2015 4:00pm - 4:30pm
Ramanujan Stage

4:00pm

Panel - Future is Now
Moderators

SriSatish Ambati

CEO and co-founder, H2O.ai
Sri is co-founder and CEO of H2O (@h2oai), the builders of H2O. H2O democratizes big data science and makes Hadoop do math for better predictions. Before H2O, Sri spent time scaling R over big data with researchers at Purdue and Stanford. Prior to that, Sri co-founded Platfora and was the Director of Engineering at DataStax. Before that, Sri was Partner & Performance Engineer at the Java multi-core startup Azul Systems, tinkering with the... Read More →

Speakers

Jishnu Bhattacharjee

Managing Director, Nexus Venture Partners
Jishnu brings to Nexus several years of operating and investing expertise in hi-technology start-ups. He has invested in technology, internet and technology enabled services companies and is interested in a wide range of start-ups. Jishnu is currently on the boards of Vdopia, Druva, GenWi, helpshift, Unmetric, Oxdata, Elasticbox, Biz2Credit and observes the board of Kaltura. His investments also include Cloud.com (acquired by Citrix) and... Read More →

Prof. Stephen Boyd

Professor of Information Systems, Stanford University
Stephen P. Boyd is the Samsung Professor of Engineering, and Professor of Electrical Engineering in the Information Systems Laboratory at Stanford University. He has courtesy appointments in the Department of Management Science and Engineering and the Department of Computer Science, and is a member of the Institute for Computational and Mathematical Engineering. His current research focus is on convex optimization applications in control... Read More →

Cliff Click

CTO and co-founder, H2O.ai
Cliff Click is the CTO and Co-Founder of H2O, makers of H2O, the opensource math and machine learning engine for Big Data. Cliff wrote his first compiler when he was 15 (Pascal to TRS Z-80!), although Cliff’s most famous compiler is the HotSpot Server Compiler (the Sea of Nodes IR). Cliff helped Azul Systems build an 864 core pure-Java mainframe that keeps GC pauses on 500Gb heaps to under 10ms, and worked on all aspects of that... Read More →

Chris Pouliot

VP, Data Science, Lyft
As a VP of Data Science at Lyft, Chris Pouliot is a real-life rocket scientist who has also spun astronauts until they were motion sick, split atoms to make an aircraft carrier go fast, provided insightful analysis that led Google to change their top ad color, and helped Netflix determine what movies and TV shows to buy, and how much they should pay. He helped advise many Cowboy portfolio companies on analytics and data infrastructure and... Read More →

Monica Rogati

Data Science Advisor, Equity Partner at Data Collective, former VP of Data at Jawbone
Monica Rogati (@mrogati) is a leader in the field of data science. She built key data products and teams at Jawbone and LinkedIn; she is now a widely recognized advisor and speaker. | | As the VP of Data, Monica built Jawbone’s data science and engineering team, focusing on developing data products that helped millions lead healthier lives. Her team also analyzed Jawbone’s data to derive novel insights about sleep, movement and food... Read More →

Dave Whiting

Head of Data Science Analysis, Financial Services, Capital One
Dave is experienced in predictive analytics and statistical analysis using frequentist and Bayesian approaches. His specialty is in computationally intensive analyses due to large data size and/or complex methodology. Dave's statistical expertise includes experimental design, multivariate models, mixed models, GLMM, GAM, semi- and non-parametric models, correlated data analysis (repeated measures, time series, spatial statistics), CART... Read More →


Tuesday November 10, 2015 4:00pm - 5:00pm
Erdos Stage

4:30pm

Collaborative, reproducible research with H2O - Nick Elprin
Reproducibility and collaboration are important best practices for many data science teams. They accelerate the pace of research, and, in certain industries, they help meet regulatory requirements around auditing and transparency. In this talk, we'll describe a handful of principles, tools, and technical tips for applying best practices around reproducibility and collaboration to your H2O workflows.

Speakers

Nick Elprin

CEO and Co-Founder, Domino Data Lab
Nick Elprin is a co-founder and CEO of Domino. Before starting Domino, Nick was a senior technologist and technology manager at a large hedge fund, where he managed a team that designed, developed, and delivered the firm’s next-generation research platform. He holds a BA and MS in computer science from Harvard.


Tuesday November 10, 2015 4:30pm - 5:00pm
Ramanujan Stage

5:00pm

Learning how humans and non-humans interact with digital ads - Sergei Izrailev
After spending significant resources on creating products and developing great stories about them, brands use advertising to share these stories with their audience. In digital advertising, it is often implicitly assumed that ads are delivered to the desired audience effectively; however, this is rarely the case. At the very least, digital ads must be delivered in the right geography, in an environment that doesn’t hurt the brand, and be actually viewed by a human. But even this is not enough for the brand’s message to get across. This talk will discuss how large-scale analysis of the interaction of humans and bots with the ads helps us ensure that the advertisers' stories are heard by the intended audience.

Speakers

Sergei Izrailev

SVP, Data Science, Integral Ad Science
Dr. Sergei Izrailev is SVP, Data Science at Integral Ad Science, where he is focusing on advancing Integral’s media quality measurement and programmatic advertising product offerings, as well as developing and scaling the data science team, which is one of the largest among similar size companies. Before Integral, he led the data science team at Collective and was responsible for the data science strategy and execution, as well as for the... Read More →


Tuesday November 10, 2015 5:00pm - 5:20pm
Erdos Stage

5:00pm

Data Science in Action @ 6sense
The talk will cover areas like:


  • Reusing most of your model building pipeline in production.

  • Using H2O, Docker, Mesos, Hive to productionize 100s of models each day.

  • Running multiple models on the same dataset to monitor performance.


Speakers

Viral Bajaria

CTO & Co-Founder, 6sense
At 6sense, Viral leads the development of 6sense’s innovative analytics and predictive platform. Every day, Viral wakes up to realize his passion of using machine learning to help find buyers that are in-market and level the playing field between marketing teams with small or big budgets. Prior to 6sense, Viral built the big data platform at Hulu that processed over 3 billion events per day. He was an early adopter of Hadoop (early 2009), and... Read More →


Tuesday November 10, 2015 5:00pm - 5:30pm
Ramanujan Stage

5:20pm

What do you need before you can do predictive analysis - Maria & Peter
Before you can do predictive analysis, you need to be able to collect, model, and analyze your data. What can that look like? What are the engineering challenges? What data infrastructure questions need to be answered while planning for predictive analytics? What are the business and organizational challenges? How do you get everyone on board? All of these questions need to be answered before you can take action on the data. We'll show you how Keen can be part of that solution.

Speakers

Maria Dumanis

Data Engineer and Account Manager, Keen IO
Maria Dumanis is a Data Engineer and Account Manager at Keen IO, a platform that delivers fast and flexible data analytics. Her main focus has been to understand customers' business needs and figure out the best way they can be more data driven in their decision making. She strives to make her customers and team members happy and informed! | | | Abstract: | Before one can do predictive analysis, you need to be able to collect, model... Read More →

Peter Nachbaur

Analytics Platform Architect, Keen IO
Peter Nachbaur wears several hats for Keen IO in San Francisco building Analytics as a Service and a self-governing organization. Peter focuses on scaling the data platform and internal culture. Recently he designed and built Query Caching while orchestrating Keen Family Day for everyone to bring their loved ones out for games. He strives to make products smarter and people happier. His favorite hat is a plastic viking helmet.


Tuesday November 10, 2015 5:20pm - 5:40pm
Erdos Stage

5:30pm

Panel - Last Mile Delivery of Data Science
Moderators

Kanishk

VP of Customer Applications, H2O.ai
Kanishk Priyadarshi is a High-tech and Business Analytics executive based out of San Francisco and Silicon Valley.  He is currently focused on Machine Learning and AI Apps for the "Intelligent Enterprise" and the "Internet of Things".  He has advised companies such as eBay, Google, Zynga, HP, Cisco, and Intel on how to make better use of their data.

Speakers

Prasanta Behara

SVP Engineering, ScoreData
Technical leader with deep background and experience in building enterprise, internet software and Big Data products. Proven record of building products from the concept stage and successfully launching them in the marketplace. | | Experience: | - Startup as well as large company experience | - Built the ShareThis real-time Advertising team with a team of Data scientists and engineers and scaled it to handle billions of bids per day with few 10s... Read More →

Henri Dwyer

Data Scientist, Dataiku
Henri Dwyer is a data scientist and engineer working on building the best platform for data scientists at Dataiku. He received an MSc in Engineering from Columbia University and a BS and an MS in Engineering from Ecole Polytechnique in Paris. He is now living in New York City and is always keen on discovering new data science problems to solve.

Seshu Guddanti

Seshu Guddanti is Sr. Director of Ad-Tech at Zynga. He leads Zynga’s ad-delivery engine and is responsible for roadmap and strategic direction. Previously, he was in management consulting as Sr. Manager, Analytics in Accenture, where he advised clients in enhancing marketing analytics by data science and analytical infrastructure. At Adobe, Seshu Guddanti consulted with leading Fortune 500 companies in areas of big data, digital... Read More →

Hassan Namarvar

Senior Principal Data Scientist, Oracle
Hassan is a Senior Principal Data Scientist at Oracle. Before joining Oracle Hassan was a Principal Applied Scientist at eBay where he worked on large-scale machine-learned ranking algorithms to improve eBay search quality. Prior to that, Hassan held Principal Data Scientist positions at ShareThis Inc where he worked on online advertising optimization by modeling real time bidding transactional and user behavior datasets using large-scale machine... Read More →


Tuesday November 10, 2015 5:30pm - 6:30pm
Ramanujan Stage

5:40pm

H2O for Genomics - Hussam Al-Deen Ashab
Genomic data provide a unique source of information for personalized patient treatment management. By analyzing large amounts of complex genomic data, GenomeDx is developing tools that simplify the way patient care is managed. GenomeDx's mission is to transform patient care by putting usable genomic information in the hands of physicians and patients. Using machine learning to capture genomic fingerprints of cancer promises to bring genomic data from bench to clinic.

This talk will present how H2O can be utilized to develop tools that simplify the way patient care can be managed.

Speakers

Hussam Al-Deen Ashab

Bioinformatician, GenomeDx Biosciences Inc.
Hussam is a Bioinformatician on the research and development team at GenomeDx Biosciences Inc., a Vancouver-based company working on urological cancer genomics. He is working on the design and implementation of statistical and predictive models to develop decision support systems that improve patients' healthcare and provide personalized molecular insights on the tumor of each patient. At GenomeDx, he was able to develop several machine... Read More →


Tuesday November 10, 2015 5:40pm - 6:10pm
Erdos Stage

6:10pm

Analytics driven scheduling and orchestration for distributed systems : A DNS Use Case - Rishiraj Pravahan
This talk highlights data-driven scheduling and orchestration of Docker containers for a distributed app environment. It will focus on simple classification techniques to identify states and anomalies using time series data.

Speakers

Rishiraj Pravahan

Principal Data Scientist/Engineer, AT&T Inc
Rishiraj Pravahan is a data scientist working for AT&T. Prior to joining AT&T, Rishiraj worked for the ATLAS experiment at CERN, where he was part of the team that discovered the Higgs boson. While at CERN, he worked on constructing, commissioning and calibrating the ATLAS detector as well as on software techniques to analyze the massive dataset from the Large Hadron Collider to search for new physics. He has also been a passionate... Read More →


Tuesday November 10, 2015 6:10pm - 6:30pm
Erdos Stage
 
Wednesday, November 11
 

8:00am

Registration & Breakfast
Food and introduction

Wednesday November 11, 2015 8:00am - 9:00am
Computer Science History Museum

9:00am

Panel - Culture of Data Driven Decision Making - Livestream
Moderators

SriSatish Ambati

CEO and co-founder, H2O.ai
Sri is co-founder and CEO of H2O (@h2oai), the builders of H2O. H2O democratizes big data science and makes Hadoop do math for better predictions. Before H2O, Sri spent time scaling R over big data with researchers at Purdue and Stanford. Prior to that, Sri co-founded Platfora and was the Director of Engineering at DataStax. Before that, Sri was Partner & Performance engineer at Java multi-core startup Azul Systems, tinkering with the... Read More →

Speakers

Allison Baker

Allison is a data scientist in the clinical services group at HCA.

Arno Candel

Chief Architect, H2O.ai
Arno is the Chief Architect of H2O, a distributed and scalable open-source machine learning platform. He is also the main author of H2O’s Deep Learning. Before joining H2O, Arno was a founding Senior MTS at Skytree where he designed and implemented high-performance machine learning algorithms. He has over a decade of experience in HPC with C++/MPI and had access to the world’s largest supercomputers as a Staff Scientist at SLAC... Read More →

Mark Landry

Competition Data Scientist & Product Manager, H2O.ai
Mark Landry is a competition data scientist and product manager at H2O. He enjoys testing ideas in Kaggle competitions, where he is ranked in the top 100 in the world (top 0.03%) and well-trained in getting quick solutions to iterate over. Most at home in SQL, he found H2O through hacking in R. Interests are multi-model architectures and helping the world make fewer models that perform worse than the mean.

Erin LeDell

Data Scientist & Hacker, H2O.ai
Erin is a Statistician and Machine Learning Scientist at H2O.ai. She is the main author of H2O Ensemble. Before joining H2O, she was the Principal Data Scientist at Wise.io and Marvin Mobile Security (acquired by Veracode in 2012) and the founder of DataScientific, Inc. Erin received her Ph.D. in Biostatistics with a Designated Emphasis in Computational Science and Engineering from University of California, Berkeley. Her research focuses on... Read More →

Martin McEnroe

Big Data Visualization Evangelist, AT&T
I've always wanted to know everything about everything. I've designed CMOS, I know how adders work, I know BIOS code, timing diagrams, start-up testing, disk head physics, software design, caching, visual basic, that's just PCs. I have picked apart so many things. But LinkedIn, I love. I love understanding what they are doing, the algorithms behind the matches, the maps, the recommendations. I love the sociological aspect of how we interact... Read More →

Szilard Pafka

Chief Data Scientist, Epoch
Szilard studied Physics in the 90s and obtained a PhD by using statistical methods to investigate the risk of financial portfolios. He then worked in a bank, quantifying and managing market risk. About a decade ago he moved to California to become the Chief Scientist of a credit card processing company doing everything data (ETL, analysis, modeling, visualization, machine learning etc). He is also the founder/organizer of several data... Read More →

Chris Severs

Data Scientist/Manager, eBay
Ph.D. mathematician and accomplished software engineer. Experience with data mining at a massive scale, machine learning and applied statistics. Contributor to numerous open source projects. Specialties: mathematics, programming, data analysis, machine learning.

Magnus Stensmo

Principal Research Scientist, Exabeam
Research and development from prototypes to production code in machine learning, information retrieval, data and text mining, categorization, clustering, collaborative filtering, neural networks, unstructured knowledge, information and data management.

Martin Tobias

Martin is a data scientist in the clinical services group at HCA.

Daqing Zhao

Director of Advanced Analytics, Macy's
Daqing Zhao has over 20 years of experience in advanced data analytics. Trained in physical sciences, he gained extensive expertise in customer-centric marketing and in optimization of all stages of customer acquisition, conversion and retention. He worked on predictive modeling for banner ads, web behavior, search keywords, 1:1 emails, customer lifetime values, real time recommendation engines, and others. Daqing is Director of Advanced... Read More →


Wednesday November 11, 2015 9:00am - 10:00am
Ramanujan Stage

9:00am

Panel - Culture of Data Driven Decision Making - Operationalizing Insights
Moderators

SriSatish Ambati

CEO and co-founder, H2O.ai
Sri is co-founder and CEO of H2O (@h2oai), the builders of H2O. H2O democratizes big data science and makes Hadoop do math for better predictions. Before H2O, Sri spent time scaling R over big data with researchers at Purdue and Stanford. Prior to that, Sri co-founded Platfora and was the Director of Engineering at DataStax. Before that, Sri was Partner & Performance engineer at Java multi-core startup Azul Systems, tinkering with the... Read More →

Speakers

Arno Candel

Chief Architect, H2O.ai
Arno is the Chief Architect of H2O, a distributed and scalable open-source machine learning platform. He is also the main author of H2O’s Deep Learning. Before joining H2O, Arno was a founding Senior MTS at Skytree where he designed and implemented high-performance machine learning algorithms. He has over a decade of experience in HPC with C++/MPI and had access to the world’s largest supercomputers as a Staff Scientist at SLAC... Read More →

Cliff Click

CTO and co-founder, H2O.ai
Cliff Click is the CTO and Co-Founder of H2O, makers of H2O, the open-source math and machine learning engine for Big Data. Cliff wrote his first compiler when he was 15 (Pascal to TRS Z-80!), although Cliff’s most famous compiler is the HotSpot Server Compiler (the Sea of Nodes IR). Cliff helped Azul Systems build an 864-core pure-Java mainframe that keeps GC pauses on 500 GB heaps to under 10ms, and worked on all aspects of that... Read More →

Abhay Dabholkar

Principal - Big Data Software Engineer, AT&T
Working in the IT industry since 1991 in different roles including Lead Big Data Engineer, Solution Architect, Software Engineer, Application Performance Specialist, Web Security Architect and Ethical Hacker. Passionate about large distributed systems, search & realtime analytics. Successfully developed, implemented, deployed & managed predictive Machine Learning algorithms using Hadoop, H2O & R. I have spent a number of years... Read More →

Mark Landry

Competition Data Scientist & Product Manager, H2O.ai
Mark Landry is a competition data scientist and product manager at H2O. He enjoys testing ideas in Kaggle competitions, where he is ranked in the top 100 in the world (top 0.03%) and well-trained in getting quick solutions to iterate over. Most at home in SQL, he found H2O through hacking in R. Interests are multi-model architectures and helping the world make fewer models that perform worse than the mean.

Erin LeDell

Data Scientist & Hacker, H2O.ai
Erin is a Statistician and Machine Learning Scientist at H2O.ai. She is the main author of H2O Ensemble. Before joining H2O, she was the Principal Data Scientist at Wise.io and Marvin Mobile Security (acquired by Veracode in 2012) and the founder of DataScientific, Inc. Erin received her Ph.D. in Biostatistics with a Designated Emphasis in Computational Science and Engineering from University of California, Berkeley. Her research focuses on... Read More →

Martin McEnroe

Big Data Visualization Evangelist, AT&T
I've always wanted to know everything about everything. I've designed CMOS, I know how adders work, I know BIOS code, timing diagrams, start-up testing, disk head physics, software design, caching, visual basic, that's just PCs. I have picked apart so many things. But LinkedIn, I love. I love understanding what they are doing, the algorithms behind the matches, the maps, the recommendations. I love the sociological aspect of how we interact... Read More →

Xavier Quintuna

Xavier Quintuna is the principal Big Data architect for Orange Silicon Valley, a subsidiary of the global telecom provider Orange. He has developed Big Data solutions for Call Detail Records, Quality of Service, CDNs, Marketing and Fraud across Orange business groups. He began working with Hadoop in 2008 to explore how it could be used to better support the telecom industry.


Wednesday November 11, 2015 9:00am - 10:00am
Erdos Stage

10:00am

Translating advanced analytics for business users
Developing analytics for business users involves identifying the analytics consumer, hiring the right talent, developing the analytics collaboratively, and telling a story with the data. In this talk, we’ll explore strategies for approaching each of these steps and solving for them in modern business organizations.

Speakers

Conor Jensen

Analytics Program Director, Zurich North America
Conor Jensen is the Analytics Program Director for Zurich North America’s Predictive Analytics unit. He led the implementation of a new data infrastructure in addition to building the new team and refining the analytic development process. Conor has spent nearly a decade working in the analytics space, both in model development and as a model user. Previously, Conor was part of an internal strategy consulting practice at Zurich and... Read More →


Wednesday November 11, 2015 10:00am - 10:30am
Ramanujan Stage

10:00am

Machine Intelligence and Data Products - Hilary Mason
What will the next generation of data products look like? What are the most impactful techniques that are not yet widely used? This talk will explore a few new and exciting algorithms and look at the role that data scientists can play in creating the next generation of exciting products.

Speakers

Hilary Mason

Founder, Fast Forward Labs
Hilary Mason is the Founder of Fast Forward Labs, a machine intelligence research company, and the Data Scientist in Residence at Accel. Previously, she was the Chief Scientist at bitly, where she led a team that studied attention on the internet in realtime, doing a mix of research, exploration, and engineering. Hilary also co-founded HackNY, co-hosts DataGotham, and is a member of NYCResistor. Hilary is an advisor to several... Read More →


Wednesday November 11, 2015 10:00am - 11:00am
Erdos Stage

10:30am

Generalized low rank models
Across business and research, analysts seek to understand large collections of data organized as a table with numeric, Boolean, and categorical values. Many entries in the table may be noisy or even missing altogether. Low rank models facilitate understanding of tabular data by producing a condensed vector representation for every row and column in the data set. These representations can then be compared, clustered, plotted, and used in subsequent analysis. In this presentation, we will describe what a low rank model is and demonstrate how to build them in H2O. Through examples, we'll see how to fit low rank models to numeric and categorical data sets with missing values, and how to use these models to identify important features and make better predictions.

Speakers

Anqi Fu

Math Hacker, H2O.ai
Anqi is a math hacker at H2O, where she implements and tests distributed machine learning algorithms. Anqi worked on network security algorithms with the founder of RioRey, and spent summers conducting physics research at the Naval Research Laboratory and NIST. Anqi holds Master's degrees in Statistics and Economics from Stanford University, and a Bachelor's in Electrical Engineering from the University of Maryland, College Park.

Madeleine Udell

Postdoctoral Fellow, Caltech Center for the Mathematics of Information
Madeleine Udell is a postdoctoral fellow at Caltech's Center for the Mathematics of Information, hosted by Joel Tropp. She will be joining Cornell as an Assistant Professor in the School of Operations Research and Information Engineering in July 2016. Her research focus is on modeling and solving large-scale optimization problems and on finding and exploiting structure in high dimensional data, with applications in marketing, demographic... Read More →


Wednesday November 11, 2015 10:30am - 11:00am
Ramanujan Stage

11:00am

What it will take to be a successful company in the new Digital Age - Michael Marks
Speakers

Michael Marks

Partner, Board of Director H2O.ai, Paxion
Michael Marks is currently a general partner at WRV II, a technology venture fund; Bigwood II, a non-technology venture fund; and Riverwood Capital, a private equity fund which invests in rapidly growing private companies in North America and in emerging markets. Previously, he was a Partner and Senior Advisor at Kohlberg Kravis Roberts & Co. in 2006 and 2007. And before KKR, he spent 13 years as CEO of... Read More →


Wednesday November 11, 2015 11:00am - 11:30am
Erdos Stage

11:00am

Cancer detection via the lasso and customized training
I will review the lasso method for high-dimensional supervised learning and discuss an example of its utility in cancer diagnosis via mass spectrometry. I will also discuss "customized training" for improving the classifier performance.

Speakers

Prof. Rob Tibshirani

Professor of Health Research and Policy, and Statistics, Stanford University
Robert Tibshirani's main interests are in applied statistics, biostatistics, and data mining. He is co-author of the books Generalized Additive Models (with T. Hastie), An Introduction to the Bootstrap (with B. Efron), and Elements of Statistical Learning (with T. Hastie and J. Friedman). His current research focuses on problems in biology and genomics, medicine, and industry. With collaborator Balasubramanian Narasimhan, he also develops... Read More →


Wednesday November 11, 2015 11:00am - 12:00pm
Ramanujan Stage

11:30am

Keynote: How to Approach Clinical Prediction - Jason P Jones
Everything I ever needed to know about clinical prediction, I learned from the Titanic.  Clinical data often demonstrate (a) non-linearity, (b) informative missingness, and (c) important interactions.  We will illustrate these points with the fabulous Titanic dataset and demonstrate how to approach both clinical prediction and shared decision making.

Speakers

Jason P Jones

Vice President, Information Strategy for Care Transformation, Kaiser Permanente
Jason Jones, PhD, is Vice President, Information Strategy for Care Transformation. In this role, he supports patient care delivery by focusing on building tools and capacity for organizational decision making and point of care decision support involving predictive analytics. Organizational decision making includes identifying areas of opportunity and how improvement will be measured (should we aim to improve... Read More →


Wednesday November 11, 2015 11:30am - 12:00pm
Erdos Stage

11:30am

Lunch - Food Trucks
Please head out to the food trucks downstairs

Wednesday November 11, 2015 11:30am - 1:30pm
Computer Science History Museum

12:00pm

Booksigning with Hilary Mason
Speakers

Hilary Mason

Founder, Fast Forward Labs
Hilary Mason is the Founder of Fast Forward Labs, a machine intelligence research company, and the Data Scientist in Residence at Accel. Previously, she was the Chief Scientist at bitly, where she led a team that studied attention on the internet in realtime, doing a mix of research, exploration, and engineering. Hilary also co-founded HackNY, co-hosts DataGotham, and is a member of NYCResistor. Hilary is an advisor to several... Read More →


Wednesday November 11, 2015 12:00pm - 1:00pm
Math Lounge

12:00pm

Booksigning with Tibshirani
Wednesday November 11, 2015 12:00pm - 1:00pm
Rotunda

1:00pm

Transamerica product recommender platform - Vishal Bamba
Description:

Transamerica built a product recommendation system that can be leveraged across multiple distribution channels to recommend products that serve customer needs and reduce complexity. This session highlights the machine learning technology, models, and big data architecture that enabled us to build the product recommender platform.

Abstract:

Transamerica sells a wide array of products in the insurance, investment and retirement space. Due to the complexity and variety of our products, selecting the right product to serve a customer's needs is challenging. We built a recommendation engine to guide the customer and simplify the customer experience. The recommendation engine is also leveraged by marketing and customer service teams for cross-sell and upsell offers.

We developed a customer data hub providing a 360-degree view of the customer on Cloudera’s Hadoop platform, with data feeding from all our administration systems. Recommendation models were built using the H2O machine learning platform on our customer data hub.

In this presentation, we will focus specifically on our machine learning models and the data challenges our engineering team faced. The presentation will also highlight the architecture of the customer data hub.

Speakers

Vishal Bamba

Vice President Strategy & Architecture, Transamerica
Vishal Bamba is the current VP of Strategy & Architecture at Transamerica. Prior to becoming VP, Vishal was the Chief Architect and Director of Enterprise Architecture at Transamerica for 12 years. Vishal led teams in implementing cutting-edge big data and machine learning platforms, as well as crafting a strategy to optimize these new tools.

Nitin Prabhu

Manager, Innovation & Architecture, Transamerica


Wednesday November 11, 2015 1:00pm - 1:30pm
Ramanujan Stage

1:00pm

Panel - Algorithms - Design & Application Gotchas
Algorithms are changing the world!

How algorithms work: trade-offs when choosing algorithms in practice.
Design trade-offs in the implementation of algorithms.

Moderators

Mark Landry

Competition Data Scientist & Product Manager, H2O.ai
Mark Landry is a competition data scientist and product manager at H2O. He enjoys testing ideas in Kaggle competitions, where he is ranked in the top 100 in the world (top 0.03%) and well-trained in getting quick solutions to iterate over. Most at home in SQL, he found H2O through hacking in R. Interests are multi-model architectures and helping the world make fewer models that perform worse than the mean.

Speakers

Arno Candel

Chief Architect, H2O.ai
Arno is the Chief Architect of H2O, a distributed and scalable open-source machine learning platform. He is also the main author of H2O’s Deep Learning. Before joining H2O, Arno was a founding Senior MTS at Skytree where he designed and implemented high-performance machine learning algorithms. He has over a decade of experience in HPC with C++/MPI and had access to the world’s largest supercomputers as a Staff Scientist at SLAC... Read More →

Erin LeDell

Data Scientist & Hacker, H2O.ai
Erin is a Statistician and Machine Learning Scientist at H2O.ai. She is the main author of H2O Ensemble. Before joining H2O, she was the Principal Data Scientist at Wise.io and Marvin Mobile Security (acquired by Veracode in 2012) and the founder of DataScientific, Inc. Erin received her Ph.D. in Biostatistics with a Designated Emphasis in Computational Science and Engineering from University of California, Berkeley. Her research focuses on... Read More →

Tomas Nykodym

Engineer & Hacker, H2O.ai
Tomas is our resident Software Engineer. He received his Master's degree from the Czech Technical University. Tomas has worked at IBM Research and Agent-Technology Group. He has participated in several projects related to malware detection/protection funded by the US Air Force. Specifically, he developed a system for modeling software behavior using compressed graphs of the system calls made on the system. Tomas also created a sandbox with simulated... Read More →

Prof. Rob Tibshirani

Professor of Health Research and Policy, and Statistics, Stanford University
Robert Tibshirani's main interests are in applied statistics, biostatistics, and data mining. He is co-author of the books Generalized Additive Models (with T. Hastie), An Introduction to the Bootstrap (with B. Efron), and Elements of Statistical Learning (with T. Hastie and J. Friedman). His current research focuses on problems in biology and genomics, medicine, and industry. With collaborator Balasubramanian Narasimhan, he also develops... Read More →

Madeleine Udell

Postdoctoral Fellow, Caltech Center for the Mathematics of Information
Madeleine Udell is a postdoctoral fellow at Caltech's Center for the Mathematics of Information, hosted by Joel Tropp. She will be joining Cornell as an Assistant Professor in the School of Operations Research and Information Engineering in July 2016. Her research focus is on modeling and solving large-scale optimization problems and on finding and exploiting structure in high dimensional data, with applications in marketing, demographic... Read More →


Wednesday November 11, 2015 1:00pm - 2:00pm
Erdos Stage

1:30pm

A Look Under Progressive's Big Data Hood - Pawan Divakarla
Progressive’s reverence for data is an idea and a promise that’s been woven into the company’s culture for decades. Progressive Insurance’s Pawan Divakarla will talk about trends in utilizing and reacting to data when and where it’s most effective.

Speakers

Pawan Divakarla

Data and Analytics Business Leader, Progressive Casualty Insurance Company

Brian Durkin

Innovation Enablement Services, Progressive
Brian Durkin is an innovation strategist in Progressive's Enterprise Architecture Organization. Throughout his eleven years at Progressive he has played many roles, ranging from application developer to enterprise architecture consultant; the common thread being a passion for making data more useful. He is currently part of the product research and development team focusing on geospatial analytics for usage based insurance where he uses... Read More →


Wednesday November 11, 2015 1:30pm - 2:00pm
Ramanujan Stage

2:00pm

PAAS : Predictive Analytics offered as a Service - Prateem Mandal
MarketShare provides a marketing analytics technology platform for the world’s biggest brands, helping them to improve the impact of marketing on revenue. Learn about the challenges of building a multi-tenant analytics platform and how MarketShare productized analytics as a service. Doing so allowed them to scale models to petabytes of data and tens of thousands of variables/parameters, produce models tailored to each customer's data, and deploy these models through an interactive web interface with sub-second response times.

The speaker will cover:

· The challenges of building models at scale

· The differences between an analytics service offered as a product and one offered in a consultancy practice

· How to find and evaluate enterprise partners to build an advanced analytics platform

· Before and after: improvements to costs, performance, reliability, and availability

Speakers

Prateem Mandal

Technical Lead Architect, MarketShare


Wednesday November 11, 2015 2:00pm - 2:30pm
Ramanujan Stage

2:00pm

Sparkling Water on the Spark Notebook: Interactive Genomes clustering - Xavier Tordoir
H2O provides advanced machine learning capabilities that scale to large datasets. Interoperating between H2O and general-purpose large-scale data manipulation frameworks like Apache Spark is of utmost importance for data scientists' efficiency, and this is where Sparkling Water shines. The last piece is working interactively on data from a single environment, allowing data scientists to share their results and code. We present the Spark Notebook working with Sparkling Water to bring the H2O libraries to the Spark environment. We show a case of genomics data processing: Spark and its genomics library ADAM give efficient access to raw data through domain-specific objects, data preparation is done with Spark, and deep learning from H2O is used to compute a model for population stratification within the set of genomes under investigation.

Speakers

Xavier Tordoir

Founder, Data Fellas, Inc.
Xavier started his career as a researcher in Experimental Physics and also focused on data processing. Further down the road, he took part in projects in finance, genomics and software development for academic research. During that time, he worked on timeseries, on prediction of biological molecular structures and interactions, and applied Machine Learning methodologies. He developed solutions to manage and process data distributed across data... Read More →


Wednesday November 11, 2015 2:00pm - 2:30pm
Erdos Stage

2:30pm

Determining the Winners from Big Data
TBD

Speakers

Raymie Stata

Jack of All Trades, Altiscale
Raymie comes to Altiscale from Yahoo!, where he was Chief Technical Officer. At Yahoo! he played an instrumental role in algorithmic search, display advertising, and cloud computing. He also helped set Yahoo!’s open-source strategy and initiated its participation in the Apache Hadoop project. Prior to joining Yahoo! Raymie founded Stata Laboratories, maker of the Bloomba search-based e-mail client, which Yahoo! acquired in 2004. He has also... Read More →


Wednesday November 11, 2015 2:30pm - 3:00pm
Erdos Stage

2:30pm

NCS Continuous Media Optimization with H2O - Satya Satyamoorthy
Nielsen Catalina Solutions helps marketers and media companies measure and improve advertising performance by accurately linking what consumers watch, see, or hear (whether on television, the web, in print, or on the radio) with what they buy. A joint venture of Nielsen and Catalina, NCS provides the gold standard in marketing performance analytics.
Through unique sets of “big data”, NCS is able to target and measure advertising based on what matters most for CPG companies: actual offline sales. Nielsen Catalina Solutions has the largest single-source dataset in the country for planning and measuring CPG media, with over 70 million households and retail sales data down to the UPC level, updated daily direct from retailer point-of-sale systems. This talk will go into the details of Continuous Media Optimization (audience selection, activation, measurement and optimization) and of a solution that was built using H2O as the core. It will cover the enterprise service-based platform, various model methodologies, prediction methods, and best practices for building enterprise business intelligence platforms that can be hosted on portals and cater to tens of concurrent users using H2O.

Speakers

Satya Satyamoorthy

Director of Application Development, Nielsen Catalina Solutions
Satya is a veteran in Data Analytics with over fifteen years of experience handling Big Data. He was with Nielsen Company for several years before Nielsen started the JV with Catalina called Nielsen Catalina Solutions in 2008. Satya was one of the founding members and is in charge of Application development at NCSolutions. NCSolutions helps marketers and media companies measure and improve advertising performance by accurately... Read More →


Wednesday November 11, 2015 2:30pm - 3:00pm
Ramanujan Stage

3:00pm

Machine translation in Mobile Games–Augmenting Social Media Text Normalization with Incentivized feedback
Abstract: Machine Zone, creator of the popular mobile game Game of War: Fire Age, pioneered the use of realtime machine translation at scale in the mobile gaming space. Game of War: Fire Age is a Massively Multiplayer Online Role Playing Game (MMORPG) played on mobile devices globally. The game has over a hundred thousand concurrent users at any given time, and the massive player base communicates within the game through a Chat Server. The Chat Server is central to the game’s functioning and provides a global communication platform for players from around the world to communicate in realtime. In order to overcome language barriers, Machine Zone developed MZ Transformer, a Social Text Normalization and Translation system, which translates incoming foreign language chats to a player’s device language. The large connected player base is a social network in itself, and machine translation enables this community to communicate better and engage more effectively with the game.

Speakers

Nikhil Bojja

Director, Machine Learning & NLP, Machine Zone


Wednesday November 11, 2015 3:00pm - 3:30pm
Erdos Stage

3:00pm

Benchmarking open source ML platforms
Binary classification is one of the most widely used machine learning methods in business applications. If the number of features is not very large (sparse), algorithms such as random forests, gradient boosted trees or deep learning neural networks (and ensembles of those) are expected to perform the best in terms of accuracy. There are countless off-the-shelf open source implementations of these algorithms (e.g. R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.), but which one to use in practice? Surprisingly, there is a huge variation between even the most commonly used implementations of the same algorithm in terms of scalability, speed, and accuracy. In this talk we will see which open source tools work reasonably well on larger datasets commonly encountered in practice.

 

Speakers

Szilard Pafka

Chief Data Scientist, Epoch
Szilard studied Physics in the 90s and obtained a PhD using statistical methods to investigate the risk of financial portfolios. He then worked in a bank, quantifying and managing market risk. About a decade ago he moved to California to become the Chief Scientist of a credit card processing company, doing everything data (ETL, analysis, modeling, visualization, machine learning, etc.). He is also the founder/organizer of several data...


Wednesday November 11, 2015 3:00pm - 3:30pm
Ramanujan Stage

3:30pm

Coffee
Wednesday November 11, 2015 3:30pm - 4:00pm
Computer Science History Museum

4:00pm

Clustering + Feature Extraction on Text with H2O and Lexalytics - Seth Redmore
First we’ll show you how to get text to cluster well (tricky!), and then show how you can use tools like Lexalytics Salience Engine (or Stanford NLTK or OpenCalais, or any other Text Mining toolkit) to extract information from those clusters. This gives you the power to automatically segment your content into buckets, and then understand just what’s in those buckets. Useful for understanding differing points of view in political discourse, or message boards, or reviews – basically any type of text can benefit from this “cluster & extract” treatment. It’s a great way to discover what’s going on without making any assumptions or having to ask questions. Let your (text) data surprise you.

Speakers

Seth Redmore

Chief Marketing Officer, Lexalytics, Inc.
Seth is currently 3 for 4 in startups, including co-founding and selling a networking company to Cisco Systems. Seth was into data before it was big, and has been dealing with unstructured data problems for over ten years now. Seth is responsible for all marketing inside of Lexalytics, from arguing about taglines to filling sales' insatiable need for leads. Seth also spends lots of time processing and visualizing unstructured data to...


Wednesday November 11, 2015 4:00pm - 4:30pm
Ramanujan Stage

4:00pm

Panel - Machine Learning in Financial Services
Machine learning, data science and analytics in the financial services space. How are different companies tackling their business problems and challenges with data?

Moderators

Brendan Herger

Data Scientist, Capital One Financial Corporation
Brendan Herger is a Data Scientist, helping to bring large scale machine learning to Capital One. 

Speakers

Alex Aguilar

CTO, Overland Advisors

Conor Jensen

Analytics Program Director, Zurich North America
Conor Jensen is the Analytics Program Director for Zurich North America’s Predictive Analytics unit. He led the implementation of a new data infrastructure in addition to building the new team and refining the analytic development process. Conor has spent nearly a decade working in the analytics space, both in model development and as a model user. Previously, Conor was part of an internal strategy consulting practice at Zurich and...

Alexy Khrabrov

Chief Scientist, Nitro
Alexy Khrabrov (Ph.D., Computer Science, University of Pennsylvania) is a computer scientist working at the intersection of startups, “big” data, and functional programming. He builds and supports communities of technologists creating value and having fun at this magical center of the Venn diagram. Alexy is the first Chief Scientist at Nitro, bringing data mining, NLP, Scala and Apache Spark technologies together to...

Hang Li

Director of Data Science/Fraud Analytics, Western Union Digital
Hang Li is the Director of Data Science/Fraud Analytics at Western Union Digital. His team is responsible for the development, deployment and maintenance of all the risk models and rules used by WU to fight fraud in real time. Hang has over ten years of experience in fraud analytics, in areas including POS fraud, buyer/merchant fraud in online commerce and money transfer. Before joining WU, Hang worked for Google, PayPal, and Discover...

Jeff Tyree

Sr. Director, Enterprise Architecture, Capital One
Our enterprise architecture practice is focused on core technology disruptors, such as incorporating microservices, the first native cloud architecture, and adopting a cloud-first mindset for application builds; gaining real-time analytic insights through connecting our big data platform with an event-driven architecture; maturing our continuous delivery pipeline through automation and integration of security controls; and embracing an open...

Angela Williams

AVP, Transamerica Life and Protection


Wednesday November 11, 2015 4:00pm - 5:00pm
Erdos Stage

4:30pm

ML could solve NLP challenges - ontology management - Erik Huddleston
TBD

Speakers

Erik Huddleston

CEO, TrendKite
Erik Huddleston is the CEO of TrendKite. Prior to TrendKite he was VP Product at Sprinklr. Erik is passionate about the transformational potential of social and big data for the world’s largest brands. Erik previously served as Chief Technology Officer at Inovis and drove the vision, development, and evangelism for their cloud integration, analytics, and social supply chain offering. Erik joined Inovis through the acquisition of...


Wednesday November 11, 2015 4:30pm - 4:50pm
Ramanujan Stage

5:00pm

How Macy's uses Advanced Analytics and Big Data - Daqing Zhao
Data and analytics play an important role in Macy's customer-centric strategies. Macy's omnichannel customer strategies aim to provide a superior customer shopping experience, in store or online, with interaction through web sites and emails on desktop, tablet, and mobile devices. From improving customer experience to optimizing our business processes, Advanced Analytics at Macy’s.com uses data to build powerful tools that improve the experience of our customers. As data scientists who are also domain experts, we focus on solving specific data-driven business problems, such as those in customer or supply chain analytics and modeling, using increasingly large data sets that require scalable and efficient modeling tools. We will discuss our experience in all these areas, with some cases to illustrate.

Speakers

Daqing Zhao

Director of Advanced Analytics, Macy's
Daqing Zhao has over 20 years of experience in advanced data analytics. Trained in physical sciences, he gained extensive expertise in customer-centric marketing and in optimization of all stages of customer acquisition, conversion and retention. He worked on predictive modeling for banner ads, web behavior, search keywords, 1:1 emails, customer lifetime values, real-time recommendation engines, and others. Daqing is Director of Advanced...


Wednesday November 11, 2015 5:00pm - 5:20pm
Erdos Stage

5:00pm

Panel - Advanced Machine Learning in Natural Language Processing
We'll talk about the advancements in Natural Language Processing and the current challenges including ontology, sentiment, semantics, pragmatics and how Machine Learning could help solve them.

Speakers

Nikhil Bojja

Director, Machine Learning & NLP, Machine Zone

Erik Huddleston

CEO, TrendKite
Erik Huddleston is the CEO of TrendKite. Prior to TrendKite he was VP Product at Sprinklr. Erik is passionate about the transformational potential of social and big data for the world’s largest brands. Erik previously served as Chief Technology Officer at Inovis and drove the vision, development, and evangelism for their cloud integration, analytics, and social supply chain offering. Erik joined Inovis through the acquisition of...

Alexy Khrabrov

Chief Scientist, Nitro
Alexy Khrabrov (Ph.D., Computer Science, University of Pennsylvania) is a computer scientist working at the intersection of startups, “big” data, and functional programming. He builds and supports communities of technologists creating value and having fun at this magical center of the Venn diagram. Alexy is the first Chief Scientist at Nitro, bringing data mining, NLP, Scala and Apache Spark technologies together to...

Seth Redmore

Chief Marketing Officer, Lexalytics, Inc.
Seth is currently 3 for 4 in startups, including co-founding and selling a networking company to Cisco Systems. Seth was into data before it was big, and has been dealing with unstructured data problems for over ten years now. Seth is responsible for all marketing inside of Lexalytics, from arguing about taglines to filling sales' insatiable need for leads. Seth also spends lots of time processing and visualizing unstructured data to...

Oleg Rogynskyy

VP of Marketing & Growth, H2O.ai
Oleg heads up marketing and growth initiatives at H2O. Prior to H2O, Oleg spent 8 years building 3 startups in the natural language processing space, leading Nstein Technologies to its acquisition by Open Text and Semantria to its acquisition by Lexalytics.


Wednesday November 11, 2015 5:00pm - 6:00pm
Ramanujan Stage

5:20pm

ML in Industry - Beyond Installing Packages - Sadik Kapadia
Building cutting-edge production machine learning (ML) systems is hard. We shall discuss a variety of rules of thumb that can improve the chances of success, ranging from what makes "good" code to researching ML, architectural design, devops burden, and big-company inertia.

Speakers

Sadik Kapadia

CTO, Sociogramics Inc
Sadik Kapadia is a veteran of the machine learning world. In the late 1980s and early 1990s he made major...


Wednesday November 11, 2015 5:20pm - 5:40pm
Erdos Stage

5:40pm

Roll your own big data analytics in the cloud without reinventing the wheel - Vin Sharma
Big data analytics has been proven to improve profitability and spur innovation for the enterprise. However, the real-world deployment of big data analytics solutions at scale is still slow.

Intel, teamed up with ecosystem partners, has introduced an open source analytic platform-as-a-service software stack for data scientists and app developers to build and deploy domain-specific advanced analytics applications at cloud scale. The platform is designed to accelerate the development of cloud-native applications driven by big data analytics, simplify their deployment on-premises and in public clouds, and deliver hardware-enhanced performance and security for analytic workloads.

Join us and discover use cases in various industries and learn how you can build your own solutions with this open source trusted analytic platform.

Speakers

Vin Sharma

Director of Strategy and Product, Big Data Analytics, Intel


Wednesday November 11, 2015 5:40pm - 6:00pm
Erdos Stage