Wednesday, February 25, 2015

Data Today: I say DARPA, Cascading workflow visualizer + more

O'Reilly Data Newsletter

1. A tale of two clusters: Mesos and YARN

With Myriad, analytics can be performed on the same hardware that runs your production services. Here's how.

2. 50 shades of Bayes

Here’s a great tutorial (with code) on naïve Bayes classification by Lynn Cherny, using text from Fifty Shades of Grey.

3. Processing frameworks for Hadoop

How to decide which framework is best for your particular use case.

4. I say DARPA

Let’s try some free association. I say DARPA, you say _______. If open data wasn’t the first thing that sprang to mind, you’re not alone. But, in fact, DARPA is "developing an open source software library for big data to help overcome the challenges of effectively scaling to modern data volume and characteristics."

5. Business model innovation needs (data) science

Here’s a post by Jerry Overton, created to accompany his talk at last week’s Strata + Hadoop World. (@sbmarkb tweeted about this session: "Just delivered full ROI on the cost of attending @strataconf!")

Sponsored Content

Taming Data Variety

Join us on March 5 for a free 30-minute webcast, hosted by data-industry veteran Andy Palmer. He'll discuss how enterprise organizations are using new approaches to deliver the cleanest, widest view of data to downstream analytic tools, for applications as diverse as:
  • Procurement Optimization
  • Clinical Study Data Conversion
  • Customer Data Integration
Thanks to Tamr for sponsoring Taming Data Variety: Intelligent Solutions Using Machine Learning and Expert Crowdsourcing.
Learn more →

6. Cascading workflow visualizer

Etsy has open-sourced Sahale, a Cascading workflow visualizer that helps you make sense of tasks clustered within Hadoop jobs.

7. POTUS introduces DJ at Strata + Hadoop World

You've probably heard that DJ Patil has been named U.S. Chief Data Scientist at the White House Office of Science and Technology Policy. If you missed President Obama's video message to Strata attendees last week, or DJ's keynote, you can watch them here. (Spoilers: Data is a team sport. President Obama looks at dashboards every day. And the White House wants you.)

8. Performance improvements in Apache Spark

The goal is for Spark to offer a single platform where users can get the best distributed algorithms for any data processing task. Reynold Xin takes a look at recent performance improvements. (Related: If Spark development is a strength of yours, get certified as a Spark developer and be recognized for your expertise.)

Sponsored Content

Big data on bare metal cloud

In the world of big data, bare metal is king. Many companies are seeking an architecture that allows for full utilization of resources like I/O and throughput, but we often hear from you that when it comes to big data you are forced to trade the advantages of cloud (elastic, on-demand, flexible) for the consistency and predictability of bare metal. With Rackspace Cloud Big Data OnMetal, get the best of both worlds.
Sign up for a free trial →

9. Signals from Strata

If you attended last week's sell-out Strata + Hadoop World, you know how huge it was—and how hard it is to give a concise recap. But Jenn Webb has made a noble effort to sum up the key insights from Strata + Hadoop World in San Jose, CA, 2015.
If you were inspired by the outstanding speakers and fascinating sessions at Strata + Hadoop World, we've got good news. The Call for Presenters for Strata + Hadoop World in New York (Sept 29–Oct 1, 2015) has just opened. Give us your best ideas, and save the dates.

10. Freebie of the week

Understanding the Chief Data Officer: How Leading Businesses are Transforming Themselves with Data has just been released. In this free report, Julie Steele provides a clear, concise look at how CDOs view their nascent role in high-profile organizations such as Wells Fargo, Samsung, the Republican National Committee, Allstate, and the Federal Reserve Board. Thanks to Silicon Valley Data Science for sponsoring this week's freebie.
Get the free report →
