Factual is a well-funded and expanding start-up with a mission to create a platform where anyone can share and mash open data on any subject. We are pioneers in the worldwide trend towards open information flow, and our goal is to provide the infrastructure to curate high-quality data and make it accessible everywhere.
Factual is looking for an engineer with good attention to detail that has both experience with and passion for large data. The candidate would take ownership of our data pipeline and manage the process and schedule to ensure timely delivery of data.
The ideal candidate would:
* Have a CS degree
* Have 2+ years of professional hands-on software experience
* Have some experience with cloud computing or cluster management
* Be enthusiastic about large data
* Be eager to learn new tools and skills related to data and scripting
* Be able to propose out-of-the-box strategies based on insights about data
Desired Skills (at least three of the following):
* Proficient in a language such as C++, Java, Python, Ruby, Perl
* Linux/bash scripting
* SQL and/or Extract/Transform/Load (ETL) frameworks
* HTML DOM/XPath and/or Regular expressions
* Project management
Responsibilities (after training and ramp up):
* Oversee entire data pipeline and schedule
* Evaluate and judge algorithmically generated data and assess data quality
* Coordinate and run web-scale jobs on large clusters and write automation scripts
* Aggregate, clean, and merge data
* Generate, maintain, and operate the generation software and scripts
* Author CSS and XPath selectors
Why work at Factual?
* Mission: We believe good data can change the world.
* People: Our goal is to hire outstanding teammates with great skills who have a commitment towards shipping an innovative product. We're looking for people who are passionate about improving the world with better data.
* Challenge: If you crave an interesting challenge, we have plenty of them, from forming new kinds of business partnerships to large-scale computing and developing new tools for data collaboration.
* Self-determination: You'll have a well-defined mission, but at the same time, our philosophy is to give people room to implement the best solution without micro-management. Bring your creativity and passion!
* Meritocracy: We reward people who do great work.
* Agility: As a start-up, we move quickly, and we are committed to keeping our organization flat and maintaining the feel of a small company.
* Transparency: We keep secrets to a minimum and communication is highly valued.
Who are we?
Factual was founded in 2007 by Gil Elbaz, co-founder of Applied Semantics (AdSense), which was acquired by Google in 2003. Gil has had a lifelong passion for organizing and structuring information, and building smart tools which can make better sense of data. Along with founding engineers, Tim Chklovski and Myron Ahn, he built Factual on the idea that it would be a better world if more decisions were data-driven. So we set out to develop an open data platform and community in an effort to maximize data accuracy, transparency, and availability. Other fellow data lovers who are helping to guide this ambitious project are Eva Ho, an ex-Googler and ASI-er and Bill Michels, former GM of Yahoo BOSS, as well as other industry veterans from Google, LinkedIn, and Idealab.
Our investors are an A-list group of accomplished internet and software pioneers who include: Andreessen Horowitz, Bill Gross via Idealab, Danny Rimer, Esther Dyson, Founder Collective, Mårten Mickos and many others.
For more information:
http://www.factual.com/about
http://www.factual.com/press
http://blog.factual.com/
http://www.factual.com/FAQ
http://twitter.com/factualinc
http://www.facebook.com/factualinc
Interested? Send a brief cover letter and resume/CV in Word or PDF format to jobs@factual.com.
Source: Joel On Software