Installed as a layer above Hadoop, the open-source Pydoop package enables Python scripts to do big data work easily.
Data analysis is only half the battle; getting the data into a Hadoop cluster is the first step in any Big Data deployment. Apache Flume uses an elegant design to make data loading easy and efficient.
Mozilla's Firefox OS delivers an easy way to develop and market apps for Android and the upcoming Mozilla-specific phone. Mike Riley takes a first look at developing apps for the platform.
The Hadoop ecosystem relies on composability the ability to use output from one tool as input to the next to efficiently process data at scale, from simple projects, to processing streams of real-time data, to building data warehouses.
MapReduce on small datasets can be run easily and without much coding or fiddling provided you know what to do. Here's how.
Next phase of common OpenStack-based architecture arrives
Red Hat Software Collections 1.0 arrives in beta
Language-aware, three-way source code merging tool free for open source projects
Working with predictive models and machine-learning applications on Apache Hadoop
Version 1.1 adds race detector to find concurrency bugs
Cloudera Developer Kit includes APIs, tools, and documentation
Open API to workflow tool with file sharing and sync features
A total of 91% of software projects contain indirect open source dependencies
Atlassian Stash 2.4 encourages and facilitates forking for open source coders using Git
HTML5 library enables native Windows apps to run in any browser
Events of Interest
June 17-19. Boston, MA. E2 Conference
June 18-20. Santa Clara, CA. O'Reilly Velocity Web Performance and Operations Conference
June 24-28. San Jose, CA. 2013 USENIX Annual Technical Conference
June 26-27. San Francisco, CA. Build 2013
July 22-26. Portland, OR. O'Reilly Open Source Convention 2013
July 29-31, 2013. Santa Clara, CA. JVM Language Summit
August 20-21. Raleigh, NC. Business and Technology Solutions Summit 2013: Cloud and Big Data Conference and Expo
September 16-19. Santa Clara, CA. Storage Developer Conference (SDC)
September 18-20. St. Louis, MO. Strange Loop 2013
October 1-3. San Francisco, CA. Atlassian Summit 2013
October 5-6. Los Altos Hills, CA. Silicon Valley Code Camp
Ocotber 23-25. San Fransisco, CA. API Strategy and Practice Conference
October 28-30. London, United Kingdom. JAXLondon 2013 Big Data Conference
Videos of Past Events
March 2013. GPU Technology Conference
September 2012. Strangeloop
September 2012. Intel Developer Forum
August 2012. VMWorld
July 2012. Java Language Summit
June 2012. Google I/O 2012
May 2012. Atlassian Summit
May 2012 (paid). Fluent Conference
March 2012. Multicore World
July 2011. JVM Language Summit