Following the tradition set by recent versions, the new release of the MongoDB NoSQL data store comes with a batch of new features designed to appeal to the enterprise market, including a new built-in search engine, more support for geospatial data, and the ability to balance workloads across multiple servers more effectively.
"We're moving more quickly," said Kelly Stirman, 10gen director of product marketing, referring to how MongoDB's growing user base is giving the company more resources. MongoDB 2.4, available Tuesday, was released only six months after the last major version, 2.2 "We've substantially increased the size of the engineering corporation and the company has grown dramatically over the past year," Stirman said.
[ Andrew C. Oliver answers the question on everyone's mind: Which freaking database should I use? | Keep up with the latest approaches to managing information overload and compliance in InfoWorld's Enterprise Data Explosion Digital Spotlight. ]
The company has also updated the commercially supported version of this open source data store, called MongoDB Enterprise.
Since 10Gen began work on MongoDB in 2007, the data store has been downloaded more than 4 million times. The document data store was designed to ingest and read large amounts of data very quickly, and has proved itself to popular in the fields of analysis, content management, mobile and social infrastructure and user data management. 10gen supports more than 600 commercial customers with the enterprise version of the data store, including Craigslist, Disney, Electronic Arts, eBay, Foursquare, Intuit, LexisNexis, MTV, Salesforce.com and Telefonica.
One of the chief new features of MongoDB 2.4 is hashed-based sharding. Sharding takes place when different parts of a data table are spread across multiple servers. Hashed-based sharding randomly distributes new entries across all the available servers. As a result, data is distributed more evenly, minimizing hotspots that occur when too much frequently consulted data -- such as recently captured data -- gets placed on a single server.
"You get a nice distribution of all the documents across all the shards for reads and writes," Stirman said, adding that range-based sharding -- which was the previous default sharding algorithm -- will continue to be available.
The new built-in search engine, still in beta mode, may eliminate the need to maintain an external search engine, such as Apache Lucene/Solr. It offers simple text search, so it does not have all the capabilities of stand-alone search engine, such as natural language processing. But because it is built into MongoDB itself, it is a lot easier to configure and maintain, Stirman said.
"It will be good enough for a lot of applications, and the community will be excited because they won't have to worry about integrating another technology, especially if they are deploying across multiple data centers. With MongoDB, that is pretty easy," Stirman said.
MongoDB 2.4 contains a number of new techniques that may allow developers to make better use of data. One new feature, capped arrays, should find itself useful in interactive Web 2.0 environments. A capped array is an array with a predefined limit of the number of items it can hold. So it could be used for a website for displaying the 20 most popular user comments, for instance.