New tool to pick out hidden patterns in vast data sets
December 16th, 2011 - 9:10 pm ICT by IANSWashington, Dec 16 (IANS) Researchers have developed a statistical tool that can pick out unsuspected or hidden patterns in mountains of data in a way that no other software programme can.
Part of a suite of statistical tools called MINE, it can tease out multiple patterns hidden in health information from around the globe, data on the changing bacterial landscape of the gut, and much more.
MINE, which stands for ‘Maximal Information-based Nonparametric Exploration’, is able to analyze a broad spectrum of patterns.
“There are massive data sets that we want to explore, and within them, there may be many relationships that we want to understand. The human eye is the best way to find these relationships, but these data sets are so vast that we can’t do that. This toolkit gives us a way of mining the data to look for relationships,” said Pardis Sabeti, senior study author and assistant professor at the Centre for Systems Biology, Harvard University.
The researchers tested their analytical toolkit on several large data sets, including one provided by Harvard researcher Peter Turnbaugh who is interested in identifying the trillions of microorganisms that live in the gut.
Working with Turnbaugh, the research team harnessed MINE to make more than 22 million comparisons and narrowed in on a few hundred patterns of interest that had not been observed before.
“The goal of this statistic is to take data with a lot of different dimensions and many possible correlations and pick out the top ones,” said Michael Mitzenmacher, senior study author and professor of computer science at Harvard University.
Other statistical tools work well for searching for a specific pattern in a large data set, but cannot score and compare different kinds of possible relationships.
- Geographic patterns can help predict how disease spreads - Apr 16, 2010
- Revolutionary JHU computer that can crunch 5 petabytes of data! - Nov 02, 2010
- New scientific approach could reveal pieces of human DNA favored by evolution - Jan 10, 2010
- Genes that aid antimalarial drug resistance identified - Apr 22, 2011
- The physics behind coffee rings - Nov 24, 2010
- New technique can quash cyber crimes - Mar 09, 2011
- La Nina may trigger flu pandemics: Study - Jan 17, 2012
- New tool can ensure better safety on planes - Sep 13, 2011
- Two Americans share Nobel Prize for Economics - Oct 10, 2011
- New tool can locate buried artefacts - Mar 09, 2011
- Intelligence genes may not be tied to sharp brain - Feb 26, 2012
- IMImobile announces partnership with Tree Worldwide - May 03, 2012
- Man who harvested data of 100m Facebook users speaks out - Jul 30, 2010
- A machine that can tell when you're lying - Mar 27, 2012
- New tool to diagnose autism within minutes - Apr 12, 2012
Tags: assistant professor, broad spectrum, computer science, correlations, different kinds, globe data, harvard researcher, harvard university, health information, hidden patterns, human eye, massive data, microorganisms, statistic, statistical tool, statistical tools, study author, systems biology, trillions, turnbaugh