Frontiers in Massive Data Analysis
View larger
  • Status: Final Book
  • Downloads: 10,916
Purchase Options
Purchase Options MyNAP members save 10% online. Login or Register
Overview

Authors

Description

Data mining of massive data sets is transforming the way we think about crisis response, marketing, entertainment, cybersecurity and national intelligence. Collections of documents, images, videos, and networks are being thought of not merely as bit strings to be stored, indexed, and retrieved, but as potential sources of discovery and knowledge, requiring sophisticated analysis techniques that go far beyond classical indexing and keyword counting, aiming to find relational and semantic interpretations of the phenomena underlying the data.

Frontiers in Massive Data Analysis examines the frontier of analyzing massive amounts of data, whether in a static database or streaming through a system. Data at that scale--terabytes and petabytes--is increasingly common in science (e.g., particle physics, remote sensing, genomics), Internet commerce, business analytics, national security, communications, and elsewhere. The tools that work to infer knowledge from data at smaller scales do not necessarily work, or work well, at such massive scale. New tools, skills, and approaches are necessary, and this report identifies many of them, plus promising research directions to explore. Frontiers in Massive Data Analysis discusses pitfalls in trying to infer knowledge from massive data, and it characterizes seven major classes of computation that are common in the analysis of massive data. Overall, this report illustrates the cross-disciplinary knowledge--from computer science, statistics, machine learning, and application disciplines--that must be brought to bear to make useful inferences from massive data.

Topics

  • Math, Chemistry and Physics — Math and Statistics
  • Math, Chemistry and Physics — Policy, Reviews and Evaluations

Publication Info

190 pages | 6 x 9
Paperback
ISBN: 978-0-309-28778-4
Contents
Related Resources
Research Tools

Suggested Citation

National Research Council. Frontiers in Massive Data Analysis. Washington, DC: The National Academies Press, 2013.

Import this citation to:

Copyright Information

The National Academies Press (NAP) has partnered with Copyright Clearance Center's Rightslink service to offer you a variety of options for reusing NAP content. Through Rightslink, you may request permission to reprint NAP content in another publication, course pack, secure website, or other media. Rightslink allows you to instantly obtain permission, pay related fees, and print a license directly from the NAP website. The complete terms and conditions of your reuse license can be found in the license agreement that will be made available to you during the online order process. To request permission through Rightslink you are required to create an account by filling out a simple online form. The following list describes license reuses offered by the National Academies Press (NAP) through Rightslink:

  • Republish text, tables, figures, or images in print
  • Post on a secure Intranet/Extranet website
  • Use in a PowerPoint Presentation
  • Distribute via CD-ROM
  • Photocopy

Click here to obtain permission for the above reuses. If you have questions or comments concerning the Rightslink service, please contact:

Rightslink Customer Care
Tel (toll free): 877/622-5543
Tel: 978/777-9929
E-mail: customercare@copyright.com
Web: http://www.rightslink.com

To request permission to distribute a PDF, please contact our Customer Service Department at 800-624-6242 for pricing.

To request permission to translate a book published by the National Academies Press or its imprint, the Joseph Henry Press, please click here to view more information.

Related Books more

More by the Board on Mathematical Sciences and Their Applications more