From Big Data Resources
Jump to: navigation, search

This wiki contains additional resources for the article Big Data and its Technical Challenges http://bit.ly/bigdatachallenges by H. V. Jagadish, Johannes Gehrke, Alexandros Labrinidis, Yannis Papakonstantinou, Jignesh Patel, Raghu Ramakrishnan, and Cyrus Shahabi, which was published in the Communications of the ACM, July 2014, Vol. 57 No. 7, Pages 86-94 (DOI: 10.1145/2611567).


Data Provenance

  • Peter Buneman, Sanjeev Khanna, and Wang Chiew Tan. 2000. Data Provenance: Some Basic Issues. In Proceedings of the 20th Conference on Foundations of Software Technology and Theoretical Computer Science (FST TCS 2000), Sanjiv Kapoor and Sanjiva Prasad (Eds.). Springer-Verlag, London, UK, UK, 87-93. http://dl.acm.org/citation.cfm?id=759696
  • Yael Amsterdamer, Susan B. Davidson, Daniel Deutch, Tova Milo, Julia Stoyanovich, and Val Tannen. 2011. Putting lipstick on pig: enabling database-style workflow provenance. Proc. VLDB Endow. 5, 4 (December 2011), 346-357. http://dl.acm.org/citation.cfm?id=2095693

Crowd-Sourcing

  • Hyunjung Park, Hector Garcia-Molina, Richard Pang, Neoklis Polyzotis, Aditya Parameswaran, and Jennifer Widom. 2012. Deco: a system for declarative crowdsourcing. Proc. VLDB Endow. 5, 12 (August 2012), 1990-1993. http://dl.acm.org/citation.cfm?id=2367555

Scale

  • Andrew Pavlo, Erik Paulson, Alexander Rasin, Daniel J. Abadi, David J. DeWitt, Samuel Madden, and Michael Stonebraker. 2009. A comparison of approaches to large-scale data analysis. InProceedings of the 2009 ACM SIGMOD International Conference on Management of data(SIGMOD '09), Carsten Binnig and Benoit Dageville (Eds.). ACM, New York, NY, USA, 165-178. DOI=10.1145/1559845.1559865 http://doi.acm.org/10.1145/1559845.1559865 http://dl.acm.org/citation.cfm?id=1559865

Timeliness

  • Don Carney, Uğur Çetintemel, Alex Rasin, Stan Zdonik, Mitch Cherniack, and Mike Stonebraker. 2003. Operator scheduling in a data stream manager. In Proceedings of the 29th international conference on Very large data bases - Volume 29 (VLDB '03), Johann Christoph Freytag, Peter C. Lockemann, Serge Abiteboul, Michael J. Carey, Patricia G. Selinger, and Andreas Heuer (Eds.), Vol. 29. VLDB Endowment 838-849. http://dl.acm.org/citation.cfm?id=1315523
  • Daniel J. Abadi, Yanif Ahmad, Magdalena Balazinska, Ugur Çetintemel, Mitch Cherniack, Jeong-Hyon Hwang, Wolfgang Lindner, Anurag Maskey, Alex Rasin, Esther Ryvkina, Nesime Tatbul, Ying Xing, Stanley B. Zdonik: The Design of the Borealis Stream Processing Engine. CIDR 2005: 277-289 http://www.cidrdb.org/cidr2005/papers/P23.pdf

Privacy

  • Hien To, Gabriel Ghinita, and Cyrus Shahabi, A Framework for Protecting Worker Location Privacy in Spatial Crowdsourcing, In Proceedings of the 40th International Conference on Very Large Data Bases (VLDB 2014), Hangzhou, China, September 2014 (to appear) http://infolab.usc.edu/DocsDemos/p992-to.pdf

Visualization

  • Edward R. Tufte, The Visual Display of Quantitative Information, 2nd Edition, Graphics Press, May 2001
  • Edward R. Tufte, Beautiful Evidence, Graphics Press, July 2006



If you would like to contribute to this list, please contact Alexandros Labrinidis at http://labrinidis.cs.pitt.edu/contact.php

Last updated by Alexandros Labrinidis (talk) 9:12, 10 July 2014 (EDT)