Living objects: towards flexible big data sharing

Authors

  • Jonathan Marti Barcelona Supercomputing Center, Barcelona, Spain
  • Anna Queralt Barcelona Supercomputing Center, Barcelona, Spain
  • Daniel Gasull Barcelona Supercomputing Center, Barcelona, Spain
  • Toni Cortes Universitat Politècnica de Catalunya, Barcelona Supercomputing Center, Barcelona, Spain

Keywords:

Data sharing, Data control, Offloading, Enrichment, Persistent objects, Data as a Service (DaaS), Big data

Abstract

Data sharing and especially enabling third parties to build new services using large amounts of shared data is clearly a trend for the future and a main driver for innovation. However, sharing data is a challenging and involved process today: The owner of the data wants to maintain full and immediate control on what can be done with it, while users are interested in offering new services which may involve arbitrary and complex processing over large volumes of data. Currently, flexibility in building applications can only be achieved with public or non-sensitive data, which is released without restrictions. In contrast, if the data provider wants to impose conditions on how data is used, access to data is centralized and only predefined functions are provided to the users. We advocate for an alternative that takes the best of both worlds: distributing control on data among the data itself to provide flexibility to consumers. To this end, we exploit the well-known concept of object, an abstraction that couples data and code, and make it act and react according to the circumstances.

Downloads

Download data is not yet available.

References

[1] M. Balazinska, B. Howe, and Dan Suciu. "Data Markets in the Cloud: An Opportunity for the Datababase Community." PVLDB 4, no. 12, 2011, pp. 1482-1485.
[2] D. Bennett, and A. Harvey. 3Publishing Open Government Data ́. W3C Working Draft. 2009. http://www.w3.org/TR/gov-data/ (accessed January 2013)
[3] M. J. Carey, N. Onose, and M. Petropulos. "Data Services." Commun. ACM 55, no. 6, 2012, pp. 86-97.
[4] R. G. G. Cattell, and D. K. Barry. The Object Data Standard: ODMG 3.0. Morgan Kaufmann, 2000.
[5] F. Chang, J. Dean, S. Ghemawat, W.C. Hsieh, D.A. Wallach, M. Burrows, T. Chandra, A. Fikes, and R.E.
Gruber. "Bigtable: A distributed storage system for structured data.", ACM Trans on Computer Systems 26, no. 2, 2008.
[6] J. Dean, and S. Ghemawhat, "MapReduce: Simplified Data Processing on Large Clusters", Symposium on Operating System Design and Implementation, OSDI, 2004, pp. 137-150.
[7] G. DeCandia, D. Hastorun, M. Jampani, G. Kakulapati, A. Lakshman, A. Pilchin, S. Sivasubramanian, P. Vosshall, and W. Vogels, "Dynamo: Amazon highly available keyvalue store", Symposium on Operating Systems Principles, SOSP, 2007, pp. 205-220.
[8] EyeDB - Open Source Object http://www.eyedb.org (accessed July 2012).
[9] B. G. Fitch, A. Rayshubskiy, M. C. Pitman, T. J. Ward, and R. S. Germain. "Using the Active Storage Fabrics Model to Address Petascale Storage Challenges", Annual Workshop on Petascale Data Storage, PDSW, 2009, pp. 47-54.
[10] E. Friedman, P. M. Pawlowski, and J. Cieslewicz, "SQL/MapReduce: A practical approach to self-describing, polymorphic and parellelizable user-defined functions", PVLDB 2 no. 2, 2009, pp. 1402-1413.
[11] H. Gonzalez, A.Y. Halevy, C. Jensen, A. Langen, J. Madhavan, R. Shapley, and W. Shen. "Google Fusion Tables: Data Management, Integration, and Collaboration in the Cloud.", ACM Symposium on Cloud Computing, SoCC, 2010, pp. 175-180.
[12] J. M. Hellerstein, C. Ré, F. Schoppmann, D. Z. Wang, E. Fratkin, A. Gorajek, K. S. Ng, C. Welton, X. Feng, K. Li, and A. Kumar: "The MADlib Analytics Library or MAD Skills, the SQL", PVLDB 5, no. 12, 2012, pp. 1700-1711.
[13] Java Platform, Standard Edition 7, API Specification, http://docs.oracle.com/javase/7/docs/api/java/security/GuardedObject.html (accessed July 2013).
[14] JBoss Community, Hibernate, http://www.hibernate.org (accessed January 2013).
[15] J. Madhavan, S. Balakrishnan, K. Brisbin, H. Gonzalez, N. Gupta, A. Y. Halevy, K. Jacqmin-Adams, H. Lam, A. Langen, H. Lee, R. McChesney, R. Shapley, and W. Shen "Big Data Storytelling Trough Interactive Maps." IEEE Data Engineering Bulletin 35, no. 2, 2012, pp. 46-54.
[16] N. Markovic, D. Nemirovsky, O. Unsal, M. Valero, and A. Cristal. 3Object Oriented Execution Model (OOM) ́. In NDCA, held in conjunction with ISCA, 2011.
[17] McObject, Perst, http://www.mcobject.com/perst (accessed January 2013).
[18] Oracle TopLink, http://www.oracle.com/technetwork/middleware/toplink/overview/index.html (accessed January 2013).
[19] Oracle Database Application Developer's Guide - Object-Relational Features, http://docs.oracle.com/cd/B19306_01/appdev.102/b14251.pdf (accessed January 2013).
[20] Progress Software, ObjectStore, http://www.progress.com/en/objectstore/index.html (accessed January 2013).
[21] M. Stonebraker, S. Madden, D. J. Abadi, S. Haziropoulos, N. Hachem, and P. Helland. 3The End of an
Architectural Era (It's Time for a Complete Rewrite) ́, International Conference on Very Large Data Bases, VLDB, 2007, pp. 1150-1160.
[22] Teradata Aster, http://www.asterdata.com (accessed January 2013).
[23] Versant Object Database, http://www.versant.com/products/versant-object-database (accessed January 2013).

Downloads

Published

2013-10-01

How to Cite

Marti, J., Queralt, A., Gasull, D., & Cortes, T. (2013). Living objects: towards flexible big data sharing. Journal of Computer Science and Technology, 13(02), p. 56–63. Retrieved from https://journal.info.unlp.edu.ar/JCST/article/view/611

Issue

Section

Invited Articles