Cassandra Logo
 
kashlev

www.wayne.edu


chebotko

www.datastax.com

Wednesday, September 23
2:40 PM - 3:20 PM


World's best data modeling tool for Apache Cassandra
Data modeling is one of the most important steps ensuring performance and scalability of Cassandra-powered applications. The existing Chebotko data modeling methodology lays out important data modeling principles, rules and patterns to design a conceptual, logical and physical data models. While this approach enables rigorous and sound schema design, it requires specialized training and experience. To dramatically reduce time, simplify and streamline the Cassandra database design process, we develop an online tool that automates the most complex, error-prone, and time-consuming data modeling tasks: conceptual-to-logical mapping, logical-to-physical mapping, and CQL generation.

In this talk, using real life examples from the IoT domain, we demonstrate how to design correct and efficient database schemas for Cassandra. First, we use our tool, called KDM, to design a conceptual data model and specify application access patterns. Second, we demonstrate how KDM generates a logical data model that is visualized using Chebotko diagram notation. Third, we explain how to configure a logical data model and automatically generate a physical data model. Fourth, we showcase how KDM generates a CQL script for instantiating a physical data model in Cassandra. Finally, we discuss best practices for Cassandra data modeling with KDM.

The KDM tool is available for free at kdm.dataview.org and is used by many in industry and academia.

Andrey Kashlev - Wayne State University
Andrey Kashlev is a PhD candidate in big data, working in the Department of Computer Science at Wayne State University. His research focuses on big data, including data modeling for NoSQL, big data workflows, and provenance management. He has published numerous research articles in peer-reviewed international journals and conferences, including IEEE Transactions on Services Computing, Data and Knowledge Engineering, International Journal of Computers and Their Applications, and the IEEE International Congress on Big Data.

Artem Chebotko - DataStax
Dr. Artem Chebotko is a Solution Architect at DataStax. His core expertise is in data modeling, data management, data mining, and data analytics. For over 10 years, he has been leading and participating in research and development projects on NoSQL, RDF, XML, Relational, and Provenance databases. He is an author of more than 50 research and technical papers, including his recent work titled "A Big Data Modeling Methodology for Apache Cassandra" that appeared in IEEE Big Data Congress 2015. He is an educator with extensive experience in both industry and academic training.


        |        Code of Conduct        |        T&C        |        Privacy