Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Free Learning
Arrow right icon
Managing Data as a Product
Managing Data as a Product

Managing Data as a Product: Design and build data-product-centered socio-technical architectures

Arrow left icon
Profile Icon Andrea Gioia
Arrow right icon
Coming Soon Coming Soon Publishing in Nov 2024
€18.99 per month
eBook Nov 2024 368 pages 1st Edition
Subscription
Free Trial
Renews at €18.99p/m
Arrow left icon
Profile Icon Andrea Gioia
Arrow right icon
Coming Soon Coming Soon Publishing in Nov 2024
€18.99 per month
eBook Nov 2024 368 pages 1st Edition
Subscription
Free Trial
Renews at €18.99p/m
Subscription
Free Trial
Renews at €18.99p/m

What do you get with a Packt Subscription?

Free for first 7 days. $19.99 p/m after that. Cancel any time!
Info icon
You can access this book only when it is published in Nov 2024
Product feature icon Unlimited ad-free access to the largest independent learning library in tech. Access this title and thousands more!
Product feature icon 50+ new titles added per month, including many first-to-market concepts and exclusive early access to books as they are being written.
Product feature icon Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.
Product feature icon Thousands of reference materials covering every tech concept you need to stay up to date.
Subscribe now
View plans & pricing
Table of content icon View table of contents Preview book icon Preview Book

Managing Data as a Product

From Data as a Byproduct to Data as a Product

In this book, we will explore how to transition from managing data merely as a byproduct that supports applications to managing data as a product in its own right. Before tackling the various aspects that contribute to this paradigm shift, it’s crucial to understand why managing data as a product is important and how this practice enables us to surpass the limits of today’s data platforms.

In this chapter, we will explore the history of monolithic data platforms, which have characterized the evolution of data management over the last 30 years. We will seek to understand the common problems that make them incapable of sustainably managing the accidental complexity they generate as they grow. Finally, we will see why addressing the fundamental issues, instead of merely treating surface-level symptoms, requires more than just technological innovations. It calls for a paradigm shift that leads us toward more sustainable socio...

Reviewing the history of monolithic data platforms

Managing the substantial amount of data that’s generated by every company daily is a complex endeavor. It calls for dedicated resources and technological support in the form of a specific data platform.

Nowadays, data platforms often fall short in delivering the expected value compared to the investments made, primarily due to organizations’ inability to sustainably manage the complexity they generate over time.

System complexity

The complexity of a system is determined by the number of its components multiplied by the number of correlations between them. A database with 10,000 tables is not much more complex than a database with 100 tables if the tables themselves are not correlated. Each table tells its own story. It can be manipulated without concern for the meaning of other tables and the potential impacts that the executed action may have on them. However, the complexity between the two databases is very...

Understanding why monolithic data platforms fail

If we look at the evolution of data management over the last 40 years, we’ll see a story of incredible technological revolutions and just as many project failures. At the beginning of this chapter, we mentioned that the main reason for these failures is the complexity generated by data management platforms, and this complexity grows approximately quadratically with the size of the platform. Therefore, these are not typical project failures as we are accustomed to understanding them. Data platforms rarely fail before their launch, never making it into production. Instead, they often experience failures related to their ability to evolve and survive over time. Platforms don’t fail immediately but over time, as they struggle to deliver the expected value in proportion to the constantly increasing maintenance costs they generate.

Like a Jenga tower becoming increasingly unstable as more pieces are added until it collapses...

Exploring why we need to manage data as a product

To escape the quagmire we find ourselves in, it is necessary to radically change the mental model we use to approach data management and, consequently, the organizational structures and associated operational practices. It’s a systemic change – a paradigm shift in data management practice.

As we’ve seen in the previous sections, attempts to address these problems have been predominantly cosmetic, not radical. We’ve tried to modify the system tactically, reacting to surface-level problems as they arise.

In system thinking, a system can be changed from the outside by acting on parts of it where small changes can lead to significant and lasting changes over time; these parts are called leverage points. Donella Meadows, a renowned researcher in this field, has classified possible leverage points into 12 categories, ranking them by effectiveness. It’s not necessary to delve into the details of each...

Summary

There is no doubt about the importance that data holds for organizations today to compete in the market. However, how we traditionally manage data has led to the construction of monolithic platforms unable to survive the complexity they generate.

Technological innovations have helped us solve many problems related to data management over time, but not the root causes of these problems. With each new generation of data platforms, the same issues have always resurfaced.

In this chapter, we saw how a paradigm shift in data management is necessary to make our platforms sustainable over time – a paradigm shift centered around the idea of treating data as a product and building modular platforms capable of governing the intrinsic complexity they generate without collapsing. It is a systemic transformation that touches on different levels of the organization. Throughout, we have shown the desired transformation required for each level, from mental models to operating...

Further reading

For more information on the topics that were covered in this chapter, please take a look at the following resources:

  • An Architecture for a Business and Information System, by B. Devlin and Paul T. (1988): https://www.semanticscholar.org/paper/An-Architecture-for-a-Business-and-Information-Devlin-Murphy/c22ce1eeafb01f0682e194a2a22349aa141b78f6
  • Building the Data Warehouse, by W. H. Inmon (1992): https://www.amazon.com/Building-Data-Warehouse-W-Inmon/dp/0764599445
  • The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling, by R. Kimball and Margy Ross (1996): https://www.amazon.com/Data-Warehouse-Toolkit-Definitive-Dimensional/dp/1118530802/
  • Data Lake Architecture: Designing the Data Lake and Avoiding the Garbage Dump, by W. H. Inmon (2016): https://www.amazon.com/Data-Lake-Architecture-Designing-Avoiding/dp/B01HN4JOPC/
  • The Modern Data Stack: Past, Present, and Future, by Tristan Handy (2020): https://www.getdbt.com/blog/future-of...
Left arrow icon Right arrow icon
Download code icon Download Code

Key benefits

  • Leverage data-as-product to unlock the modular platform potential and fix flaws in traditional monolithic architectures
  • Learn how to identify, implement, and operate data products throughout their life cycle
  • Design and execute a forward-thinking strategy to turn your data products into organizational assets
  • Purchase of the print or Kindle book includes a free PDF eBook

Description

Traditional monolithic data platforms struggle with scalability and burden central data teams with excessive cognitive load, leading to challenges in managing technological debt. As maintenance costs escalate, these platforms lose their ability to provide sustained value over time. With two decades of hands-on experience implementing data solutions and his pioneering work in the Open Data Mesh Initiative, Andrea Gioia brings practical insights and proven strategies for transforming how organizations manage their data assets. Managing Data as a Product introduces a modular and distributed approach to data platform development, centered on the concept of data products. In this book, you’ll explore the rationale behind this shift, understand the core features and structure of data products, and learn how to identify, develop, and operate them in a production environment. The book guides you through designing and implementing an incremental, value-driven strategy for adopting data product-centered architectures, including strategies for securing buy-in from stakeholders. Additionally, it explores data modeling in distributed environments, emphasizing its crucial role in fully leveraging modern generative AI solutions. By the end of this book, you’ll have gained a comprehensive understanding of product-centric data architecture and the essential steps needed to adopt this modern approach to data management.

Who is this book for?

If you’re an experienced data engineer, data leader, architect, or practitioner committed to reimagining your data architecture and designing one that enables your organization to get the most value from your data in a sustainable and scalable way, this book is for you. Whether you’re a staff engineer, product manager, or a software engineering leader or executive, you’ll find this book useful. Familiarity with basic data engineering principles and practices is assumed.

What you will learn

  • Overcome the challenges in scaling monolithic data platforms, including cognitive load, tech debt, and maintenance costs
  • Discover the benefits of adopting a data-as-a-product approach for scalability and sustainability
  • Navigate the complete data product lifecycle, from inception to decommissioning
  • Automate data product lifecycle management using a self-serve platform
  • Implement an incremental, value-driven strategy for transitioning to data-product-centric architectures
  • Optimize data modeling in distributed environments to enhance GenAI-based use cases

Product Details

Country selected
Publication date, Length, Edition, Language, ISBN-13
Publication date : Nov 29, 2024
Length: 368 pages
Edition : 1st
Language : English
ISBN-13 : 9781835469378
Category :
Languages :
Tools :

What do you get with a Packt Subscription?

Free for first 7 days. $19.99 p/m after that. Cancel any time!
Info icon
You can access this book only when it is published in Nov 2024
Product feature icon Unlimited ad-free access to the largest independent learning library in tech. Access this title and thousands more!
Product feature icon 50+ new titles added per month, including many first-to-market concepts and exclusive early access to books as they are being written.
Product feature icon Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.
Product feature icon Thousands of reference materials covering every tech concept you need to stay up to date.
Subscribe now
View plans & pricing

Product Details

Publication date : Nov 29, 2024
Length: 368 pages
Edition : 1st
Language : English
ISBN-13 : 9781835469378
Category :
Languages :
Tools :

Packt Subscriptions

See our plans and pricing
Modal Close icon
€18.99 billed monthly
Feature tick icon Unlimited access to Packt's library of 7,000+ practical books and videos
Feature tick icon Constantly refreshed with 50+ new titles a month
Feature tick icon Exclusive Early access to books as they're written
Feature tick icon Solve problems while you work with advanced search and reference features
Feature tick icon Offline reading on the mobile app
Feature tick icon Simple pricing, no contract
€189.99 billed annually
Feature tick icon Unlimited access to Packt's library of 7,000+ practical books and videos
Feature tick icon Constantly refreshed with 50+ new titles a month
Feature tick icon Exclusive Early access to books as they're written
Feature tick icon Solve problems while you work with advanced search and reference features
Feature tick icon Offline reading on the mobile app
Feature tick icon Choose a DRM-free eBook or Video every month to keep
Feature tick icon PLUS own as many other DRM-free eBooks or Videos as you like for just €5 each
Feature tick icon Exclusive print discounts
€264.99 billed in 18 months
Feature tick icon Unlimited access to Packt's library of 7,000+ practical books and videos
Feature tick icon Constantly refreshed with 50+ new titles a month
Feature tick icon Exclusive Early access to books as they're written
Feature tick icon Solve problems while you work with advanced search and reference features
Feature tick icon Offline reading on the mobile app
Feature tick icon Choose a DRM-free eBook or Video every month to keep
Feature tick icon PLUS own as many other DRM-free eBooks or Videos as you like for just €5 each
Feature tick icon Exclusive print discounts
Banner background image

Table of Contents

17 Chapters
Part 1: Data Products and the Power of Modular Architectures Chevron down icon Chevron up icon
Chapter 1: From Data as a Byproduct to Data as a Product Chevron down icon Chevron up icon
Chapter 2: Data Products Chevron down icon Chevron up icon
Chapter 3: Data Product-Centered Architectures Chevron down icon Chevron up icon
Part 2: Managing the Data Product Lifecycle Chevron down icon Chevron up icon
Chapter 4: Identifying Data Products and Prioritizing Developments Chevron down icon Chevron up icon
Chapter 5: Designing and Implementing Data Products Chevron down icon Chevron up icon
Chapter 6: Operating Data Products in Production Chevron down icon Chevron up icon
Chapter 7: Automating Data Product Lifecycle Management Chevron down icon Chevron up icon
Part 3: Designing a Successful Data Product Strategy Chevron down icon Chevron up icon
Chapter 8: Moving through the Adoption Journey Chevron down icon Chevron up icon
Chapter 9: Team Topologies and Data Ownership at Scale Chevron down icon Chevron up icon
Chapter 10: Distributed Data Modeling Chevron down icon Chevron up icon
Chapter 11: Building an AI-Ready Information Architecture Chevron down icon Chevron up icon
Chapter 12: Bringing It All Together Chevron down icon Chevron up icon
Index Chevron down icon Chevron up icon
Other Books You May Enjoy Chevron down icon Chevron up icon
Get free access to Packt library with over 7500+ books and video courses for 7 days!
Start Free Trial

FAQs

What is included in a Packt subscription? Chevron down icon Chevron up icon

A subscription provides you with full access to view all Packt and licnesed content online, this includes exclusive access to Early Access titles. Depending on the tier chosen you can also earn credits and discounts to use for owning content

How can I cancel my subscription? Chevron down icon Chevron up icon

To cancel your subscription with us simply go to the account page - found in the top right of the page or at https://subscription.packtpub.com/my-account/subscription - From here you will see the ‘cancel subscription’ button in the grey box with your subscription information in.

What are credits? Chevron down icon Chevron up icon

Credits can be earned from reading 40 section of any title within the payment cycle - a month starting from the day of subscription payment. You also earn a Credit every month if you subscribe to our annual or 18 month plans. Credits can be used to buy books DRM free, the same way that you would pay for a book. Your credits can be found in the subscription homepage - subscription.packtpub.com - clicking on ‘the my’ library dropdown and selecting ‘credits’.

What happens if an Early Access Course is cancelled? Chevron down icon Chevron up icon

Projects are rarely cancelled, but sometimes it's unavoidable. If an Early Access course is cancelled or excessively delayed, you can exchange your purchase for another course. For further details, please contact us here.

Where can I send feedback about an Early Access title? Chevron down icon Chevron up icon

If you have any feedback about the product you're reading, or Early Access in general, then please fill out a contact form here and we'll make sure the feedback gets to the right team. 

Can I download the code files for Early Access titles? Chevron down icon Chevron up icon

We try to ensure that all books in Early Access have code available to use, download, and fork on GitHub. This helps us be more agile in the development of the book, and helps keep the often changing code base of new versions and new technologies as up to date as possible. Unfortunately, however, there will be rare cases when it is not possible for us to have downloadable code samples available until publication.

When we publish the book, the code files will also be available to download from the Packt website.

How accurate is the publication date? Chevron down icon Chevron up icon

The publication date is as accurate as we can be at any point in the project. Unfortunately, delays can happen. Often those delays are out of our control, such as changes to the technology code base or delays in the tech release. We do our best to give you an accurate estimate of the publication date at any given time, and as more chapters are delivered, the more accurate the delivery date will become.

How will I know when new chapters are ready? Chevron down icon Chevron up icon

We'll let you know every time there has been an update to a course that you've bought in Early Access. You'll get an email to let you know there has been a new chapter, or a change to a previous chapter. The new chapters are automatically added to your account, so you can also check back there any time you're ready and download or read them online.

I am a Packt subscriber, do I get Early Access? Chevron down icon Chevron up icon

Yes, all Early Access content is fully available through your subscription. You will need to have a paid for or active trial subscription in order to access all titles.

How is Early Access delivered? Chevron down icon Chevron up icon

Early Access is currently only available as a PDF or through our online reader. As we make changes or add new chapters, the files in your Packt account will be updated so you can download them again or view them online immediately.

How do I buy Early Access content? Chevron down icon Chevron up icon

Early Access is a way of us getting our content to you quicker, but the method of buying the Early Access course is still the same. Just find the course you want to buy, go through the check-out steps, and you’ll get a confirmation email from us with information and a link to the relevant Early Access courses.

What is Early Access? Chevron down icon Chevron up icon

Keeping up to date with the latest technology is difficult; new versions, new frameworks, new techniques. This feature gives you a head-start to our content, as it's being created. With Early Access you'll receive each chapter as it's written, and get regular updates throughout the product's development, as well as the final course as soon as it's ready.We created Early Access as a means of giving you the information you need, as soon as it's available. As we go through the process of developing a course, 99% of it can be ready but we can't publish until that last 1% falls in to place. Early Access helps to unlock the potential of our content early, to help you start your learning when you need it most. You not only get access to every chapter as it's delivered, edited, and updated, but you'll also get the finalized, DRM-free product to download in any format you want when it's published. As a member of Packt, you'll also be eligible for our exclusive offers, including a free course every day, and discounts on new and popular titles.