Blog Kevin Doubleday05.24.22

3 Use Cases for Distributed Ledger Technology in Data Management

Blockchain, Distributed Ledger Technology, or Crypto-Secure Database technology provide the ingredients for digital trust within data records. Let's explore 3 use cases.

The world of blockchain spans from total decentralization (referred to as “crypto-minimalism) to federated distributed ledgers to private cryptographically-secured databases. Across all of these use cases, digital trust is a core benefit of employing the crypto-based elements of a blockchain ledger. For an enterprise data environment, DLTs can lock transactions in a cryptographically-secured database in such a way that information can be verified and traced back to its originator with ease.

How Does Blockchain Provide Trust, Proof, and Provenance to the Information?


Every update to a ledger-backed database is timestamped and embedded into a “block” of data, cryptographically secured by a hashing process that incorporates the hash of the previous block, and joins the ledger as the next chronological update. 

The Blockchain Hash Process that helps to establish digital trust.


In the same way that blockchain provides digital trust for transactions between third parties, distributed ledger technologies leverage cryptography and hashing to build an immutable, verifiable audit trail for records in a database or application. These benefits can be leveraged practically for a developer building an app or integration, for departments of highly-regulated businesses conducting audits, or for data scientists building ethical and explainable AI. 

Let’s look at 3 practical applications for ledger-backed data management for organizations:

1. Non-Repudiation

Non-repudiation is a legal concept that means someone cannot deny the authenticity of something – a piece of information, a document, a signature, etc. Widely used in industries where trust is essential between parties, such as law, finance, or insurance, non-repudiation is an essential information security practice that provides proof of the origin and authenticity of information as it is exchanged between parties. 

Blockchain provides a perfect platform for businesses to practice non-repudiation. The use of digital signatures and public/private key cryptography allows for information to be digitally verified by parties sending or receiving messages. As a result, organizations can adopt better non-repudiation mechanisms for digitally signing documents or retrieving verified messages, which increases the speed and scale of their third-party business dealings. 

2. Change-Data Capture and Trusted Data Audit

Having an immutable log of data changes provides immense benefits – developers can identify, review, and pull all of the changes to a database and make use of the historical information in another system. Because blockchain preserves every state to a ledger database, developers can capture those changes for review analytically or for use in another system. Developers can begin to ask the ledger for historical information, such as “What did the data look like at this point in time, and how has it changed since then?” 

A blockchain ledger can also provide professionals in highly-regulated industries with data provenance guarantees and replayability for instant data audit and reporting. Financial services, healthcare, and other regulated systems can use ledger databases to reproduce data alongside a timestamp to prove compliance.

3. Explainable and Secure AI

In the fields of machine learning and artificial intelligence, data scientists are increasingly concerned with the concept of “explainable AI,” a process in which humans can review, comprehend, and trust the output results of an algorithm. Explainable AI practices help data scientists retrace and explain the results of an AI algorithm, which allows for fairness and transparency in AI development.

If input data existed in a blockchain database system, AI professionals could build tools to audit the algorithm thoroughly to reveal important information on explainability. By providing an immutable and cryptographically-secured approach to data management, a blockchain structure delivers a historical lens for data scientists to review and assess. 

Blockchain can also provide a defense against adversarial attacks to AI algorithms, fighting against “data poisoning” in which attackers will manipulate input data in order to modify the outcome of an algorithm. By leveraging blockchain’s source of truth and anti-manipulation cryptography, AI developers can guarantee that an algorithm is using authentic data from verified sources.

Conclusion

Leveraging a blockchain-secured database can provide organizations with a foundation for trusted and traceable information management. The applicability spans far and wide – from generalized data audit capabilities to industry-specific use cases such as digital evidence management systems (DEMs), AI Explainability systems, or compliance reporting. Whichever the scenario, blockchain can help move organizations away from “black-box” reporting and into a more transparent, accurate, and trusted system for data management. 


There seems to be a lack of — ahem — *consensus* around blockchain’s definition.

While the philosophical spectrum ranges from satoshi minimalists to mass enterprise adopters, it is important to first understand the technical componentry of blockchain in order to deploy it as a useful application. Here is a “feature-first” definition that may be useful in understanding these technical underpinnings:

The image defines blockchain as being a distributed digital ledger that uses beer to peer consensus within a decentralized network to validate transactions and a hashing algorithm to cryptographically link them in a chronological "chain" of records. The image then breaks that definition into 9 different parts to further elaborate on the definition of blockchain and its elements.



Today, we’ll dive into immutability, a core defining feature of blockchain.

Across the hundreds of articles and conversations around Blockchain, you’ll find the term “immutable” almost always present. Immutability — the ability for a blockchain ledger to remain a permanent, indelible, and unalterable history of transactions — is a definitive feature that blockchain evangelists highlight as a key benefit. Immutability has the potential to transform the auditing process into a quick, efficient, and cost-effective procedure, and bring more trust and integrity to the data businesses use and share every day.

We spend trillions of dollars on cybersecurity solutions meant to keep outside prying eyes from accessing our sensitive data. But rarely do we fight the internal cybersecurity battle: ensuring that our data has not been manipulated, replaced, or falsified by a company or its employees. In many cases, we have come to simply trust that the data is correct by methods like private keys and user permissions. But in reality, we cannot prove — methodically or mathematically — that information in a standard application database is unequivocally tamper-free. Auditing becomes our next (and expensive) line of defense.

Blockchain implementation can bring an unprecedented level of trust to the data enterprises use on a daily basis — immutability provides integrity (both in its technical and primary definition). With blockchain, we can prove to our stakeholders that the information we present and use has not been tampered with, while simultaneously transforming the audit process into an efficient, sensible, and cost-effective procedure.

How Immutability is Achieved

A Brief Introduction to Cryptography and Hashing

Before we dive into blockchain immutability, we’ll need to understand cryptographic hashing. Here are the basics:

The image shows a SHA-256 Hashing Machine.
SHA-256 Hashing Machine


Want to test out some basic hashing? Here is a free Sha-256 hash calculator: http://www.xorbin.com/tools/sha256-hash-calculator

Cryptography + Blockchain Hashing Process = Immutability

Each transaction that is verified by the blockchain network is timestamped and embedded into a “block” of information, cryptographically secured by a hashing process that links to and incorporates the hash of the previous block, and joins the chain as the next chronological update.

The hashing process of a new block always includes meta-data from the previous block’s hash output. This link in the hashing process makes the chain “unbreakable” — it’s impossible to manipulate or delete data after it has been validated and placed in the blockchain because if attempted, the subsequent blocks in the chain would reject the attempted modification (as their hashes wouldn’t be valid). 

In other words, if data is tampered with, the blockchain will break, and the reason could be readily identified. This characteristic is not found in traditional databases, where information can be modified or deleted with ease.

The blockchain is essentially a ledger of facts at a specific point in time. For Bitcoin, those facts involve information about Bitcoin transfers between addresses. The below image shows how the checksum of transaction data is added as part of the header, which, in turn, is hashed into and becomes that entire block’s checksum.

The image shows the blockchain hash process for immutability.
Blockchain Hashing Process for Immutability

Benefits, Explained


Why does immutability matter? For the enterprise, immutability as a result of blockchain implementation presents a serious overhead saver as well as simplified auditing efforts & fraud prevention. We’ll break these concepts down:

Complete Data Integrity — Ledgers that deploy blockchain technology can guarantee the full history and data trail of an application: once a transaction joins the blockchain, it stays there as a representation of the ledger up to that point in time. The integrity of the chain can be validated at any time by simply re-calculating the block hashes — if a discrepancy exists between block data and its corresponding hash, that means the transactions are not valid. This allows organizations and its industry regulators to quickly detect data tinkering.

Simplified Auditing — Being able to produce the complete, indisputable history of a transactional ledger allows for an easy and efficient auditing process. Proving that data has not been tampered with is a major benefit for companies that need to comply with industry regulations. Some common use cases include supply chain management, finance (for example, Sarbanes-Oxley disclosures), and identity management.

Increase in efficiencies — Maintaining a full historical record is not only a boon to auditing, but also provides new opportunities in query, analytics, and overall business processes. FlureeDB, for instance, takes advantage of the concept of time travel for business applications — where queries can be specified as of any block — or point in time — and reproduce that time’s version of the database, immediately.

This capability allows for a host of time and cost savings — including tracking the provenance of major bugs, auditing specific application data, and backup and restoring database state changes to retrieve information. Immutability can make the most modern-day data problems that plague enterprise applications irrelevant.

Proof of Fault — Disputes over fault in business are all too common. The construction industry accounts for $1 Trillion dollars in losses as a result of unresolved disputes. While blockchain won’t wholly dissolve this massive category of legal proceedings, it could be leveraged to prevent a majority of disputes related to data provenance and integrity (essentially proving who did what and at what time).

Blockchain finality allows us — and a jury — to fully trust every piece of information.

Fluree secures every transaction — proving who initiated it, when it was completed, and that it is free of tampering.

The image shows the fluree transaction hashing process.
Fluree Transaction Hashing Process

It even tracks the changes a SaaS vendor makes to your transaction before it reaches the data storage tier, meaning you can trust your SaaS data without fully trusting your SaaS vendor:

This image breaks down fluree data integrity into 5 different parts that reveal how fluree works to secure every data transaction.
Fluree Data Integrity

The Asterisk to Immutability

Immutability ≠ Perfect, Truthful Data

Blockchain is a mechanism for detecting mathematical untruths, not a magical lie detector.

Blockchain doesn’t inherently, automatically, or magically make data truthful — its implementation merely cryptographically secures it so that it will never be altered or deleted without consequence. Measures such as sharing your hash outputs directly with stakeholders (customers, auditors, etc.) or setting up a decentralized network of validation nodes are a good complement to the historical immutability the blockchain hashing process provides, to ensure an often-needed validation component.

In addition: the stronger the enforcement rules, the more reliable the data on the blockchain (Exhibit A: Bitcoin’s proof of work).

FAQs on Immutability


What are the disadvantages of immutability? How can they be avoided?

Having an unalterable history of transactions seems like the answer to many modern business problems — and it is, in many ways. But what happens when sensitive data — like employee’s home addresses — is accidentally published to a blockchain? Hopefully, this wouldn’t be an issue, as standard design decisions in building blockchain environments necessitate a separation between sensitive and personally-identifying information.

If running a private, federated blockchain, your company would have to convince the other parties to agree to a “fork” in the blockchain — where a blockchain splits into two paths and the new designated database continues on. All or most parties involved in this blockchain will have to agree on the terms including which block to fork at and any additional rules of the new database. If this blockchain is truly public, it is next to impossible to have this information removed (a hard fork is also required here, but you are much more unlikely to convince the other parties in the network to comply).

In terms of Databases and Infrastructure, isn’t holding the entire transaction history very costly with regard to space?

Holding every database state might have been costly in the 90s, but current storage costs (1GB in AWS = $.023) are incredibly cheap. And the benefits (data integrity, ability to issue queries against any point in time) far outweigh the slight cost difference.

Fluree offers an ACID-compliant blockchain distributed ledger that records every state change in history as an immutable changelog entry. It allows for powerful query capability with FlureeDB, a graph query engine. By bringing blockchain to the data tier, Fluree is a practical and powerful infrastructure on which to build, distribute, and scale custom blockchains.