Linked Life Data <sup>0.4.2</sup>

About the project

LinkedLifeData is a platform for semantic data integration trough RDF warehousing and efficient reasoning that helps to resolve conflicts in the data. One of the major problems that biotechnology and pharmaceutical industries face today is how to combine data from multiple sources and make their research more productive. Data integration takes much time and often leads to errors and redundancies that require more time and resources to resolve. The typical problems in working with biomedical data sources are that information is:

  • Supported by different organizations
  • Highly distributed and redundant
  • Encoded in different syntax and structural formats with special semantics for each data source
  • Locked in vast data silos accessible with limited query functionality

LinkedLifeData is a data warehouse that syndicates tons of heterogeneous biomedical knowledge in a common data model. The platform uses an extension of the RDF model that is able to track the provenance of each individual fact in the repository and thus update the information.

Semantic Data Integration

The RDF model is an abstract data model. It supports the efficient expression of declarative rules that derive new implicit information to be materialized and indexed. A typical company scenario is to link the internal company data with public information in a meangfull way. LinkedLifeData makes this possible by translating all company information into a highly abstract common data model.

Linked Life Data

Once your organization takes advantage of the expressive and uniform way to access information, LinkedLifeData platform supports declarative rules that are used:

  • To describe different objects appearing in different formats as truly equivalent - the internal sequence identifier SequenceXYZ is the same as the Uniprot entry if they have one and the same sequence
  • To maintain a specific form of relationships between objects and to infer new types of information - when/if new information appears in the Uniprot database it will automatically be updated in the internal sequence identifier SequenceXYZ
  • To unlock the data stored in silos and to overcome container-reference dichotomy (e.g. to unlock the knowledge regardless of the way it is represented) - all information about SequenceXYZ that is generated from the merging of multiple database entries and extracted from text can be accessed

To contact us, please send an email to lifeskim[-at-]ontotext.com.