Adoption
Data Package use cases encompass a wide range of scenarios where this standardized data packaging format proves to be invaluable for efficient data management, sharing, and analysis. Here are some key use cases:
Data Portals
Data portals adopting Data Package standard to increase accessibility of published data:
Dryad A pilot project to integrate Frictionless Data Validation within Dryad, a curated resource that enhances the discoverability, reusability, and citation of research data
Our World in Data Our World in Data publishes their datasets using Data Package standard. Most datasets included here are annual time series data for social and economic indicators by country.
Pilot Projects
We work closely with data researchers and institutions to help them integrate Frictionless into their workflow. Click on individual Pilots to learn more.
BCO-DMO A pilot collaboration with the Biological and Chemical Oceanography Data Management Office (BCO-DMO).
PUDL A pilot collaboration with the Public Utility Data Liberation project (PUDL), aimed at simplifying the usability of US energy data.
Data Readiness Group A pilot collaboration with Dr. Philippe Rocca-Serra's Data Readiness Group at Oxford, focusing on streamlining the reporting of scientific experimental results using Data Package specifications.
Data Management for TEDDINET A pilot collaboration applying Frictionless Data approaches to address data legacy challenges within the TEDDINET project, a research network dedicated to transforming energy demand in buildings.
Western Pennsylvania Regional Data Center A pilot collaboration to showcase an implementation that expounds on the quality and description of datasets in CKAN-based open data portals with the Western Pennsylvania Regional Data Center - a part of The University of Pittsburgh Center for Urban and Social Research.
UK Data Service A pilot collaboration to utilize Frictionless Data software for assessing and reporting on data quality, as well as generating visualizations from data and metadata within the UK data context.
eLife A pilot collaboration to explore the use of the goodtables library for validating all scientific research datasets hosted by eLife and advocating for open data reuse in Life and Biomedical sciences.
University of Cambridge - Retinal Mosaics A pilot collaboration to trial Frictionless software for packaging and reading data, supporting computational techniques in the investigation of nervous system development.
Pacific Northwest National Laboratory - Active Data Biology A pilot collaboration to explore the use of Frictionless Data's specifications and software for generating schemas for tabular data and validating metadata within a biological application hosted on GitHub.
Causa Natura - Pescando Datos A pilot collaboration to employ data validation software within the Causa Natura project, enhancing data quality to support fisher communities and advocacy groups.
Community Projects
Here is a list of projects that our community has created on top of Data Package. If you would like your project to be featured here, let us know!
European Commission The European Commission launched a CSV schema validator using the tabular data package specification, as part of the ISA² Interoperability Testbed.
GitHub GitHub uses Data Package standard in some of their research projects.
GBIF The Global Biodiversity Information Facility (GBIF) uses Data Package as a format to publish biodiversity data.
Validata OpenDataFrance created Validata, a platform for local public administration in France to validate CSV files on the web, using the tabular data package specification.
Gapminder Gapminder is an independent educational non-profit fighting global misconceptions. It uses the Data Package standard as a core part of its underlaying DDFcsv data format.
Libraries Hacked Libraries hacked is a project started in 2014 to promote the use of open data in libraries.
HubMAP HuBMAP is creating an open, global atlas of the human body at the cellular level.
Etalab Etalab, a department of the French interministerial digital service, launched schema.data.gouv.fr.
Nimble Learn - datapackage-m A set of functions written in Power Query M for working with Tabular Data Packages in Power BI Desktop and Power Query for Excel.
Nimble Learn - Datapackage-connector Power BI Custom Connector that loads one or more tables from Tabular Data Packages into Power BI.
Zegami Zegami is using Frictionless Data specifications for data management and syntactic analysis on their visual data analysis platform.
Center for Data Science and Public Policy, Workforce Data Initiative Supporting state and local workforce boards in managing and publishing data.
Cell Migration Standardization Organization Using Frictionless Data specs to package cell migration data and load it into Pandas for data analysis and creation of visualizations.
Collections as Data Facets - Carnegie Museum of Art Collection Data Use of Frictionless Data specifications in the release of Carnegie Museum of Arts’ Collection Data for public access & creative use.
OpenML OpenML is an online platform and service for machine learning, whose goal is to make ML and data analysis simple.
The Data Retriever Data Retriever uses Frictionless Data specifications to generate and package metadata for publicly available data.
Tesera Tesera uses Frictionless Data specifications to package data in readiness for use in different systems and components.
data.world data.world uses Frictionless Data specifications to generate schema and metadata related to an uploaded dataset and containerize all three in a Tabular Data Package.
John Snow Labs John Snow Labs uses Frictionless Data specifications to avail data to users for analysis.
Open Power System Data Open Power System Data uses Frictionless Data specifications to avail energy data for analysis and modeling.
Dataship Dataship used Frictionless Data specifications as the basis for its easy to execute, edit and share notebooks for data analysis.