How to start using Data Package
There are many alternatives when it comes to Data Package Standard implementations. We will cover a few the most popular options which will be a good starting point.
Open Data Editor
The simplest way to start using the Data Package Standard is by installing Open Data Editor (currently, in beta):
You can use the visual interface as you usually do in any modern IDE, adding and moving files, validating data, etc. Under the hood, Open Data Editor will be creating Data Package descriptors for your datasets (can be explicitly done by creating a dataset), inferring metadata, and data types. When the data curation work is done a data package can be validated and published, for example, to CKAN.
Please refer to the Open Data Editor’s documentation to read about all the features.
frictionless-py
If you prefer a command-line interface, or Python, there is frictionless-py, a complete framework for managing data packages. Here are main commands available in CLI:
Please refer to the frictionless-py’s documentation to read about all the features.
frictionless-r
For the R community, there is frictionless-r package that allows managing data packages in R language. For example:
Please refer to the frictionless-r’s documentation to read about all the features.