ENGAGE Project Information

Author: Dayna Jennings
Acronym: ENGAGE
Title: An Infrastructure for Open, Linked Governmental Data Provision towards Research Communities and Citizens
Website: http://www.engage-project.eu
Platform: http://www.engagedata.eu

Project participants
Research Infrastructures
Contract no: RI-283700
Project type: CP-CSA
Start date: 01/06/2011
Duration: 36 months
Partners: 9
Framework Programme 7 (2007-2013)

NTUA (Coordinator) GR
TU-DELFT NL
MIC-GR GR
IBM-ISRAEL IL
INTRASOFT LU
STFC UK
FhG-FOKUS DE
AEGEAN GR
EUROCRIS NL

Ministries / local public agencies websites

Publicdata.eu

National Statistical Offices

Unstructured / “Semi-structured” Public data sources

ENGAGE traverses across distributed and diverse public sector information resources

ENGAGE provides a single point of access to PSI sources as well as relevant tools in order to cover the needs of researchers and citizens

- Integration of original PSI data and derived / curated datasets created, maintained and extended by users (researchers, citizens, journalists, computer specialists) in a collaborative environment. A research / data curation community platform with a focus on the SSH domain.
- The vision of the ENGAGE infrastructure is to extract, highlight and enhance the RE-USE value of PSI data.

HOW: Moving slowly from low-structured, isolated, difficult-to-find PSI data to highly structured, easy-to-link, easy-to-process datasets => Crowd-sourcing.

Unstructured / Semi-structured / Structured Public data sources

JSON

Discovery and Context Metadata

Crowdsourcing Moving from low structured, low value datasets to highly structured and / or derived datasets

ENGAGE

Low Re-Use Value / Quality structure / metadata

High Re-Use Value / Quality structure / metadata

Conversion, Data Enrichment, Metadata Enrichment, Cleansing, “Snapshots”

ENGAGE 2.0
On top of ENGAGE basic functions (catalog, search, visualizations, API)
Researchers / Citizens / Journalists:
- Extend other datasets (official or already extended / derived datasets)
  - Conversions (e.g. HTML / PDF to XLS, PDF to RDF)
  - Data Cleansing (e.g. duplicate records, empty rows, errors)
  - Metadata Enrichment (missing metadata, Linked Data Enablers!)
  - Data Enrichment (enrich datasets with more information)
  - Snapshots of real-time data (e.g. Diavgeia_decisions_10_2012_to_12_2012.xls)
  - Mash-ups / Interlinking (e.g. combine election results with UV radiation levels!)
- View the version tree of official and derived datasets (a clean solution that makes contributions / versions easy to understand and manage)
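The Data Cleansing item above (duplicate records, empty rows) can be illustrated with a minimal Python sketch. It is not ENGAGE's own implementation; the function name `cleanse_rows` and the sample rows are hypothetical and serve only to show the idea.

```python
import csv
import io


def cleanse_rows(rows):
    """Drop empty rows and exact duplicates, keeping the first occurrence in order."""
    seen = set()
    cleaned = []
    for row in rows:
        # Trim stray whitespace so " 45" and "45" count as the same value.
        normalized = tuple(cell.strip() for cell in row)
        if not any(normalized):
            continue  # every cell is blank: an empty row
        if normalized in seen:
            continue  # an identical row was already kept
        seen.add(normalized)
        cleaned.append(list(normalized))
    return cleaned


# Hypothetical sample data: one empty row and one duplicate record.
raw = "region,votes\nAttica,120\n,\nAttica,120\nCrete, 45\n"
rows = list(csv.reader(io.StringIO(raw)))
cleaned = cleanse_rows(rows)
```

The header row is treated like any other row here; a real cleansing step would probably handle it separately and also report what was removed.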

ENGAGE 2.0
Researchers / Citizens / Journalists:
- Data Requests
  - Looking for a dataset (e.g. I can’t find it elsewhere. Does it exist?)
  - Looking for a curation / conversion / enrichment (e.g. I am looking for the election results in Greece in XLS.)
  - Looking for data verification (e.g. Do you think this dataset is valid?)
  - Freedom of Information Requests
- Integration of tools
  - Google Refine
  - ScraperWiki
  - Visualizations

ENGAGE 2.0
Data Providers:
- Maintainers of Official Datasets
- Work as a group
- Bring the community which works on their data closer to them / direct communication
- See and take advantage of ENGAGE Data Curation Community work (e.g. cleansing, better formats)
- Easy to see / gather all the applications that are based on their official datasets.
- See the impact of their datasets.
- Understand which datasets have RE-USE value for users.
- Community help in the process of digitalization and opening of current or older Public Data (history dimension)

Search for a dataset...

...use your own language

Check dataset information...

...and download it

Faceted search available...

...with several filters

Extend the datasets...

...in several ways...

...and keep the provenance information
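One way to picture how provenance could be kept when a dataset is extended is a simple parent-linked version tree. The sketch below is an assumption for illustration only: the class `DatasetVersion`, its fields and the file names are hypothetical and do not describe the platform's actual data model.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass(eq=False)
class DatasetVersion:
    name: str
    operation: str  # e.g. "official", "conversion", "cleansing"
    parent: Optional["DatasetVersion"] = field(default=None, repr=False)
    children: List["DatasetVersion"] = field(default_factory=list)

    def derive(self, name, operation):
        """Register a new dataset derived from this one and return it."""
        child = DatasetVersion(name, operation, parent=self)
        self.children.append(child)
        return child

    def provenance(self):
        """Return the chain of dataset names from the official source to this version."""
        chain = []
        node = self
        while node is not None:
            chain.append(node.name)
            node = node.parent
        return list(reversed(chain))


# Hypothetical example: an official PDF, converted to XLS, then cleansed.
official = DatasetVersion("elections.pdf", "official")
converted = official.derive("elections.xls", "conversion")
cleaned = converted.derive("elections_clean.xls", "cleansing")
lineage = cleaned.provenance()
```

Storing only the parent link is enough to reconstruct the full lineage of any derived dataset, while the `children` list supports drawing the version tree from the official dataset downwards.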

www.engagedata.eu

Open Refine

Describe the metadata...

Join the community...

...and create your groups

Rate the datasets...

...and share your thoughts

Find out about Open Data sites...

...per country

...or other criteria

Learn more about...

...ENGAGE, the ENGAGE API and Data Curation Methods

Value Proposition through individual tools
- Search in diverse and dispersed data sources in the EU supported by ENGAGE
- Transform your datasets while keeping the valuable information, using the ENGAGE external tools (Open Refine, ScraperWiki etc.)
- See your results through visualisation tools
- Structure your data according to your needs: control all the levels of your dataset (data, metadata, format)
- Refine existing datasets by metadata enrichment

Value Proposition through collaboration
- Create your community(ies) with members who share your interests
- Each community will be able to increase the value of its datasets by applying its own perspective, based on its unique needs
- Upload your work and share it with your community
- Find other datasets, valuable for your work, uploaded by your community (Collaborate / Exchange / Ask / Provide)
- Combine their results with yours to make new datasets

Gateways and integrated tools

ScraperWiki API

CKAN API

Open Refine
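As an illustration of what a gateway to a CKAN catalogue might send, the sketch below builds a request URL for CKAN's `package_search` action with the standard `q`, `rows` and `fq` parameters. The helper name `ckan_search_url`, the host `ckan.example.org` and the filter values are assumptions; the slides do not say which CKAN instances or queries ENGAGE uses.

```python
from urllib.parse import urlencode


def ckan_search_url(base_url, query, rows=10, filters=None):
    """Build a CKAN Action API package_search URL (no network call is made)."""
    params = {"q": query, "rows": rows}
    if filters:
        # fq takes Solr-style field:value filters separated by spaces.
        params["fq"] = " ".join(f"{key}:{value}" for key, value in filters.items())
    return f"{base_url.rstrip('/')}/api/3/action/package_search?{urlencode(params)}"


# Placeholder host and filter, chosen only for the example.
url = ckan_search_url(
    "https://ckan.example.org/",
    "election results",
    rows=5,
    filters={"res_format": "XLS"},
)
```

Building the URL separately from fetching it keeps the sketch free of network access; the resulting string can be passed to any HTTP client, and the JSON response carries the matching datasets under `result.results`.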

User Interface

Django Framework

HTML / jQuery

ENGAGE Core Components

Python / Django Framework

Elasticsearch

PostgreSQL

Heroku

Storage Components

Virtuoso

PostgreSQL

Django Wiki

Translate

Apache Solr

Amazon S3

CERIF