MACJa: Metadata and citations jailbreaker

This paper presents the Metadata And Citations Jailbreaker (a.k.a. MACJa - IPA /'matsja/), i.e., a method for processing the research papers available in CEUR-WS.org and stored as PDF files in order to extract relevant semantic data and publish them in a RDF triplestore according to the Semantic Publishing And Referencing (SPAR) Ontologies. In particular, the extraction of all the information needed for addressing the queries of the Semantic Publishing Challenge 2015 (task 2) is guaranteed by MACJa by using techniques based on Natural Language Processing (i.e., Combinatory Categorial Grammar, Discourse Representation Theory, Linguistic Frames), Semantic Web technologies and good Ontology Design practices (i.e., Content Analysis, Ontology Design Patterns, Discourse Referent Extraction and Linking, Topic Extraction).

Publication type: 
Contributo in volume
Author or Creator: 
Nuzzolese, Andrea Giovanni
Peroni, Silvio
Reforgiato Recupero, Diego
Publisher: 
Springer-Verlag, Berlin, DEU
Source: 
Semantic Web Evaluation Challenges, edited by Gandon, Fabien; Cabrio, Elena; Stankovic, Milan; Zimmermann, Antoine, pp. 117–128. Berlin: Springer-Verlag, 2015
Date: 
2015
Resource Identifier: 
http://www.cnr.it/prodotto/i/366580
https://dx.doi.org/10.1007/978-3-319-25518-7_10
info:doi:10.1007/978-3-319-25518-7_10
http://www.scopus.com/record/display.url?eid=2-s2.0-84951299530&origin=inward
Language: 
Eng
ISTC Author: 
Andrea Nuzzolese's picture
Real name: