Our work

Discover our latest projects implemented using our solutions
FOXCUB DATA PLATFORM and FOXCUB AI PLATFORM,

as well as our recent collaborations.

GRAPHIA

GRAPHIA is a major European project aiming to equip the social sciences and humanities (SSH) with a knowledge graph enriched by artificial intelligence. In collaboration with OPERAS, the CNRS, GESIS, TIB, CNR, and other key infrastructures, Foxcub contributes to the development of hybrid RDF/LPG architectures, advanced AI services, and an open ecosystem for the reuse of SSH data.
A strategic project serving the social sciences and humanities

GRAPHIA is a large-scale European research project, led by a consortium of 20 partners from academia, institutional and industrial sectors, including OPERAS, CNRS, GESIS, TIB, and Foxcub. Its goal is to build the very first European knowledge graph specifically dedicated to the social sciences and humanities (SSH). This graph will make it possible to connect, enrich, and exploit today’s dispersed data, promoting the discovery, reuse, and valorisation of SSH research at the European level.

An intelligent graph based on AI and interoperability

GRAPHIA is based on a hybrid architecture combining RDF and LPG, two complementary approaches for representing complex relationships between data. This structure is strengthened by the integration of specialized language models (LLMs4SSH), capable of interacting with the graph to automatically extract knowledge from texts. Thanks to this technological combination, GRAPHIA enables fluid, multilingual, and semantically enriched exploration of data from different SSH disciplines (history, sociology, humanities, religious studies, etc.).

An interdisciplinary dynamic between infrastructures and enterprises

GRAPHIA promotes a close collaboration model between research infrastructures (ESFRI) such as EHRI, OPERAS, RESILIENCE, and GGP, and private sector actors, notably innovative SMEs. This partnership takes shape through concrete use cases, prototypes, and shared services that address both fundamental research needs and industrial applications. The project also aims to develop new innovation workflows via prototyping labs (IPLs), accelerating the sector’s digital transformation.

Foxcub’s Role

Foxcub plays a central role in GRAPHIA as the technical coordinator of the project and a key partner in overseeing activities. Alongside OPERAS, we lead the inter-WP technical coordination and contribute to the overall management of the work.

On the technical side, Foxcub designs, deploys, and industrializes the project’s SSH knowledge graph. We implement enrichment pipelines, evaluate and optimize the use of open-source technologies (RDF, LPG, graph learning), and ensure their integration in cloud-based environments. We are also involved in semantic architecture, data acquisition from heterogeneous sources, and interoperability mechanisms with other European graphs.

Finally, Foxcub contributes to the development of a large-scale SSH citation index, based on advanced AI modules for extraction, disambiguation, and enrichment of references.

A key contribution to the European data market

GRAPHIA is preparing the integration of the Social Sciences and Humanities (SSH) into future European Data Spaces. By promoting mutualization, traceability, and the openness of research data, it addresses the challenges of digital sovereignty, transparency, and responsible innovation set out in the European data strategy. The project thus helps to reposition SSH as a strategic domain for digital transformation and the competitiveness of Europe’s scientific and industrial ecosystems.

LUMEN

LUMEN is an interdisciplinary European project coordinated byOPERAS contributing to the expansion of the European Open Science Cloud (EOSC) by building a federated discovery ecosystem for platforms across mathematics, human and social sciences, Earth sciences, and molecular dynamics, based on an innovative Data Mesh architecture.

Context

LUMEN is a large-scale European project coordinated by OPERAS, aiming to transform how scientific communities discover, access, and enrich research resources. Building on the GoTriple platform (already adopted in the social sciences and humanities), LUMEN extends this logic to new scientific domains: mathematics, Earth sciences, and molecular dynamics, laying the groundwork for generalized interoperability across disciplines.

Objectives

LUMEN’s ambition is to create an interdisciplinary ecosystem of discovery platforms, fostering the cross-discovery of publications, data sets, software, projects, and researcher profiles. The objective is to enhance research practices by developing innovative services (meta-search, visualization, generative AI, semantic enrichment, etc.) that make resources more accessible, interoperable, and aligned with the FAIR principles and EOSC Exchange requirements.

Technical Approach

The project is based on a Data Mesh architecture that combines community platforms, each acting as a shared platform open to all. Each platform is governed by a data contract, guaranteeing semantic quality and responsible exposure of its resources. LUMEN also develops mutualized technical infrastructure with a shared set of transversal functionalities: search engines, discovery interfaces, FAIR management of semantic artefacts, monitoring dashboards, and catalogs of software and data services.

Foxcub’s Role

Foxcub is the technical coordinator of the LUMEN project. In this role, we ensure planning, deliverables, and overall technical consistency, in close collaboration with partners and the requirements of EOSC.

We lead the design of the Data Mesh architecture and the metamodel to facilitate interoperability between platforms. As the leader of development, we are building the FAIR platform for managing semantic artefacts, aimed at enriching data and feeding the discovery platforms with semantically aligned content.

Foxcub also contributes to the integration and improvement of open-source services and actively participates in semantic interoperability across scientific domains.

Impact

LUMEN aims to sustainably transform modes of scientific collaboration across disciplinary silos. By proposing a replicable architecture, open tools, and a model of federated governance, the project strengthens the autonomy of all communities while supporting their networking. It directly contributes to the evolution of the EOSC ecosystem by supporting concrete solutions for open science that are interoperable and sustainable.

SOUNDS OF LIFE

SOUNDS OF LIFE is a platform for depositing and annotating eco-acoustic recordings, developed by the Consortium Sounds of Life (MNHN, UMR Passages, INRAE, IRD…). Labeled by Huma-Num, it structures biodiversity sound data according to FAIR standards and publishes it in Darwin Core format compatible with GBIF.

Implemented using our FOXCUB DATA PLATFORM solution

 

A project led by an interdisciplinary consortium

The Consortium Sounds of Life brings together 12 leading French institutions involved in biodiversity research, ecoacoustics, geography, and signal processing: the Muséum national d’Histoire naturelle (MNHN), UMR Passages (CNRS), IRD, INRAE, Lab-STICC, CEFE, CESCO, GEODE, PRISM, ISYEB, PatriNat, and Huma-Num. The consortium addresses the growing need for the centralization, documentation, and valorization of acoustic recordings within a rigorous scientific framework.

A shared platform for scientific collaboration

The Sounds of Life platform offers a complete environment for depositing, organizing, and enriching sound recordings from field observations or archival collections. It helps researchers structure their campaigns within collaborative workspaces, manage access rights, document each file’s history, and produce ready-to-share data sets, enriched and reused in both scientific and conservation contexts.

Metadata structuring and FAIR interoperability

The platform is based on Darwin Core and GBIF standards and supports many scientific references (TAXREF, GeoNames, INPN, TDWG…). It allows the publication of datasets in the Darwin Core Archive (DwC-A) format, facilitating their archiving, harvesting, and reuse in FAIR infrastructures such as GBIF.

Advanced audio annotation interface

SoL includes an interactive spectral annotation interface to precisely mark vocalizations based on frequency and time criteria. Users can enrich annotations using tags, controlled vocabularies, or hypotheses, supporting reproducibility and future automatic processing. The tool is designed to adapt to diverse user profiles (bioacousticians, ecologists, data stewards, etc.).

Foxcub, a key technical partner

Foxcub leads the technical design of the platform, the development of its interfaces, the modular architecture, and the integration of FAIR standards. We also contribute to the design of the data model, import and annotation workflows, GBIF export, and the roadmap for future automatic enrichment processes (AI, classification, acoustic indexing).

ATRIUM

ATRIUM is a European project structuring the field of digital humanities, aiming to improve the quality and interoperability of metadata at the scale of major research infrastructures (DARIAH, ARIADNE, CLARIN, OPERAS). Foxcub contributes to catalog harmonization and the collaborative enrichment of data, facilitating exchanges between platforms, researchers, and service providers.
Context

ATRIUM brings together leading research infrastructures in the digital humanities (DARIAH, ARIADNE, CLARIN, OPERAS) to improve interoperability, data quality, and reuse at the European level. This structuring project aims to make data and services in the social sciences and humanities more accessible and connected across Europe.

Harmonization of metadata catalogues

Foxcub contributes to the technical alignment between major data catalogues (ARIADNE Portal, GoTriple, SSH Open Marketplace, CLARIN VLO), facilitating the synchronization and exchange of information on shared resources. Our role is to help align editorial and publication practices to ensure metadata consistency for all users.

Enrichment and alignment of vocabularies

Foxcub is involved in the standardization and harmonization of vocabularies used to describe and enrich metadata, ensuring semantic compatibility between platforms and supporting the circulation of enriched content. We ensure the integration of shared reference frameworks and the application of international standards to guarantee long-term interoperability.

Facilitating cross-community collaboration

We actively coordinate exchanges between service providers, researchers, and platforms, facilitating workshops and capacity-building to improve the quality and continuity of services and data. Foxcub also supports training and dissemination of best practices to foster digital transformation in the sector.

GOTRIPLE

GOTRIPLE is an innovative multilingual discovery platform for the social sciences and humanities (SSH). It is the main outcome of the European research project TRIPLE (Transforming Research through Innovative Practices for Linked Intergraduate Exploration).

Implemented using our FOXCUB AI PLATFORM solution
Context

Since 2019, 22 partners from 15 countries have been developing the multilingual discovery service GoTriple, which supports 11 languages (German, French, Slovene, Ukrainian, Spanish, Portuguese, Croatian, Polish, English, Italian, Greek) and 27 disciplines in the social sciences and humanities (SSH) domain.

Objectives

GoTriple provides one of the main access points to discover and reuse research artefacts across a wide range of disciplines in the SSH domain: publications and research data, project descriptions, and researcher profiles are automatically imported from aggregators and source providers.

The GoTriple platform

As part of the transformation process, the GoTriple platform performs a series of data cleaning, normalization, and enrichment procedures, to manage metadata heterogeneity, enhance multilingualism, and improve content discovery. It also stores and indexes enriched metadata records, making them accessible via the GoTriple search engine.

Our contribution

As part of this major European project, Foxcub was in charge of AI-based enrichments (Machine Learning / Deep Learning) to enable the automatic categorization of disciplines at the article level and semantic enrichments supporting keyword annotation.

User satisfaction with the results obtained is perceived as « positive » to « very positive ».
An ambitious 3-year project

Since its launch in 2023, GoTriple has enabled companies and public institutions to use and reuse scientific results for their own purposes, particularly to foster economic, societal, and scientific innovation.

MATILDA

MATILDA is a project to develop a bibliographic and metric tool for open science, aiming to address the overlooked status of reference and citation data within the open science landscape. It is coordinated by the CNRS, the Centre de sociologie de l’innovation (I3, UMR9217) and Huma-Num (UMS 3598).

Implemented using our FOXCUB DATA PLATFORM solution
Context

The open access movement has long emphasized the availability and reusability of academic texts as a key objective in knowledge dissemination, while often giving little attention to the question of metadata. The lack of metadata in major open access declarations has led to a paradoxical situation. While publications have become accessible and reusable, their content has remained difficult to find without relying on expensive, proprietary bibliometric and bibliographic tools

Objectives

The goal of the MATILDA project is to broaden access to, consultation of, and reuse of reference/citation data extracted from academic text, offering researchers and academic communities better control over textual information and metadata, while ensuring fair treatment of all sources.

Matilda is a search engine and a knowledge graph

At the heart of the open citation movement

Matilda is a shared open science tool designed for all scientific communities, based on metadata from scholarly texts, especially citation and reference data.

Matilda deduplicates publications across more than 100 million scholarly works
Matilda’s citation graph also integrates references extracted from publication PDFs
The first version of Matilda was launched in September 2023

Would you like to know more? We’re available to discuss it with you.