LaCAS-IA project among the SESAME 2024 winners

26 November 2024

The LaCAS project is one of the winners of the SESAME 2024 program: this funding from the Île-de-France region will enable the acquisition of cutting-edge equipment to integrate AI into the LaCAS platform, bringing the project closer to its technical objectives and political ambitions in terms of open science and the preservation of the world's languages.
LaCAS, SESAME 2024 winner © LaCAS 2024

The SESAME scheme ("Soutien aux Équipes Scientifiques pour l'Acquisition de Moyens Expérimentaux", or Support for Scientific Teams to Acquire Experimental Resources) co-finances the scientific equipment that public research laboratories in the Paris region need to carry out large-scale projects. Out of 34 applications received, the LaCAS-IA project is one of the 12 winners.

LaCAS-IA aims to integrate AI into the LaCAS platform (itself created in part through an earlier SESAME 2020 project) in order to automate metadata harvesting and classification, train language models on rare languages, and offer advanced processing and search tools.

Technical aspects of the LaCAS-IA project

This funding will enable the acquisition of graphics processing units (GPUs) and storage arrays to increase computing and data management capacity, the project's two major technical focuses.

Optimum data storage
Process automation

LaCAS-IA project policy guidelines

Beyond strengthening the credibility of the LaCAS project in a highly competitive field (AI and NLP), these technical improvements make a decisive contribution to the project's political ambitions. Open science and the preservation of rare languages are its two essential priorities: they distinguish the project from similar scientific or technological initiatives and make it a key player in the promotion of area studies in France.

The objectives of open science

Centralized storage arrays allow data resources to be shared more easily between researchers and collaborators, improving international cooperation and the development of new research based on open corpora.

Preserving rare languages

Large language models (LLMs) are emerging as powerful catalysts for the preservation and study of rare languages. These artificial intelligence tools, capable of processing and generating human language with remarkable accuracy, offer a glimmer of hope for the world's 2,500 or so endangered languages.