On Oct. 9, the Erasmian Language Model (ELM) was officially launched. Academic staff, support staff and students, as well as external parties such as government agencies and startups, joined the ELM development team to learn more about the generative AI model: how was it developed, what can it do and how can we use it?

The idea behind the Erasmian Language Model (ELM) originated about six months ago, says Evert Stamhuis, Professor of Law and Innovation and Academic Lead for AI Convergence at Erasmus University Rotterdam (EUR). In the minor AI and Societal Impact, students learn what AI models are, how they work, and what their potential problems and improvements are. For the students in the minor, issues became clear such as the data privacy risks of closed-source models that are linked to a central, remote server, as well as the environmental impact of processing the data these models use. "Another issue at play is the bias that generative AI models carry, such as the racist and sexist biases that can be seen in programs like ChatGPT," explained Michele Murgia, ELM project leader and coordinator of the minor.
ELM is a generative AI model developed at and rooted in EUR. Unlike other widely available AI models, ELM is software that you download onto the hard drive of the computer you use it on. Because the model runs locally rather than on a remote server, several privacy issues are avoided. The model is also kept deliberately small to limit its environmental impact. It is specifically suited to academic research and teaching: it is a truly open-source model (you have insight into both the model and the data), and it is trained in both English and Dutch to counter certain English-language biases.
The development of ELM consisted of three steps. First, the model went through LLM (Large Language Model) pre-training. This was done with the help of the University Library, using the publicly accessible repository of all master's theses and publications from EUR and Erasmus MC. At its core, ELM is thus trained on the academic research conducted at EUR. The second step is supervised fine-tuning: the program is given specific examples of, and instructions for, completing a particular request. The current version of ELM does not have a chat interface like other AI systems (e.g. ChatGPT), but it may get one in the future. As a final step, the program was trained with reinforcement learning from human feedback, so that it learns to distinguish good generated results from bad ones.
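The three stages above can be made concrete with a toy sketch. The code below is not ELM's actual implementation; it uses a trivial character-bigram "model" (all function names are hypothetical) purely to illustrate how pre-training, supervised fine-tuning and human feedback each reshape what a model generates.

```python
# Toy illustration of the three training stages -- NOT ELM's real code.
from collections import defaultdict

def pretrain(corpus):
    """Stage 1: pre-training -- learn next-character counts from raw text."""
    counts = defaultdict(lambda: defaultdict(int))
    for a, b in zip(corpus, corpus[1:]):
        counts[a][b] += 1
    return counts

def finetune(counts, examples, weight=5):
    """Stage 2: supervised fine-tuning -- overweight curated examples."""
    for ex in examples:
        for a, b in zip(ex, ex[1:]):
            counts[a][b] += weight
    return counts

def rlhf(counts, ctx, good, bad, reward=3):
    """Stage 3: human feedback -- reward the preferred continuation,
    penalise the rejected one."""
    counts[ctx][good] += reward
    counts[ctx][bad] = max(0, counts[ctx][bad] - reward)
    return counts

def generate(counts, ch):
    """Greedy next-character prediction."""
    nxt = counts[ch]
    return max(nxt, key=nxt.get) if nxt else ""
```

In the toy, pre-training sets the base statistics, fine-tuning overweights curated examples, and the feedback step shifts which continuation wins: the same division of labour, at vastly larger scale, as in the pipeline described above.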
Currently, there are two versions of ELM: ELM Small and the full ELM. Of these, ELM Small will continue to be developed, because the full ELM is built on Llama 2, Meta's generative AI model. Llama 2 is not fully open-source and would therefore conflict with the goal of making ELM a community-based model. ELM Small is the version intended to be downloaded onto laptops for personal use: it takes up only 1.8 GB of storage and fits on most laptop hard drives. This version has 160 million parameters and gives the user full control over modifying the program, including adding training material to improve the model, as well as supervised fine-tuning and reinforcement learning options. This is the version of ELM that students in the AI and Societal Impact minor tested, extended and improved.
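As a rough back-of-envelope check of why a 160-million-parameter model fits on an ordinary laptop, one can estimate the raw weight storage from the parameter count (the byte-per-parameter precisions below are assumptions, not official ELM figures):

```python
# Hypothetical estimate of raw weight storage for a 160M-parameter model.
# The 1.8 GB quoted for the ELM Small download presumably covers more
# than bare weights (tokenizer files, metadata, etc.).
params = 160_000_000

fp32_gb = params * 4 / 1e9  # 32-bit floats: 4 bytes per parameter
fp16_gb = params * 2 / 1e9  # 16-bit floats: 2 bytes per parameter

print(f"fp32 weights: ~{fp32_gb:.2f} GB")  # ~0.64 GB
print(f"fp16 weights: ~{fp16_gb:.2f} GB")  # ~0.32 GB
```

Either way, the weights alone stay well under the 1.8 GB of the full package, which is what makes a locally runnable model on everyday hardware feasible.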
"We want this model to be efficient and serve the goals we have in mind at EUR. For that reason, we deliberately chose to keep it a smaller model," explained João Goncalves, academic lead of ELM and lecturer in the AI and Societal Impact minor. Both Michele and João emphasized that ELM is not a traditional model, but a community-based model. The end users for whom it was designed, everyone at EUR, are also the co-creators of ELM. Thus, everyone who uses ELM can directly influence the design of the model for their specific academic needs. "The success of the model depends on you, the users, becoming co-creators," says João.
By the end of the event, it was clear that many students and staff members were enthusiastic about the project, judging by the relevant questions they asked. Topics included departmental biases within research, whether the program can identify the source of a piece of information (which is not feasible given the nature of an LLM), and whether the program can be aware of its own limitations when answering questions based on the material it was trained on.
The next step for ELM is further co-creation. The call for anyone interested in using and developing ELM is open!
To assist in the development of ELM, the team is looking for people interested in the project. Contributions can range from providing material to train the model with, to supplying examples and directions for supervised fine-tuning. You can also give general feedback on the model. If you are interested, please contact Michele Murgia or João Goncalves.
