The goal of initiative use AI in order to enhance Hoover Institution's mission, support its critical scholarship, and assist the daily operations of the Hoover Library & Archives. With Hoover AI2, the Institution can bridge the gap between artificial intelligence research in public policy and hands-on artificial intelligence projects implemented by Stanford's engineering talents.
Data processing software
Goal: Assistance with a variety of data processing tasks to help standardize and accelerate metadata processing and record keeping.
Recommendation system for archival collections based on content-based filtering algorithm
Goal: Development of a prototype of a recommendation system for the Hoover Library and Archives based on semantic analysis of metadata, providing researchers with customized recommendations for collections.
Recommendation system for archival collections based on user-based collaborative filtering algorithm
Goal: Development of a prototype of a recommendation system for the Hoover Library and Archives based on the user data (2000-present), providing researchers with customized recommendations for collections.
Experimentation with AI image processing tools
Goal: Development and/or experimentation with image processing tools. It may include image manipulation (cropping, rotation etc.), colorization of archival black & white images, image classification tasks, and image captioning tasks (i.e. caption generation describing the contents of an image). Assistance of the Hoover Digital Services & Systems with different projects related to named-entity recognition, OCR/HCR technologies, image analysis, using natural language processing and image processing software.
Exploration of textual data
Goal: This project is designed to create the system that relies on the state-of-the-art natural language processing methodology that would generate speeches in certain style where no one can tell the difference between human written speeches and the computer generated.
a) a prototype of a chatbot (Dialogflow, Amazon Alexa) including recommendation system;
b) experimentation with GPT-3 language model: fine-tuning, generation of creative content (political speeches, stories, ideas, questions), text classification (sentiment analysis of large corpus of text with respect to people, events, organizations), information retrieval (question answering), automatic text summarization;
c) experimentation with GPT-2 language model: the model fine-tuned on the "Firing Line" collection was capable of generating dialogues between William Buckley and his guests based on the clue received.