60 Generative AI Projects for Your Resume
Boost your resume with these amazing Generative AI project ideas, each designed to provide practical experience and highlight your skills with the latest technologies.
Here's a breakdown of each project, relevant tutorials, and code to help you get started and the skills you'll develop.
Develop a system that answers questions based on visual input using the IDEFICS 9B model. It involves managing visual data and answering questions based on the content of an image.
Visual Question Answering (VQA), multimodal models, image understanding.
Create a voice assistant that understands voice and visual inputs using Llava and Whisper. Combines voice recognition, natural language processing, and visual understanding.
Multimodal AI, voice recognition, natural language processing, assistant applications.
Build a model specialized for optical character recognition and visual question answering using the Qwen2-VL model. Requires understanding of text extraction from images and answering questions about visual content.
OCR, VQA, multimodal models, image and text processing.
Combines multimodal models with Retrieval Augmented Generation to answer questions based on images, utilizing Qwen-2 and ColPali. It involves not only processing images but also integrating them with a retrieval system.
Multimodal RAG, image and text processing, retrieval systems.
Focused on video understanding, allowing users to chat, search, and summarize video content. Involves complex processing tasks for video understanding, summarization, and searching.
Video processing, summarization, search, multimodal models.
Build a multimodal Retrieval-Augmented Generation (RAG) pipeline using LangChain and the Unstructured library to query complex PDFs containing various data types, leveraging LLMs like GPT-4 with vision.
Multimodal AI, data extraction.
Creates an ATS that leverages multimodal models to process resume content, including images and text. This is a full application for understanding and processing resume content for tracking.
Multimodal AI, resume processing, ATS development.
A RAG pipeline for real-time applications using MongoDB and Pinecone. Includes building a real time pipeline.
RAG, real-time processing, MongoDB, Pinecone.
Build an intelligent multi-agent system that transforms the way students manage their academic life using LangGraph's workflow framework.
Multi-agent systems, workflow orchestration, personalized academic support.
Develop an AI agent to assist with legal clause analysis and management.
Legal AI, text analysis, clause management.
Create an AI agent for content analysis and intelligence gathering.
Content analysis, intelligence gathering, NLP techniques.
Build a FAQ bot to assist with EU Green Compliance queries.
Compliance assistance, FAQ bots, question answering systems.
Develop an AI shopping assistant to help users find and compare products.
Shopping assistance, product comparison, AI recommendations.
Create an AI agent for managing and responding to weather disasters.
Disaster management, weather forecasting, emergency response.
Develop an AI-powered mentor designed to simplify and support your journey in Generative AI learning, Resume preparation, Interview assistant and job hunting.
Hackathon preparation, career assistance, AI guidance.
Create an AI agent to provide insights and analysis using LangGraph. AInsight automatically collects, processes, and summarizes AI/ML news for general audiences.
Data analysis, insights generation, LangGraph framework.
Develop a swarm of AI agents to assist with blog writing and content creation.
Content creation, blog writing, swarm intelligence.
Build an AI agent to generate business-themed memes for marketing and social media.
Meme generation, marketing assistance, AI creativity.