Innovation & Research Projects
Click on any project to explore the details, technologies used, and access live demos or repositories.
Virtual guide for Menorca Talayótica with RAG
Multi-agent system for automated ML training
Interactive storytelling with AI assistance
Comparing RAG vs Large Context Window approaches
AI-powered fox detection in images and video
Predictive analytics for storage reliability
Excel to JSONL converter for ChatGPT fine-tuning
Enhanced virtual space experience and analytics
I'm always open to interesting collaboration opportunities. Feel free to reach out if you have an idea you'd like to discuss!
Awi is a virtual guide created to enhance the visitor experience at Menorca Talayótica, a UNESCO World Heritage candidate site. This AI-powered assistant provides instant access to information about archaeological sites, history, and visitor recommendations through a user-friendly conversational interface.
24/7 bilingual virtual guide with extensive knowledge of Menorca's archaeological heritage
Advanced RAG system for accurate information retrieval from verified sources
Voice interaction with OpenAI's TTS and STT capabilities
Responsive design for seamless use on mobile devices while exploring sites
Awi leverages a sophisticated RAG architecture built on LlamaIndex, with OpenAI embeddings for semantic search and Gemini Flash 2.0 as the core LLM. The system incorporates voice interaction through OpenAI's text-to-speech and speech-to-text APIs, creating a seamless and natural interaction experience for users exploring archaeological sites.
The knowledge base contains curated information about Menorca's prehistoric monuments, including navetas, talayots, and taulas, as well as practical visitor information. This project represents a practical application of AI to enhance cultural tourism and education.
Beyond being a technical showcase, Awi addresses a real need for accessible information about Menorca's archaeological heritage. Visitors can ask detailed questions about sites they're exploring, get historical context, and receive personalized recommendations - all through a conversational interface that feels like having a knowledgeable guide available at all times.
MIDAS is a sophisticated multi-agent system designed to automate the training of machine learning models. It leverages distributed intelligence to optimize model parameters, feature selection, and performance tuning without human intervention.
Automated model selection and hyperparameter optimization
Intelligent feature engineering and selection
Collaborative learning between multiple specialized agents
Explainable AI capabilities for understanding model decisions
MIDAS utilizes a network of specialized agents, each responsible for different aspects of the machine learning pipeline. The architecture allows for scaling and adapting to various ML tasks while maintaining high performance and accuracy.
LLM StoryTeller is an interactive web application that leverages Large Language Models (LLMs) to help users craft captivating stories effortlessly. This project demonstrates my expertise in Python, Streamlit, and working with LLMs.
Interactive story creation with AI assistance
Multiple story genres and themes to choose from
Character development and plot suggestions
Export stories in multiple formats
The application uses small language models to generate context-aware story segments based on user input. The Streamlit framework provides an intuitive interface that makes the creative process accessible to users of all skill levels.
This project implements and compares two advanced chatbot architectures applied to curriculum information retrieval. It analyzes the differences in performance, accuracy, and user experience between RAG (Retrieval-Augmented Generation) and Large Context Window approaches.
RAG Approach: LlamaIndex + Llama 3.3 70B with real-time information retrieval
Context Window Approach: Gemini Flash with complete document preloading
Intelligent Classification: BERT model for query complexity analysis
Optimized Backend: Flask-based API with session management
Uses LlamaIndex with BGE-M3 embeddings to create vector representations of CV content. The system retrieves relevant chunks based on query similarity and augments the LLM's context.
Leverages Gemini 2.0 Flash's million-token context window to load the entire CV into the initial prompt, allowing the model to have a complete view of all information.
The project reveals interesting trade-offs between approaches. RAG offers greater flexibility and can handle larger knowledge bases, while the large context window provides more consistent understanding of document relationships and faster responses for complex queries.
Fox-Detector is an AI-powered computer vision project that can detect and track foxes in images and video streams. This project showcases my expertise in computer vision and deep learning techniques.
Real-time fox detection in video streams
High-accuracy classification model
Motion tracking capabilities
Lightweight deployment options
The project uses a convolutional neural network trained on a custom dataset of fox images. The model is optimized for both accuracy and performance, allowing it to run efficiently on various hardware configurations.
Gather-Tracker is a JavaScript bot/script that enhances the Gather virtual space experience. It provides tracking and analytics capabilities for virtual events and meetings, enabling better space management and user engagement analysis.
Real-time user activity tracking
The bot integrates with Gather's API to collect positional and interaction data. It processes this information in real-time and generates insightful analytics that help event organizers optimize their virtual spaces and understand user behavior patterns.
XLSX to JSONL is a Python script that converts Excel (.xlsx) files into JSONL format, facilitating the fine-tuning of ChatGPT with structured training and validation datasets. This tool streamlines the preparation of custom training data for language models.
Simple Excel to JSONL conversion
Customizable field mapping
Automatic training/validation split
OpenAI API format compatibility
The script uses Pandas to read Excel data, allowing it to handle large datasets efficiently. It implements intelligent parsing of conversation pairs and formats them according to OpenAI's fine-tuning specifications, making it easy to prepare custom datasets for LLM training.
Hard Drive Failure Prediction is a predictive model designed to forecast hard drive failures using manufacturer data, capacity, and SMART metrics. The project aims to boost storage reliability and enable proactive maintenance before catastrophic failures occur.
Multi-feature failure prediction model
SMART data analysis and interpretation
Risk assessment and prioritization
Drive lifespan estimation
The project uses scikit-learn to implement various machine learning algorithms (Random Forest, Gradient Boosting, and Neural Networks) that analyze patterns in SMART attributes. By training on a large dataset of drives with known outcomes, the model can identify early warning signs of impending failures.