Projects - Taiki Papandreou

Educational Content Generation Platform

An end-to-end GenAI-powered platform that transforms documents into CEFR-leveled language learning materials with automated text simplification and exercise generation.

Python Streamlit OpenAI

Detail

Legal AI Chatbot with RAG

A RAG-based legal assistance chatbot that autonomously resolves inquiries through automated initial consultations and intelligent inquiry classification.

Python Microsoft Azure RAG Streamlit

Detail Demo

Identifying Workplace Accidents in News Articles

Developed a high-accuracy NLP pipeline using fine-tuned BERT models to automatically identify and classify workplace accidents in large-scale news corpora, achieving 0.89 F1-score for research at TNO.

Python BERT RoBERTa NLP

Detail

Jargon-Aware Medical Text Simplification

Research on automatic biomedical jargon detection and LLM-based text simplification. Evaluates cross-dataset transfer learning, establishes baselines on PLABA and MedReadMe datasets, and explores prompting strategies for medical text accessibility.

Python BERT RoBERTa LLMs

Detail GitHub

Graphair: Fair Graph Augmentation for Representation Learning

Comprehensive reproduction and extension of Graphair, a fairness-aware graph augmentation method using adversarial learning, contrastive learning, and reconstruction regularization to reduce bias while preserving informativeness in GNNs.

Python PyTorch GNNs

Detail GitHub

Medical Multimodal Segmentation Using Foundation Models

Comprehensive reproduction and extension of SegVol, a foundation model for 3D medical image segmentation. Evaluates geometric robustness to rotations and proposes novel group equivariant patch embeddings for enhanced rotation invariance.

Python Vision Transformers Medical Imaging

Detail GitHub

What Do YuE's Attention Heads Hear?

Probing musical characteristics in YuE, a generative music model. Investigates genre-specific attention patterns across Jazz, Metal, Pop, and Classical, revealing deeper layers' stronger activations and semantic-genre links, while uncovering biases toward certain prompt words.

Python Transformers Music Generation

Detail GitHub

A Dashboard for Optimizing Prompts in Large Language Models

Interactive dashboard for prompt engineering optimization in LLMs. Features dataset visualizations, prompt experimentation tools, attention score insights, and performance evaluation through confusion matrices to enhance text classification accuracy.

Python Streamlit TinyLlama

Detail GitHub

My Projects

Educational Content Generation Platform

Legal AI Chatbot with RAG

Identifying Workplace Accidents in News Articles

Jargon-Aware Medical Text Simplification

Graphair: Fair Graph Augmentation for Representation Learning

Medical Multimodal Segmentation Using Foundation Models

What Do YuE's Attention Heads Hear?

A Dashboard for Optimizing Prompts in Large Language Models