Data for people,
not just models.

Statistics and Data Science student at Universitas Negeri Semarang. I build NLP systems, forecast electricity loads, and help design policy through data, with Scopus and SINTA publications and national competition wins to show for it.

5+ Research Papers
4 National Awards
2 Scopus & SINTA Journals
2+ Years of Experience

Open to collaboration

Galih Kusuma Wijaya
GK

Machine Learning Engineer Intern

Flyrank AI

Flyrank AI is an SEO intelligence platform powered by machine learning. I joined as the ML engineer responsible for turning messy real-world SEO data into actionable signals. The work spans the full pipeline: wrangling data with complex join constraints, anonymized rows, and nested JSON structures; building embedding and clustering systems that group queries and pages by semantic meaning to surface content gaps and cannibalization patterns; modeling search intent beyond generic buckets and scoring opportunity size against existing data; and translating model output into specific, non-obvious recommendations that connect directly to client action.

Leader

Data Champion Society UNNES

When I took over, I rewrote the vision and mission from scratch, restructured all divisions, and replaced the existing program with a 9-month project-based capstone where members produce publishable outputs every month, supported by 3 sessions per month with practitioners, lecturers, and partner communities. I also designed and led the internal Satria Data selection pipeline to surface the campus's top data talent.

Research Assistant

Universitas Negeri Semarang

Embedded in an active publication pipeline, I co-authored a Scopus-indexed paper on optimized LSTM and BiLSTM for electricity load forecasting, developed the full NLP and K-Means methodology for a SINTA 4 journal on food program prioritization, and helped co-initiate the One Action One Mangrove program that secured Rp25,000,000 in government funding from Kemendiktisaintek.

S1 Statistics and Data Science

Universitas Negeri Semarang

Active presenter at national and international seminars including PRISMA UNNES, EBIC UNPAM, ICDSOS POLSTAT STIS, and GREENOVA UNNES. Research assistant on 5+ papers with 2 published in Scopus and SINTA-indexed journals.

AI Engineer Cohort

Coding Camp by DBS Foundation x Dicoding

Intensive program covering machine learning, deep learning, and reinforcement learning end-to-end. Built NutriVision as the capstone: an anchor-free food detection system using EfficientNetV2, BiFPN, and FCOS to identify fast food items and estimate nutritional content, developed in cross-path collaboration with Full Stack and Data Science cohorts.

Data Analysis Bootcamp

MySkill

Completed end-to-end data analysis projects using PostgreSQL, Python, and Looker Studio, covering sales data exploration, e-commerce EDA, and interactive KPI dashboard development.

001

Conjify: Anemia Screening via Conjunctival Imaging

Built the full AI pipeline for Conjify, a mobile-first web app for non-invasive anemia self-screening via smartphone camera. I designed and trained a dual-head EfficientNet-B0 that simultaneously classifies anemia presence and estimates hemoglobin levels quantitatively, trained on CP-AnemiC dataset (710 images) using multi-task learning with progressive unfreezing. Integrated Grad-CAM to extract spatial activation statistics for structured Llama 3.3 70B clinical interpretation in plain Indonesian. Model achieved 84% accuracy, AUC-ROC 0.902, and Hb regression MAE 1.515 g/dL. Deployed on Hugging Face Spaces via Gradio API.

Computer VisionEfficientNet-B0Multi-task LearningGrad-CAMLLMGradio
View Repository →
002

NutriVision

Built as the AI component of a cross-path capstone at Coding Camp DBS x Dicoding. I implemented EfficientNetV2 as the backbone with BiFPN for multi-scale feature fusion and FCOS as the detection head, a fully anchor-free approach using stride-based convolution instead of predefined anchor boxes. The app identifies fast food items from major brands and estimates their exact nutritional content.

Computer VisionEfficientNetV2FCOSBiFPNAnchor-Free Detection
View Prototype →
003

NALAR: Media Literacy Simulation

Most media literacy tools tell people what misinformation looks like. NALAR makes them responsible for it. Players step into the role of a news editor or social media influencer, receive an AI-generated scenario brief built from verified but incomplete materials, then make real editorial decisions: write the headline, choose the tone, draft the script, pick the thumbnail concept. Once published, a simulated social network powered by live AI calls shares, questions, mutates, and replies to the story in real time, visualized as a spreading force graph. Players can respond mid-spread with a clarification, retraction, or silence, and each choice shifts the outcome. The session ends with a full autopsy: the turning point, the media literacy principle that applied, and a responsibility score. Built on Next.js 14 App Router with Zustand for state, Supabase Postgres for persistence, and a multi-model AI layer across Groq and Gemini with Langfuse for observability.

Next.jsTypeScriptAIGroqGeminiSupabasereact-force-graph-2d
004

Personal AI Agent (WhatsApp)

Built and deployed a personal AI agent directly into my own WhatsApp, handling automated responses, task assistance, and context-aware interactions through a bot pipeline running on personal infrastructure.

AI AgentWhatsAppAutomationPython
005

PyGrind

A browser-only, 100% AI-driven coding practice platform for AI/ML and backend engineers. Every task, curriculum path, and code review is generated dynamically by an LLM (Groq or Gemini) using the user's own API key, with no backend and no database. Built an adaptive tier system where problem difficulty scales per topic, a Monaco-based practice loop with real-time AI code review gating progression, and an owner-maintained HTML handbook synced as reference material for every problem.

ReactTypeScriptMonaco EditorLLM IntegrationZustandTailwind CSS
View App →
006 1st Place · GRAVITASI Essay 2025

Accessible Campus Navigation

Most navigation systems optimize for distance. This one optimizes for people. I built the Accessibility Index for the Physically Disabled (IAT = 0.8) using fuzzy logic across four physical parameters: slope, width, surface condition, and obstacles. That score integrates directly into the A* cost function, so the algorithm finds the best trade-off between distance and real-world walkability for wheelchair users. Tested on FMIPA UNNES (54 nodes, 27 edges), the accessible route was 66% longer than the shortest path while scoring IAT = 0.8. The model also flags high-risk segments, making it a spatial tool for inclusive infrastructure planning.

PythonA* AlgorithmFuzzy LogicGISAccessibility
007 2nd Place · DIMAS-TI Data Mining

LexiLSTM: Campus Complaint Classifier

The real challenge was not the model, it was the absence of labeled data. I used weak supervision with a domain lexicon and fuzzy string matching to auto-generate pseudo-labels across six complaint categories from informal Indonesian social media text. I compared standard BiLSTM, BiLSTM + self-attention, and BiLSTM + multi-head attention. The key finding: added complexity does not consistently win when training labels are noisy. The hybrid approach still delivered +18% accuracy over baseline, with attention weights providing interpretable token-level insight.

NLPBiLSTMSelf-AttentionWeak SupervisionPython
008

IHSG Prediction via Public Sentiment

I built a hybrid forecasting system integrating public sentiment from Twitter/X with IHSG historical data to predict monthly stock index movement. Sentiment was classified using IndoBERT, chosen for its Indonesian-corpus pre-training, then aggregated into a monthly index and combined with price features as multivariate LSTM input. Adding sentiment reduced RMSE from 834.44 to 711.47 and MAE from 722.46 to 611.98, confirming that public sentiment carries genuine predictive signal for Indonesian market dynamics.

NLPIndoBERTLSTMFinanceSentiment Analysis
2026 Scopus

Comparative Study of LSTM-Based Models with Hyperparameter Optimization for Short-Term Electricity Load Forecasting

Iqbal Kharisudin, Insyiraah Oxaichiko Arissinta, Sabrina Aziz Aulia, Muhamad Abdul Qodir Dani, Galih Kusuma Wijaya

BAREKENG: Jurnal Ilmu Matematika dan Terapan

A comparative study of LSTM-based architectures with systematic hyperparameter optimization for short-term electricity load forecasting, evaluating which configuration best captures temporal dependencies in consumption data across varying load patterns.

DOI →
2025 SINTA 4

Regional Prioritization for Free Nutritious Food Programs through Social Data Integration and Public Sentiment Analysis Using K-Means and NLP

Ratna Nur Mustika Sanusi, Galih Kusuma Wijaya, Nur Achmey Selgi Harwanti

UJM: UNNES Journal of Mathematics

An integrated approach combining K-Means clustering on socioeconomic indicators with NLP-based sentiment analysis from social media to build a context-aware regional prioritization model for Indonesia's Free Nutritious Meal program.

DOI →
2025 Proceedings

Comparative Study of Autoencoder and LSTM-AE for Extreme Temperature Anomaly Detection in Semarang

Galih Kusuma Wijaya, Aliyya Anggraeni, Tsalisa Chulaili Sahri Nova, Muhammad Alifian Yusuf, Iqbal Kharisudin

ICDSOS - POLSTAT STIS International Conference

A comparison of standard Autoencoder and LSTM-Autoencoder for detecting extreme temperature anomalies in Semarang's historical climate data, evaluating which architecture better captures temporal dependencies in anomaly patterns.

DOI →
2025 National

Dinamika Sentimen Publik dalam Suksesi Pemerintahan Indonesia berdasarkan Analisis Data Media Sosial

Galih Kusuma Wijaya, Adelia Venie Diniar, Shata Alwan Jalaluddin, Iqbal Kharisudin

PRISMA: Seminar Nasional Matematika UNNES

A sentiment analysis of Indonesian social media across the presidential succession from Jokowi to Prabowo, mapping how public sentiment shifted across key political milestones and what it reveals about digital discourse and public trust.

Read Paper →

1st Place

GRAVITASI Essay Competition

Universitas Sumatera Utara · Oct 2025

2nd Place

DIMAS-TI Data Mining Competition

AMLI (Asosiasi MIPA LPTK Indonesia) · Nov 2025

Special Honor: Distinction

FPCI Global Impact Day 2025 Essay Competition

President University · Jul 2025

1st Place

Lomba Esai Nasional Rumah Disabilitas

Rumah Disabilitas Indonesia · Dec 2025

Hard Skills

Python SQL R Looker Scikit-learn TensorFlow

Soft Skills

Leadership Teamwork Critical Thinking Research Writing Data Storytelling

Certifications

  • Data Analyst Mastery · MySkill
  • Junior Multimedia KKNI Level II · LSP SMKN 8 Semarang
  • Introduction to Machine Learning on AWS · Amazon Web Services
  • Generative AI with LLMs · Amazon Web Services
  • Data Analysis with Python · Dicoding
  • Cybersecurity for Everyone · University of Maryland
  • Introduction to Data Analysis using Microsoft Excel · Coursera

Let's build something.

Open to research, collaboration, and full-time roles.