Mark Sui

Professional Work

Research, engineering, and technical work with measurable range.

My experience spans AI evaluation, embedded sensing systems, digital design, and applied machine learning. Across internships, fellowships, and academic builds, the common thread is structured technical work: turning ambiguous problems into systems, experiments, and results that can actually be inspected.

AI Evaluation Adversarial prompting, safety analysis, QA evaluation, and reporting.
Embedded Systems Radar sensing, sensor integration, calibration, PCB work, and prototyping.
Digital Design RTL, FPGA systems, architecture, and hardware-oriented verification.
Applied ML Computer vision, climate emulation, reinforcement learning, and data analysis.

Research and Industry

The strongest common thread across these roles is not just technical breadth, but structured problem solving: define the system, identify where it breaks, make the failure measurable, and then iterate until the result is stable enough to trust.

Research Fellowship
Jun 2025 - Present
Remote

Research Fellow

Handshake

Scope. Selected for the MOVE AI program to support safety and reliability research for large language models, with work centered on adversarial prompting, evaluation design, and structured review of failure behavior in high-risk scenarios.

Bottleneck. The hard part was that many failures did not appear on short, isolated prompts. Models often looked reasonable at first glance but became inconsistent over longer chains of reasoning, under adversarial reframing, or when the prompt mixed benign context with unsafe intent.

How I approached it. I organized prompts into scenario families, varied wording and escalation patterns, and evaluated not just final answers but whether the model preserved context, grounded facts correctly, and maintained appropriate mitigation behavior as the conversation unfolded.

  • Designed adversarial prompts and jailbreak strategies to stress-test agentic model behavior in biosafety and information-hazard scenarios, with emphasis on repeatable rather than one-off failures.
  • Conducted systematic QA in Project Castor, reviewing contextual accuracy, factual grounding, refusal behavior, and mitigation quality for harmful or high-risk completions.
  • Compared failure patterns across prompt variations to isolate where the model's reasoning diverged, where policy behavior weakened, and which scenarios were most likely to produce unstable responses.
  • Worked with a cross-disciplinary team to translate observations into concrete vulnerability reports and recommendations for safer model behavior.
Python Prompt Engineering LLM Evaluation Safety Analysis
Embedded Systems Internship
Jun 2024 - Dec 2024
Hybrid

Embedded Systems Intern

Shandong Haike Information Technology

Scope. Contributed to a radar-driven IoT monitoring system designed for elderly health and activity sensing without cameras, helping connect sensing hardware, validation workflows, and real-environment testing.

Bottleneck. The core challenge was that radar signals were sensitive to distance, body orientation, room geometry, and reflective clutter. Respiration and fine motion were especially easy to lose once the environment became less controlled.

How I approached it. I treated the system like a sensing experiment rather than a one-time demo: set up repeatable test conditions, compared logs across placements and environments, and traced instability back to specific physical conditions instead of treating missed detections as random noise.

  • Integrated mmWave radar modules to detect movement patterns, restroom usage, and respiration-related signals in privacy-sensitive environments where cameras were not acceptable.
  • Calibrated sensing performance under realistic household conditions, testing how results changed with mounting position, target distance, posture, and environmental interference.
  • Analyzed logged outputs to identify stability limits, weak-signal cases, and false-detection conditions, then fed those observations back into system tuning and reporting.
  • Produced weekly technical summaries that clarified the trade-offs among detection quality, deployment practicality, hardware cost, and privacy requirements.
ESP32 mmWave Radar Python Data Analysis IoT Systems
Software Internship
Jun 2023 - Sep 2023
Qingdao, China

Software Development Intern

Liqun Group

Scope. Helped build a WeChat-based social product that combined messaging, secure payment, and community features, with work spanning front-end flow, API integration, matching logic, and basic performance tuning.

Bottleneck. The main product issues were friction in the payment flow, match quality that felt inconsistent to users, and response time slowdowns once more ranking logic was layered into the experience.

How I approached it. I treated those as connected product-engineering problems: tighten the interaction path, make backend behavior more reliable, and tune the matching logic using measurable feedback instead of intuition alone.

  • Supported front-end interaction redesign and implemented secure payment API integration so the core user flow from browsing to transaction felt more direct and stable.
  • Optimized friend-matching similarity weights against profile attributes and observed behavior, improving match accuracy by about 20%.
  • Ran performance tests to identify slow points in the request path, then tuned the implementation to reduce response latency by roughly 15%.
  • Worked across interface and backend concerns instead of treating them separately, which helped align user experience improvements with system-level changes.
WeChat Mini Program JavaScript MySQL Product Engineering

Selected Technical Work

These projects matter less as course titles and more as evidence of how I work: identify the limiting factor, choose the right level of abstraction, and then build or test until the system becomes explainable.

Embedded Hardware

Indoor Environmental Monitoring System

UC San Diego · Spring 2025 · La Jolla, California

Designed an ESP32-based monitoring system for CO₂, PM2.5, VOCs, temperature, humidity, and noise, integrating BME680, SCD4x, PMSA003I, and ICS-43434 sensors on a custom PCB with TFT display, data logging, and power management.

Bottleneck and resolution. The system challenge was not just reading sensors, but coordinating devices with different protocols, sample rates, and electrical behavior in a way that stayed reliable over longer runtimes. I addressed that by separating acquisition, visualization, and logging into cleaner firmware stages, handling bus coordination explicitly, and calibrating the full stack as a system rather than sensor by sensor.

ESP32 PCB Design I²C / I²S / SPI Sensor Systems
Machine Learning

Deep Learning for Climate Emulation

UC San Diego · Spring 2025 · Kaggle Competition

Built CNN, Vision Transformer, and MLP models for climate-field emulation on CMIP6 SSP data and finished Top 10 in the competition at #9 / 83 with a fast and accurate MLP approach.

Bottleneck and resolution. A key challenge was the trade-off between model complexity and competition practicality: larger models could fit structure well but were slower, harder to tune, and more prone to unstable validation behavior. I benchmarked architectures under the same preprocessing and evaluation pipeline, then selected the simpler MLP because it gave the strongest speed-to-accuracy trade-off rather than just the most sophisticated architecture on paper.

Python PyTorch Vision Transformer Kaggle
IoT + ML

IoT Smart Agriculture Project

National University of Singapore · Fall 2024

Developed a fruit-quality assessment prototype using ESP32, camera, and audio input, then combined CNN and LSTM models to estimate sweetness through non-destructive sensing.

Bottleneck and resolution. The difficult part was that audio and visual signals were noisy, weakly aligned, and constrained by embedded hardware limits. I improved robustness by structuring the collection pipeline more carefully, treating sensing and model input quality as part of the problem, and combining temporal audio cues with image-based features instead of relying on either modality alone.

ESP32-CAM Python ResNet LSTM
Digital Design

FPGA-Based Viterbi Decoder

Advanced Digital Design · UC San Diego

Designed a convolutional encoder and FPGA-based Viterbi decoder in SystemVerilog, implementing BMC, ACS, and Traceback Unit components for accurate signal reconstruction.

Bottleneck and resolution. The main bottleneck was architectural complexity: branch-metric, path-cost, and traceback logic all needed to agree cycle by cycle without turning the design into something impossible to verify. I decomposed the decoder into smaller modules, verified them against controlled error cases, and used module boundaries to keep timing and correctness issues traceable.

SystemVerilog FPGA Digital Design
Security Research

Cryptography Seminar and Research

UC San Diego · 2024 · La Jolla, California

Studied lattice-based cryptography, delivered a one-hour seminar on Babai's Algorithm for the Closest Vector Problem, and analyzed NTRU signature schemes against lattice-reduction attacks.

Bottleneck and resolution. The challenge here was translating abstract lattice geometry into something both mathematically rigorous and implementable enough to explain clearly. I handled that by working from simplified examples upward, building intuition with smaller computational cases, and then connecting those examples back to the full security argument.

Python SageMath Cryptography
Reinforcement Learning

Blackjack AI

UC San Diego · Spring 2025

Implemented Monte Carlo, temporal-difference, and Q-learning agents, simulated more than one million games, and built a custom visualization workflow to compare policy convergence.

Bottleneck and resolution. Reinforcement-learning results were initially noisy because variance dominated short training runs and it was hard to tell whether changes were real improvements or sampling artifacts. I pushed the simulation scale, standardized comparisons across algorithms, and visualized convergence so model behavior could be judged by trend rather than anecdote.

Python NumPy RL Algorithms Simulation
Search and Planning

Monte Carlo Tree Search Gomoku AI

UC San Diego · Spring 2025

Built a full MCTS pipeline with heuristic pruning, tuning search from 1000 to 6000 iterations and outperforming random-play baselines by a wide margin.

Bottleneck and resolution. The core difficulty was search explosion: naïve exploration wasted too much computation on obviously weak branches. I added heuristic pruning and tuned iteration budgets so the search spent more time on plausible continuations, which made the agent meaningfully stronger without requiring an impractical compute budget.

Python Game AI MCTS
Applied Data Science

Weather Forecasting and Air Quality Prediction

Data Analysis and Inference · UC San Diego

Built Random Forest and Polynomial Regression models for precipitation and AQI prediction, reaching R² = 93.2% and MSE = 1.41 for AQI with the Random Forest model.

Bottleneck and resolution. The main challenge was not model selection alone, but making heterogeneous environmental data trustworthy enough to learn from. I cleaned and integrated the datasets, compared model families under the same target definitions, and used feature-importance analysis to understand which variables were actually driving performance.

Python Scikit-learn Environmental Data

Foundation, Service, and Leadership

Operations and Teamwork

Student Staff - UC Davis Housing and Dining Services

Supported high-demand dining operations, inventory handling, and service execution while working closely with a large team in a fast-paced environment.

Teaching and Communication

Tutor - Kumon Learning Center

Tutored 50+ students from elementary through high school, building individualized study plans and improving math performance by about 25% on average.

Community Engineering

UCSD Global TIES Program

Worked with a multidisciplinary team to prototype and document technical solutions for nonprofit partners, applying embedded systems thinking in community-facing projects.

Early Technical Leadership

Team Leader - Yorba Linda High School Robotics Club

Led robot design and programming efforts for school competitions, supported app integration with MIT App Inventor, and helped organize robotics events.

Additional Build Work

Algorithms

Undirected Graph Analysis

Implemented graph construction, BFS-based connectivity, and Dijkstra shortest paths in C++.

Software Design

Battleship Game

Built a Linux terminal C++ Battleship game with an AI opponent and OOP-driven architecture.

Data Structures

Doubly Linked List

Implemented a templated doubly linked list with forward and reverse iterators from scratch.

Data Structures

AVL Tree

Designed a self-balancing AVL tree to support efficient insertion, deletion, and search operations.

Computer Vision

Image Classifier

Built an AlexNet-based image classifier with preprocessing, augmentation, and remote monitoring integrations.