Tutor Co-Pilot

An LLM-powered Tool

For Math Tutoring

As a product design intern at PLUS, I designed an LLM-powered Co-Pilot to help tutors explain math concepts more effectively and enhance engagement during tutoring sessions.

ROLE

Product Designer

TEAM

2 Product Designers

1 Product Manager

1 Designer Manager

1 ML Engineer

1 Full-Stack Developer

DURATION

Feb - July 2024

(6 Months)

WHAT I DID

Product Scoping

Prompt Engineering

GPT Co-Design

Parallel Prototyping

Usability Testing

Model Iteration Doc

Overview

WHAT IS PLUS?

An AI-Driven Online Math Learning platform.

PLUS, where I interned as a product designer, is a math tutoring platform that connects human tutors with AI-powered software to enhance learning outcomes for middle school students from historically underserved communities. The platform supports over 3,000 students and 500 tutors, completing more than 90,000 hours of tutoring each month.

CONTEXT

Racing against the clock

At PLUS, tutors conduct 30-minute math tutoring sessions with groups of 5 students, giving each student roughly 6 minutes of individual attention. This limited time makes it challenging for tutors to fully address math problems, resulting in rushed explanations or going over time.

Under the age of AI

Following the wave of AI, the product team wants to design an LLM-powered solution to help tutors with this issue.

Final Design

MY DESIGN

Conversational AI Tutor Co-Pilot.

I led the design of an LLM-powered Co-pilot to help tutors clearly explain math problems, suggest words of encouragement, and ask strategic questions, enhancing time-efficient and engaging tutoring sessions.

001 Start By Inputting A Math Problem

002

Polish the Model’s Output with Follow-up Suggestions

003

Instant Feedback System For Effective Model Iteration

IMPACT

Elevating tutoring experiences for all stakeholders!

100+

Monthly Active Users Among 500+ Tutors Across The Platform

FOR STUDENTS

+38%

Increase In Students' Session Engagement

FOR TUTORS

20%

Decrease In Time Used To Explain Math Concepts

Tutors' self reported survey data after one month of Co-Pilot usage.

Scoping

RESEARCH

How did we navigate through ambiguity?

I began our research by reviewing 12 recordings of current tutoring sessions. Drawing insights and questions from this, I facilitated 2 focus group interviews with tutor leads and supervisors, mapping their pain points and hidden needs into a tutor journey map.

Goal

What does a normal tutoring session look like?

What are the relationship and interaction patterns between students and tutors?

Goal

What are some challenges tutors are currently facing and why?

Which part of the tutoring process needs the most support?

Goal

What does a normal tutoring session look like?

What are the relationship and interaction patterns between students and tutors?

Goal

What does a normal tutoring session look like?

What are the relationship and interaction patterns between students and tutors?

Goal

What are some challenges tutors are currently facing and why?

Which part of the tutoring process needs the most support?

Based on the research, I outlined the actions tutors take and the challenges they encounter before, during, and after a session on a tutor journey map to provide a holistic overview before focusing on a specific area.

Tutor Journey Map

Centering our product during tutoring sessions

To identify the stage of the tutoring journey with the greatest opportunity, I mapped the current support PLUS provides to tutors at each stage and found that the "during tutoring" stage has the least support.

PROBLEM STATEMENT

How might LLMs help PLUS tutors effectively and engagingly teach math concepts by providing in-session support that addresses their most critical needs?

GenAI Ideation

IDEATION

200 ideas under 2 hours through careful prompting.

Our design manager challenged us to generate high-quality ideas within 2 days. Faced with detailed needs from our research, we experienced design fixation using conventional methods like Crazy 8s and Creative Matrix.

To break out of the fixation, I proposed that we take on the approach to leverage generative AI for ideation, meticulously refining prompts through simultaneous input-output evaluation.

Raw vs. Cooked, what's the difference?

Unpolished

Overlong

Not LLM-Powered

Vague

Irrelevant

Detailed

Actionable

Relevant

LLM-powered

Impactful

How did we cook up the input for better output?

My manager's take on our design process.

(Research) Tutor Co-pilot _ Card 586 _ T

Screenshot 2024-06-02 at 10.47.33 PM.png

Prioritization

PRIORITIZATION

Picking the final design directions through data.

After sorting through 200 ideas using affinity diagramming and a gut check, we landed on 9 promising design directions. To refine these further, I created a survey for tutors to assess the relevance and helpfulness of the ideas and landed on the final 2 design directions.

Survey with 9 scenarios present in a Problem-Solution-Resolution format

Scatter plot with 9 ideas based on perceived relevance and helpfulness

Socratic Method Assistant (S6)

A chat-based tool that uses LLM to pose strategic questions that lead students to discover math concepts independently.

Step-By-Step Guide (S4)

An LLM tool that identifies a problem guide and generates a step-by-step guide with tips for tutors on how to present each step.

Landing on the final 2 design directions with high scores in both criteria

Rapid Prototyping

RAPID PROTOTYPING

Co-design GPT with stakeholders for ideal model output.

Now that we have two design directions, we face two challenges attempting to explore the desired model output.

CHALLENGE 01

We don't know what we don't know.

Gathering feedback for a futuristic experience that tutors have never encountered makes it likely to be unreliable to ask them to imagine and describe their desired model output.

CHALLENGE 02

High cost.

Building a functional prototype for users to envision and suggest improvements intensive in both resource and time. It may also diverge from what users actually want despite the effort.

Inspired by the recent surge of AI tools, the design team thought about a solution that can perfectly solve the above two challenges - co-designing the GPT in real time. I facilitated a total of 5 individual workshops with our end users, ranging from tutors, tutor leads, and tutor supervisors.

INSIGHTS

Initial shape of the ideal model output.

01

Fusing The Two Models

Switching between the two models can be time-consuming, given the limited time tutors have with each student. Combining the two models into one will likely to reduce unnecessary hustle.

02

Words Of Encouragement

In addition to the solution guide and leading questions for each step, including words of encouragement that tutors can directly use with students would be beneficial for extra motivation.

03

Table Format Output

Merging the two models with words of encouragement may make paragraph-based output overwhelming and less scannable. Presenting the information in a table format will enhance readability.

04

Emojis For Emotional Touch

Emojis are added after each word of encouragement because they can evoke positive emotions for tutors and students. They can also making messages more engaging and relatable.

Iterations

ITERATIONS

Spearheading the model feedback cycle.

To ensure continuous model improvement, I facilitated communication between tutor supervisors and machine learning engineers by 1). outlining specific feedback gathering requirements for supervisors and 2). creating two rounds of iteration suggestion documents for engineers.

01

Crafted Feedback Requirement Guide For End Users

Screenshot 2024-07-04 at 11.46.00 PM.png

I prepared 8 pairs of input-output and asked the end users to comment with the following example guided questions "What’s working? 👍 What’s not working 👎 and how can we make it better?"

02

Translating Feedback Into Actionable Iteration Suggestions

I gathered feedback from multiple stakeholders, identified common themes, and transformed them into actionable iteration suggestions for machine learning engineers, following a 'good example' and 'bad example' format after each suggestion.

Reflection

REFLECTION

My learnings designing AI products.

01

Designing AI products is an adventure.

I've discovered that creating AI solutions involves diverse modalities and skill sets compared to conventional product design. For instance, GPT co-design follows an iterative process of rapid prototyping, with real-time feedback and iteration vastly improving efficiency. Initially, this unfamiliar design approach seemed daunting, but exploring these new modalities has been incredibly rewarding. It's made me a more adaptive, well-rounded, and robust designer.

02

Speak up, sync up.

Due to the innovative nature of the design process for this particular product, I paid extra attention to communication with stakeholders. I sent frequent coordination messages to provide progress updates, set clear expectations about how they could contribute at each stage, and explained in detail the rationale behind each design decision. This proactive communication helped build trust and kept the project moving smoothly despite the ambiguity of the project space.

Next Project 🌱 👉

Anxiety? No More.

Led a 0-1 game product to turn work-induced anxiety into intrinsic motivation leveraging persuasive design techniques, resulting in an 88% perceived usefulness.