
As AI chatbots become embedded into everyday workflows, their usability depends not only on speed, but on visibility, interaction patterns, and perceived response quality.
This study evaluated the usability of three major AI chatbots: ChatGPT, Microsoft Copilot, and Google Gemini.
Through controlled usability testing and interviews with 15 participants, I examined how interface design features influence:
⏱️ Task efficiency
📲 Interaction behavior
🔎 Feature discoverability
☑️ User satisfaction
This paper was written for submission to the International Journal of Recent Trends in Human Computer Interaction (IJHCI).
Most existing chatbot research focuses on:
⏱️ Task efficiency
📍 Response accuracy
📈 Performance metrics
But less research examines:
❇️ How interface design elements (icons, visibility, accessibility) influence actual user behavior and satisfaction.
Additionally, traditional usability frameworks were developed for graphical user interfaces, not conversational AI.
How do interface visibility, accessibility, and response quality influence user satisfaction and behavior across AI chatbot platforms?

15 adults (ages 21 – 65)
They had varied experience with AI chatbots.
After removing one extreme outlier, final analysis included 14 participants.
Within-subjects repeated measures design
I used Latin Square counterbalancing to prevent order effects
Each participant completed two tasks on each of the 3 chatbots:

⏱️ Task Completion Times in Seconds

📈 Descriptive Statistics Test of Completion Times in Seconds on JASP

🕹️ User Interactions with Icons vs. Keyboard & Mouse to Submit & Copy

⭐ Participants’ Average Rating for Each Chatbot

Repeated Measures ANOVA showed:
This challenges the assumption that speed = usability.

64–79% used Enter/Return keys instead of submit icon
79% manually (with keypad or mouse) copied text in ChatGPT
100% manually copied text in Copilot and Gemini
Despite visible icons, users overwhelmingly preferred keyboard and mouse shortcuts.
Users interacted with visible features slightly more often, suggesting:
🌟 Discoverability directly influences feature adoption.
Even though efficiency was similar:
This suggests:
🌟 Perceived response quality influences usability ratings more than speed alone.

Based on findings, AI chatbot interfaces should:
Avoid hover-only or nested menus for critical actions.
Keyboard shortcuts are heavily used, they should be preserved and discoverable.
Users equate better responses with better usability.
Consistent icon placement and system feedback build trust.
With a larger sample size and more complex tasks, future research could:

The Fourth Floor Website - Startup Website and Data UX/UI Research and Design, Data Analysis and Visualization