Difference between revisions of "Talk for me application 2024"

From Derek
Jump to: navigation, search
(Specific tasks)
(Weekly progress and questions)
 
(10 intermediate revisions by the same user not shown)
Line 9: Line 9:
  
 
==General project description==
 
==General project description==
The Talk For Me app provides a fully customisable yet easy to use grid to point to communicate for people who can't talk, and supports text-to-speech output and image-recognition suggested text. This project is about extending the capability further, from words to sentences, possibly involving the use of ChatGPT. You will require a Mac computer since this is an iOS app, and familiarity with Swift and SwiftUI is essential.
+
With the constant improvements in technology, there has been a clear shift in the ways that non-
 +
verbal neurodivergent individuals are able to communicate with the general population. This
 +
includes the use of tools, such as phone applications, which using text-to-speech (TTS) can take
 +
the form of an artiAicial voice for the individual. This project looks to take one such tool and
 +
improve it in a way that will create a near seamless artiAicial voice, which also beneAits those who
 +
struggle with the process of typing. These tools are more commonly known as Augmentative and
 +
Alternative Communication (AAC) tools/applications, communication devices, or “talkers”, and
 +
assist in construction of sentences/phrases through the use of linking symbols/keywords and
 +
predictive keyboards [1]. With numerous options on the market, they look to meet the
 +
accessibility needs of neurodivergent people, categorised by cognitive, physical, and speech
 +
needs. The main features expected of a AAC application is a simple layout, usually with larger
 +
buttons to press to assist those with physical needs who mays struggle to type on a keyboard, as
 +
well as predeAined options/the ability to categorise and store symbols/keywords for repeated use
 +
in communication. With the speciAic needs of individuals varying highly from person to person,
 +
there is often a required trial period to determine if the application/tool is appropriate for the
 +
needs of the individual. This can be a slow, and tedious problem, and as a result, a solution that
 +
meets all needs is desirable.
 +
Talk For Me, is an application developed by Across the Cloud Ltd., which in its current alpha
 +
version, presents the user with images/terms to select, then passing these along to a Large
 +
Language Model (LLM), which creates a sentence to be spoken via TTS. It was created by Dr
 +
Matthew Berryman, who suffered a haemorrhagic stroke that left him paralysed and unable to
 +
speak for three weeks. During this time, he was frustrated in the limited tools provided by the
 +
hospital to allow for him to communicate, sighting paper charts that did not even include things
 +
such as the television in his room. As a result, Talk for Me is an application aimed towards
 +
predominately the needs of stroke patients, with other disabilities that affect speech also in mind.
 +
The improvements we have made include User Interface (UI) improvements to better suit the
 +
application towards its neurodivergent user base’s needs, focusing on simple, easy to understand,
 +
and easy to reach buttons. We have implemented features that utilise the user’s current location,
 +
and time, to tailor the experience to the user, by making recommendations for Menu items of
 +
nearby restaurants, as well as sorting of the provided keywords to recommend those commonly
 +
selected by the user. Finally, work was completed to test the performance of a few LLMs available
 +
to the team, to determine the most suitable LLM for the application and use-case, as a focus is
 +
placed on the near seamless user experience of talking.
 +
This report goes into greater detail regarding each of the components of our improvements, with
 +
a Literature Review exploring some of the concepts we looked at working on for the project, as
 +
well as how we achieved the features we have included.
  
 
==Specific tasks==
 
==Specific tasks==
Line 36: Line 71:
  
 
===Semester 1===
 
===Semester 1===
* Start Project Work (Week 1)
+
* Project Proposal [[File:TalkForMe-ProjectPlan.pdf]]
* Proposal seminar (Week 5)
+
* Seminar [https://adelaide.zoom.us/rec/share/ZmSdxd6Drx1VFdIszz-0N71q885vowWd4ge_iUgfHhZevbzT__buvo9mldA4GLDO.AkJ5dqBwcs8UOFh0?startTime=1718062694000] password: 3LC!+USl
** [[File:141 Proposal Seminar2019.pdf]]
+
* Progress Report: [[File:TalkForMe-ProgressReport.pdf]]
* Progress report (Week 12) - only one report needed in wiki format
+
** [[File:141 Progress Report2019.pdf]]
+
  
 
===Semester 2===
 
===Semester 2===
  
* Final seminar (Week 10)
+
* Ingenuity Poster [[File:Talk For Me Ingenuity Poster.pdf]]
** [[File:141 Final Seminar2019.pdf]]
+
* Ingenuity Video [https://youtu.be/rZy3390fFwY]
* Final report (Week 11) - only one report needed in wiki format
+
* Final Report [[File:TalkForMe-FinalReport.pdf]]
** [[Final Report/Thesis 2019]]
+
* Poster (Week 12) - one poster only needed
+
** [[File:141 Poster2019.pdf]]
+
* Project exhibition 'expo'  (Week 12)
+
* CD or stick containing your whole project directories (Week 13)
+
* YouTube video (Week 13) - add the URL to this wiki
+
** https://youtu.be/FcJbR_diNMg
+
  
 
== Weekly progress and questions ==
 
== Weekly progress and questions ==
This is where you record your progress and ask questions. Make sure you update this every week.
+
Weekly progress and questions were recorded in our minutes, see below:
*[[Talk for me application 2024 weekly progress]]
+
 
 +
* Semester 1: [[File:Talk For Me - Semester 1 Minutes.zip]]
 +
* Semester 2: [[File:TalkForMe-Semester2Minutes.zip]]
  
 
==Approach and methodology==
 
==Approach and methodology==
We expect you to take a structured approach to coding.  List dot points outlining approach:
+
The approach involved a number of methods listed in our Progress and Final Reports.
 
+
Due to our NDA, strict methods are not able to be provided, but can be summarised as accessing the existing code, and modifying/adding to it to gain the desired functions/changes.
==Possible extension==
+
TBA
+
 
+
== Expectations ==
+
* To get good marks we expect you to show a logical approach to coding.
+
 
+
* In your conclusion, you need to come up with a short list of likely possibilities and a list of things you can definitely eliminate.
+
 
+
* We expect all the written work to be place on this wiki. No paper reports are to be handed up. 
+
 
+
* It is expected that you fill out a short progress report on the wiki each week, every Friday evening, to briefly state what you did that week and what the goals are for the following week.
+
 
+
* It is important to regularly see your main supervisors.  Don't let more than 2 week go by without them seeing your face briefly.
+
 
+
* You should be making at least one formal progress meeting with supervisors per month. It does not strictly have to be exactly a month, but roughly each month you should be in a position to show some progress and have some problems and difficulties to discuss.
+
 
+
* The onus is on you to drive the meetings, make the appointments and set them up.
+
 
+
* You are expected to make a YouTube presentation of your whole project.
+
  
 
== Relationship to possible career path==
 
== Relationship to possible career path==
The the coding and project management skills you pick up in the project will serve you for a wide range of careers.
+
The the coding and project management skills you pick up in the project will serve you for a wide range of careers. Notably, due to the use of the Swift programming language, this project could lead well into a career in app development for IOS.
 
+
== References and useful resources==
+
If you find any useful external links, list them here:
+
*
+
*
+
  
 
==Back==
 
==Back==

Latest revision as of 13:28, 5 November 2024

Supervisors

Honours students

General project description

With the constant improvements in technology, there has been a clear shift in the ways that non- verbal neurodivergent individuals are able to communicate with the general population. This includes the use of tools, such as phone applications, which using text-to-speech (TTS) can take the form of an artiAicial voice for the individual. This project looks to take one such tool and improve it in a way that will create a near seamless artiAicial voice, which also beneAits those who struggle with the process of typing. These tools are more commonly known as Augmentative and Alternative Communication (AAC) tools/applications, communication devices, or “talkers”, and assist in construction of sentences/phrases through the use of linking symbols/keywords and predictive keyboards [1]. With numerous options on the market, they look to meet the accessibility needs of neurodivergent people, categorised by cognitive, physical, and speech needs. The main features expected of a AAC application is a simple layout, usually with larger buttons to press to assist those with physical needs who mays struggle to type on a keyboard, as well as predeAined options/the ability to categorise and store symbols/keywords for repeated use in communication. With the speciAic needs of individuals varying highly from person to person, there is often a required trial period to determine if the application/tool is appropriate for the needs of the individual. This can be a slow, and tedious problem, and as a result, a solution that meets all needs is desirable. Talk For Me, is an application developed by Across the Cloud Ltd., which in its current alpha version, presents the user with images/terms to select, then passing these along to a Large Language Model (LLM), which creates a sentence to be spoken via TTS. It was created by Dr Matthew Berryman, who suffered a haemorrhagic stroke that left him paralysed and unable to speak for three weeks. During this time, he was frustrated in the limited tools provided by the hospital to allow for him to communicate, sighting paper charts that did not even include things such as the television in his room. As a result, Talk for Me is an application aimed towards predominately the needs of stroke patients, with other disabilities that affect speech also in mind. The improvements we have made include User Interface (UI) improvements to better suit the application towards its neurodivergent user base’s needs, focusing on simple, easy to understand, and easy to reach buttons. We have implemented features that utilise the user’s current location, and time, to tailor the experience to the user, by making recommendations for Menu items of nearby restaurants, as well as sorting of the provided keywords to recommend those commonly selected by the user. Finally, work was completed to test the performance of a few LLMs available to the team, to determine the most suitable LLM for the application and use-case, as a focus is placed on the near seamless user experience of talking. This report goes into greater detail regarding each of the components of our improvements, with a Literature Review exploring some of the concepts we looked at working on for the project, as well as how we achieved the features we have included.

Specific tasks

The tasks identified by the group to complete for this project focus around improving three areas, the UI, the functionality of the App, and the performance of the app through LLM optimisation.


UI

  • Update the UI to fit with similar apps on the market


Functionality of the App

  • Receive Location/Time information to make informed suggestions
  • Suggest Menu Items based off the users location
  • Suggest previously used sentences given a selection of words
  • Generate 1+ messages from the LLM
  • Sort the keywords presented to the users based off the acquired Location/Time information gained above.


Large Language Models

  • Run and compare the time take to respond over a number of datasets reflecting 1-3 word combinations
  • Compare the number of outliers in each performance
  • Compare the models "reliability" as in how well it follows the provided instructions to respond with a response that contains the expected words (and a sentence that can be used for TTS).

Deliverables

Semester 1

Semester 2

Weekly progress and questions

Weekly progress and questions were recorded in our minutes, see below:

Approach and methodology

The approach involved a number of methods listed in our Progress and Final Reports. Due to our NDA, strict methods are not able to be provided, but can be summarised as accessing the existing code, and modifying/adding to it to gain the desired functions/changes.

Relationship to possible career path

The the coding and project management skills you pick up in the project will serve you for a wide range of careers. Notably, due to the use of the Swift programming language, this project could lead well into a career in app development for IOS.

Back