Is ChatGPT Really a Good Language Learning Tool? - [Case Study]

Ever tried to use #chatgpt for #languagelearning? Is it our next best language teacher? Well let's find out!

Science étonnante - "Ce qui se cache derrière le fonctionnement de ChatGPT" -- https://youtu.be/7ell8KEbhJo

00:00 Intro
00:43 Part I - What is Chat GPT
01:33 Machine Learning
02:30 Chat GPT's baseline model
03:00 How GPT works
06:36 The pre-prompt
07:08 Reinforcement Learning
07:47 Part II - Learning grammar with ChatGPT
08:20 The first 4 verb forms of Arabic
11:00 Second trial - the Masdar form
11:59 Masdar - Conclusion
12:39 Reading with Chat GPT
14:37 Learning vocabulary
14:59 Speaking and Listening Practice
17:10 Outro

#languagelearning #chatbot #arabicstudies #japaneselanguage #OpenAI

No Friday Night Plans

9 months ago

Ever heard of this AI that will transform every aspect of our lives, make human intelligence obsolete and accelerate the downfall of normal brains back to the Stone Age? Yes, you guessed it! Today we're talking about ChatGPT, and more specifically ChatGPT in the context of language learning. I know you have already seen countless videos on this very topic. So let's try to take it to the next level: debunk some of the myths generally associated with ChatGPT, understand why it tells you so
much c*** and understand its strengths and weaknesses so that we can use it properly for language learning. Of course, if understanding how GPT works and what it can and cannot do does not interest you, please use the timestamps down below to go to the part of the video that interests you the most.

If ChatGPT is to become our next best language teacher, it is important to understand how it works and also to figure out whether or not it understands what we are saying and what it is saying.

ChatGPT is
a chatbot that was developed by OpenAI and launched at the end of 2022 for everybody to use. As of June 2023, ChatGPT is not connected to the internet.

Chatbots are nothing new. You find them on pretty much any commercial website nowadays. They usually take the form of a small tab that pops up on screen to ask how they can help you. These chatbots usually work with keywords, and depending on whether or not you find the right keyword to formulate your question, they may or may not be very helpful to you.

ChatGPT, on the other hand, is a chatbot based on a more advanced technology called "machine learning". The most widespread machine learning approach is called "supervised learning", and it is typically used for spam detection or image recognition. You can train a machine to recognize what is in an image: you create a model with a series of adjustable parameters, you use an image as input and the model provides a specific word as output. At the start the model will tell you a lot of c*** until it is trained enough to output the right word. The model's parameters are tweaked along the way until it reaches the highest probability of providing the correct answers. This approach, although very interesting, requires an extremely long training period. It also has a major disadvantage as it requires a lot of human input.

However, this is not the way ChatGPT works. ChatGPT was designed to take the form of a conversation, so this machine learning approach was not appropriate. Rather, ChatGPT is based on a baseline model, i.e., a model that was first trained to perform generic tasks before being tweaked to perform more specific ones. In the case of ChatGPT, the baseline model is none other than GPT, which stands for Generative Pre-trained Transformer.

Interestingly enough, this baseline model was trained to perform a most unusual task: guessing the upcoming word in a text. The input can be any word, any part of a sentence or even any text, while the expected output is any word that would be a plausible completion of said input. An example would be "the mouse is eaten by ..." with "the cat" as output.

You may wonder why anybody would want to train a machine to guess the upcoming words in a text, as it doesn't seem to serve much purpose. Well, the great thing with this strategy is that it doesn't require much human supervision at all, as you can simply train the model using an existing corpus of data, i.e., any text, any book or encyclopedia that's ever existed in digital form. For instance, GPT was fed the entire English version of Wikipedia. Not sure it's a guarantee of quality but that's another matter... As with other machine learning models, the parameters of the model were tweaked and adjusted along the way to ensure it gives the most plausible answer, that is, the word that is most likely to appear as the completion of a specific sentence, phrase or text given as input. GPT-3 is equipped with no fewer than 175 billion
parameters.

It is important to understand that GPT was not trained to provide true answers. The model was trained to provide a "plausible output". For instance, if you ask GPT when Christopher Columbus discovered America, it will tell you 1492. Not because it knows it as a historical fact, but because most, if not all, of the texts it was trained on associate "Christopher", "Columbus", "discovered" and "America" with this very date. However, the proof that it doesn't understand what you are asking is that if you swap "Christopher Columbus" for another name, it will give you the exact same answer.

I was not able to access GPT myself, as you apparently need API access to be able to use it and I just couldn't be bothered. However, if you would like more examples, I would encourage you to check out the video on this very topic made by David Louapre from the French channel "Science étonnante". It's a French channel so obviously it's going to be in French ^.^. But there's always the possibility to use automatic subtitles if you do not understand French. The information I have provided is a quick summary of this video, so if you want more details, please check it out.

Besides the fact that it was not trained to produce correct answers, it also does not simply reproduce sentences that it has learned. Rather, it endeavors to produce the most plausible output, as in anything that would not clash with existing texts. This model presents an obvious advantage
: you can repeat the task endlessly so as to produce an entire text based on a single starting word.

When it comes to languages, GPT was trained on texts in different languages. However, about half of its training was done in English, which causes a major problem as we will see later in this video.

Contrary to what people tend to assume, GPT does not have any memory of any conversation you may have with it. Neither does it know anything beyond 2021. It is a frozen model: its knowledge stops at a certain date. Of course, this may change when it gets connected to the internet, but for now that's the way it is.

Now that we understand the baseline model on which ChatGPT is based, let's learn a little bit more about ChatGPT itself. As I said, GPT was not trained to be a chatbot. It was trained to complete sentences or texts. ChatGPT uses a pre-established prompt, the "pre-prompt", which is an introductory phrase or paragraph that helps the machine understand what you are expecting from it. For instance: "This is a conversation between a human and a well-learned chatbot". We do not know exactly what pre-prompt is used in the case of ChatGPT, but we know that it is most likely a long list of instructions. Since GPT is trained to complete sentences or texts and not to have conversations, it was then "fine-tuned" using a database of real human answers. The third and last training phase is called "reinforcement learning", and during this phase humans judged the quality of the answers that were provided by GPT.

With this short summary, we now understand that ChatGPT does not think, it does not ponder, it does not reflect. Neither does it have any memory of any conversation you may have had with it. And it is not connected to the internet. It generates plausible answers in which words go well together, based on its training data. Now that we know this, let's try to find out if ChatGPT is an appropriate language teacher...

Since ChatGPT was trained to be a chatbot, it can potentially break down complex grammar points and adapt them to the normal flow of a conversation, which would be easier to understand and more user-friendly. In my previous video on how to start reading in Arabic or in any other language, I broke down some of the most important elements of Arabic grammar, one of which was none other than the "Masdar form". So I asked ChatGPT to teach me the Masdar form of the first 4 verb forms. "In Arabic grammar, the Masdar form is a verbal noun that represents the action
or the concept of a verb. It is derived from the root letters of the verb. Here are the Masdar forms of the first 4 verb forms along with simple examples and their English translations".

The 1st verb form apparently is "fa'la", which is a form I did not know existed... It tells us that the Masdar form is "fi'lun". Well, I suppose that if the 1st verb form was "fa'ala", "fi'lun" would indeed be a possibility. It is for instance the Masdar form of the verb "'alima" (to know) - 'ilmun (knowledge
). Interestingly enough, there are several different Masdar forms for the 1st verb form, which it does not seem to explain here. The 1st verb form explanation is consequently quite problematic, especially if we consider the example قرأت الكتاب. القراءة ممتعة. Well, the example does not match the Masdar form that was provided at all.

Let's move on to the 2nd form. It tells us that the 2nd form is "fa'aala", which is correct, and then that the Masdar form is "taf'iil", which is correct too.
However, the example is صممت الخطة. التصميم جيد. "Tasmiim" is indeed the Masdar form of the verb "sammama". Unfortunately, "el jatat" is not properly vowelised. It should have been "eljuttat".

Let's see what it tells us for the 3rd form. The 3rd form according to ChatGPT is "faa'ala", which is correct. But it tells us that the Masdar form is "fa'aala", which is not correct at all... Funnily enough, in the example سجدت في صلاة. السجود واجب. the vowelisation is correct... Unfortunately, none of
the words that appear in this example belong to the 3rd form!

Finally, it tells us that "af'ala" is the 4th form, which is correct, and that the Masdar form is "if'al", which is correct too. It then provides the following example: سعدت بالنجاح. السعادة مهمة. Unfortunately, the vowelisation is again incorrect. It should have been سَعَدتّ بالنجاح. السَعادة مهمة. And the 2nd problem is obviously that this example does not include any word that could potentially belong to the 4th verb form.

I was quite confused by this 1st conversation, so I asked ChatGPT to explain the example sentence it provided for the 3rd verb form. I asked: "You are an Arabic expert. Tell me if the 3rd form of Arabic appears in this sentence." It responded: "Yes, the 3rd verb form "fa'aala" (which is definitely not the 3rd verb form) of Arabic does appear in the sentence you provided. The verb form you are referring to is "sejeda" in the past tense, which means to "
prostrate". In the sentence, it is conjugated as "sajadto"." This answer shows me that ChatGPT does not understand anything it is saying about Arabic grammar.

Since it was telling me a lot of c***, I tried to steer the conversation in the right direction by asking: "Are you sure the verbal noun of the 3rd verb form is fa'aala?" It answered: "You are correct and I apologize for the mistake. The Masdar of the 3rd verb form "fi'aala" in Arabic is not "fi'aala" but rather "istif'aal"
!! I stopped the conversation here, simply because it went on like that for 2 or 3 more answers, all of them completely nonsensical, at least for anybody who knows Arabic grammar. This tends to indicate that it will go on like that until you are satisfied with the answer it provides. Obviously, it's impossible to check every aspect of Arabic grammar in one single video, but if it struggles with something as simple and straightforward as the Masdar, it's unlikely that it would be much more competent with other, more complex aspects of the Arabic language. But I might be wrong.

ChatGPT can also be used to practice reading, especially at the beginning. Reading Arabic without vowels can be quite challenging, especially as a beginner. So you can ask ChatGPT to write short stories for you while adding vowelisation. I asked ChatGPT to provide a short story in Arabic about Japan and to make sure that every word was vowelised. The text reads:

كَانَ يُوسُفُ وَاحِدًا مِنْ الشَبَابِ الَّذِينَ كَانُوا يُحِبُّونَ الثَّقَافَةَ الْيَابَانِيَّةِ. لَقَدْ قَرَأَ الْعَدِيدَ مِنْ الْكُتُبِ وَشَاهَدَ الْعَدِيدَ مِنَ الْأَفْلَامِ عَنْ هَذَا الْبَلَدِ الْبَعِيدِ. طَمَوْحُ يُوسُفِ كَانَ أَنْ يَزُوْرَ الْيَابَانَ يَوْمًا مَا وَأَنْ يَسْتَكْشِفَ جَمَالَهَا الْفَرِيدَ قَرَّرَ يُوسُفُ أَنْ يُقَوِّمَ بِرِحْلَةٍ إِلَى الْيَابَانِ. حَضَرَ جَوَازَ سَفَرِهِ وَأَمْوَالَهُ وَبَدَأَ الْبَحْثَ عَنْ تَذَاكِرِ الطَّيَارَانِ.

We're about halfway through and there are already quite
a few mistakes, so I definitely encourage you to be cautious here too. The fact that there are so many problems with vowels in Arabic might seem a little strange. However, if we go back to the 1st part of the video and remember that GPT was trained using a corpus of existing texts, we can easily understand why there are so many problems: most existing texts in Arabic do not include vowels. Of course, you could argue that many short stories or even novels have a lot of similar
typos. However, the problem here is that it's supposed to teach me a grammar point that it does not seem to understand, and also that it tells you a lot of c*** in a very confident way, which in my opinion is quite problematic.

Even though ChatGPT is not perfect when it comes to Arabic grammar and vowelisation, it does do quite a good job at explaining the meaning of words. For instance, here, in Japanese, I asked for the meaning of the expression "uma ga au" and it provided a most adequate answer. It's also quite good at explaining slang terms in French. So even though it's not the best grammar teacher, it is a good vocabulary assistant.

Now, ChatGPT can of course be used for many other purposes when it comes to language learning, one of which is listening and speaking. You can do so by adding a Chrome extension, either "Talk to ChatGPT" or "Voice Control for ChatGPT". Both extensions come with a lot of different features. You can for instance adjust the "read out loud speed" to make it faster or slower. You can also choose your voice preference, for example whether you prefer the voice of a man or a woman. However, most voices (if not all) sound quite artificial... Depending on your needs, you can also make the input and output languages different.

Both extensions allow for oral conversations. However, I do not use these extensions for this purpose, mostly because I cannot bear the sound of these robotic voices. I personally prefer to use these extensions to produce texts on topics that interest me, which can then be read out loud for pronunciation purposes or to train listening comprehension skills. Of course, you can generate texts based on your current level, for instance by asking ChatGPT to only use vocabulary known to primary school, junior high or senior high students. With both extensions, you can play with speed adjustments, depending on the kind of training that you wish to have. Of course, you can also import your own texts: articles, short stories or parts of novels that you would like to read or have read out loud for listening comprehension training. You simply need to copy and paste the text into ChatGPT and then activate the sound. For further training, you can ask ChatGPT to ask you questions about the content of the text you have just read or listened to, and respond in your target language.

Unfortunately, these extensions do not yet include Arabic, so listening and speaking skills in Arabic cannot yet be trained on ChatGPT using these extensions. In my opinion, these do not replace at all the many benefits of having a language partner, mostly because conversations are also about facial expressions, tone and filler words, which you will not get with this chatbot.

That's all for today's video. It ended up being quite long. At least, I hope that you got to understand how ChatGPT works, if you did not know already, and also understand that it still has many flaws and that it's not necessarily the go-to resource if you want to master a language. Of course, this may change the day it gets connected to the internet, but for now it's not the case. Thank you for watching and I will see you in the next one ^.^
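
As a rough illustration of the "guess the upcoming word" idea from Part I, here is a toy sketch in Python. It is not how GPT is implemented: GPT uses a large neural network and looks at far more context than a single word, whereas this sketch only counts which word most often follows another. The corpus and the names `CORPUS`, `train`, `predict_next` and `generate` are made up for the example.

```python
from collections import Counter, defaultdict

# Tiny made-up corpus (an assumption, for illustration only).
CORPUS = (
    "the mouse is eaten by the cat . "
    "the bird is eaten by the cat . "
    "the cat is chased by the dog ."
)

def train(text):
    """Count, for every word, which words follow it in the text."""
    following = defaultdict(Counter)
    words = text.split()
    for current, nxt in zip(words, words[1:]):
        following[current][nxt] += 1
    return following

def predict_next(model, word):
    """Return the most frequent follower of `word`, or None if unseen."""
    if word not in model:
        return None
    return model[word].most_common(1)[0][0]

def generate(model, start, max_words=7):
    """Repeat the next-word guess to build a longer text from one word."""
    words = [start]
    for _ in range(max_words - 1):
        nxt = predict_next(model, words[-1])
        if nxt is None:
            break
        words.append(nxt)
    return " ".join(words)

model = train(CORPUS)
print(generate(model, "mouse"))      # mouse is eaten by the cat .
print(generate(model, "chased", 4))  # chased by the cat
```

Note the second output: in the corpus the cat is chased by the dog, yet the sketch completes "chased by the" with "cat", simply because "cat" is the word that most often follows "the". The output is plausible rather than true, which is the same effect as the Christopher Columbus example, on a much smaller scale.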
