38
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
this post was submitted on 21 Jul 2023
38 points (100.0% liked)
Technology
124 readers
1 users here now
This magazine is dedicated to discussions on the latest developments, trends, and innovations in the world of technology. Whether you are a tech enthusiast, a developer, or simply curious about the latest gadgets and software, this is the place for you. Here you can share your knowledge, ask questions, and engage in discussions on topics such as artificial intelligence, robotics, cloud computing, cybersecurity, and more. From the impact of technology on society to the ethical considerations of new technologies, this category covers a wide range of topics related to technology. Join the conversation and let's explore the ever-evolving world of technology together!
founded 2 years ago
Bing Chat is doing quite well:
I even asked Bing to calculate the time dilation for the person on earth. It answered correctly with the formula and steps shown clearly.
Then it was pulling its calculations directly from a web source, not using generative large language models. I'm not saying a chatbot can't do this, I'm saying language models can't do this.
Apparently Bing Chat is able to do some maths. I asked Bing multiple variations of this question based on different speeds of the solar sail (e.g. what if it travels at 50% the speed of light). It was able to calculate both the travel time and the time dilation.
If it is only pulling the answer from web sources, how did it handle the variable speeds?
It's also possible that Bing's chatbot is using a math-specific plugin in addition to its websearching plugin.
Your failure in reasoning here is assuming that all of them are purely and only language models. That they receive no other source of learning other than language models -- for example, they aren't fed any kind of pop science math.
It's clear that this is true of models like ChatGPT, but isn't the Bing thing powered by GPT4 with a number of other enhancements? Fixing this "can't do math" thing is a low-hanging fruit for development improvements.
They announced to build in plug ins like for WolframAlpha, so maybe that is where it is pulling this data from.