478
submitted 1 year ago by soyagi@yiffit.net to c/technology@lemmy.ml

Source: https://front-end.social/@fox/110846484782705013

Text in the screenshot from Grammarly says:

We develop data sets to train our algorithms so that we can improve the services we provide to customers like you. We have devoted significant time and resources to developing methods to ensure that these data sets are anonymized and de-identified.

To develop these data sets, we sample snippets of text at random, disassociate them from a user's account, and then use a variety of different methods to strip the text of identifying information (such as identifiers, contact details, addresses, etc.). Only then do we use the snippets to train our algorithms-and the original text is deleted. In other words, we don't store any text in a manner that can be associated with your account or used to identify you or anyone else.

We currently offer a feature that permits customers to opt out of this use for Grammarly Business teams of 500 users or more. Please let me know if you might be interested in a license of this size, and I'II forward your request to the corresponding team.

top 50 comments
sorted by: hot top controversial new old
[-] hypelightfly@kbin.social 48 points 1 year ago

Not using grammarly also works.

[-] diffuselight@lemmy.world 33 points 1 year ago

Oh they are downright into extortion. Try asking them to delete your data and see what happens.

[-] Moonrise2473@feddit.it 25 points 1 year ago

Chatgpt was banned in Italy because they didn't allow the opt-out. Wonder what happens if someone flags this to some European agency...

[-] x_x@beehaw.org 3 points 1 year ago

But ChatGPT offers an opt-out iirc, at the price of loosing the chat history feature.

[-] Moonrise2473@feddit.it 13 points 1 year ago

Yes but the opt-out was added only after they got banned

[-] merde@sh.itjust.works 13 points 1 year ago

if only there was a software that can rewrite the last phrase "selling back your data back to you" 😋

selling your data back to you?

selling back your data to you?

[-] Sanctus@lemmy.world 4 points 1 year ago

If only there was a software to smell your own farts.

Smell your own farts?

Smell farts your own?

[-] CaptObvious@literature.cafe 9 points 1 year ago

It still isn't clear why anyone uses a product developed by non-native speakers to check their writing. For anyone who knows grammar, Grammarly sometimes makes... interesting... suggestions.

[-] eager_eagle@lemmy.world 26 points 1 year ago* (last edited 1 year ago)

As a non-native speaker I'm surprised to the amount of grammar mistakes native speakers make. Being a native speaker is not a testament to how much of the language you know. And even that being true, it's not like a real human corrects your text, so the creators being native or not is pretty much irrelevant.

[-] kifujin@kbin.social 15 points 1 year ago

at the amount of grammar mistakes

[-] eager_eagle@lemmy.world 9 points 1 year ago
[-] CaptObvious@literature.cafe 4 points 1 year ago

They’d’ve gotten it wrong too. Prepositions and postpositions are their own category of linguistic hell, especially in idioms and phrasal verbs.

[-] SheeEttin@lemmy.world 2 points 1 year ago* (last edited 1 year ago)

They'dn't've necessarily gotten it wrong. With a big enough dataset, an ML tool should be pretty accurate, at least in that it will make the same choices as most people have made in their writing.

[-] CaptObvious@literature.cafe 2 points 1 year ago

They'd'n'tve

Apostrophe mistakes aside, no native speaker would stack contractions like this. There’s an upper limit of three words in a single contracted form. It would be “They wouldn’t’ve gotten” or “They’d not’ve gotten.”

ML tools don’t write grammatically correct complex sentences precisely because their training sets contain too many discrepancies. They may learn how to apply prescriptive rules consistently one day, perhaps even one day soon, but this is not that day.

[-] SheeEttin@lemmy.world 3 points 1 year ago

Who says there's an upper limit? You might not be one of those people, but I'm.

Also, that'll teach me to try to write tricky comments while also doing other things. Fixed.

[-] sugar_in_your_tea@sh.itjust.works 2 points 1 year ago* (last edited 1 year ago)
[-] CaptObvious@literature.cafe 2 points 1 year ago

LOL! How did I not know about this? Thanks!

load more comments (1 replies)
[-] CaptObvious@literature.cafe 5 points 1 year ago

Native speakers don’t usually make major grammar mistakes. They may not follow prescriptive rules, but they’re generally understandable by other native speakers because grammar is so deeply embedded in their subconscious that they can’t help handling the language correctly. You do the same in your native language. Everyone does.

The problem with non-natives, and I include myself as a non-native speaker of a few languages, is that we don’t usually have the same instincts. It would be pretty arrogant to tell a native that they don’t know how to use their own language when we, almost by definition, cannot possibly understand it in the same way that they do.

[-] merde@sh.itjust.works 3 points 1 year ago

well said/written

it's not only that "we don't usually have the same instincts", we have a burden of confusing loans, imports, translations, false friends &c.

When you start dealing with gendered languages, it's even worse. There's no logic to it. A hand is a she in one language, a he in another and neutral in third.

also, this pronoun question of culture wars is ridiculous for someone who can speak non-gendered languages 🤷

[-] CaptObvious@literature.cafe 2 points 1 year ago

(Love your handle)

I get what you're saying about gendered languages. But if you speak one long enough, even as a non-native, you'll start to develop a feel for genders and be able to predict them to some degree. So far as I know, the mechanism that determines gender is so deeply subconscious that no one has been able to find and articulate its rules, but it seems to exist.

Re: culture wars - The pronoun question is probably moot point in truly genderless languages. English, unfortunately, is not completely genderless, so it's a bone of contention in the current climate.

[-] eager_eagle@lemmy.world 2 points 1 year ago

all of which is irrelevant to how grammarly works

[-] skullgiver@popplesburger.hilciferous.nl 9 points 1 year ago* (last edited 11 months ago)

[This comment has been deleted by an automated system]

[-] argv_minus_one@beehaw.org 5 points 1 year ago

Email spam usually has heavily flawed English.

I've heard that this is intentional. It would be a waste of the spammer's time to be contacted by people who are smart enough to not be fooled. Those smart people won't bother contacting the spammer and wasting the spammer's time if they see grammatical errors in a message that purports to be from a reputable organization, so the spammer throws in some errors to make the smart people filter themselves out. Or so the theory goes.

[-] skullgiver@popplesburger.hilciferous.nl 5 points 1 year ago* (last edited 11 months ago)

[This comment has been deleted by an automated system]

[-] CaptObvious@literature.cafe 2 points 1 year ago

I've seen this filtering hypothesis, and it seems plausible. OTOH, it also gives James Veitch some fantastic material for his comedy routine.

[-] SheeEttin@lemmy.world 4 points 1 year ago

*nitpicker (but I prefer pedant in polite circles, and grammar nazi on the Internet, or at least I did until actual nazis started showing up again)

load more comments (1 replies)
[-] CaptObvious@literature.cafe 3 points 1 year ago

Certain uni composition students had better learn to write flawless English if they expect to earn their desired grade in my courses.

[-] skullgiver@popplesburger.hilciferous.nl 5 points 1 year ago* (last edited 11 months ago)

[This comment has been deleted by an automated system]

load more comments (1 replies)
[-] drmoose@lemmy.world 7 points 1 year ago

It feels like common sense to don't use Spyware on anything private. I only use Grammarly on my public posts and it's good at that. If you let someone spy on your private emails etc. Then that's on you tbh.

[-] ALERT@sh.itjust.works 6 points 1 year ago

What's wrong with using one's typed text to improve Grammarly? Why is anyone obsessed with opting out of improving products using the results of their usage? I always allow products to use my data, track my activity, send non-anonymous bug reports, etc. Am I a bad person?

[-] sugar_in_your_tea@sh.itjust.works 6 points 1 year ago* (last edited 1 year ago)

You're not a bad person, you're just not concerned with privacy or the value of things you make.

I almost never allow any of that because I expect companies to compensate me for helping them improve their product. If they can't provide a quality product without that, they should increase their prices to hire more staff or pay customers for access to their data.

That said, I'm a lot more willing to agree for open source projects because there isn't a profit motive there, and it's a small way of helping the project. I still occasionally refuse (esp. for a company like Mozilla that often ignores community feedback), though I'll try to help in other ways (evangelism, detailed bug reports, etc).

But if a company is profiting off me, I expect as least a little of that back if I'm going to allow them to use my data.

[-] SMITHandWESSON@lemmy.world 2 points 1 year ago* (last edited 1 year ago)

I totally get what youre saying, but its going to be nearly impossible for you to live up to that if you use social media in any form.

The statement you just typed is probably going to be scrapped and used to train AI. The only way you can win is to not use the internet to interact with people.

Yup, and that's a large part of why I don't use larger SM. Yeah, Meta, Twitter/X, Microsoft, Google, etc can scrape lemmy, but that's unlikely to be a huge source of info. But if they scrape, they scrape, and I guess I'm okay with that.

My larger concern is with privacy. I try to mitigate the privacy concerns by recreating my SM accounts every year or two, so that way at least I won't likely be doxxed.

So on the side, I'm looking into ways of building more robust SM. I'm interested in fully decentralized systems, which can optionally be encrypted and limited to a few. ActivityPub gets close, but it still relies on public servers to store content, and many services aren't encrypted. I'm more interested in p2p systems like IPFS, and I think a reasonably intuitive system can be built in that way. In fact, I'm planning on building a lemmy-compatible instance that uses Iroh as the backend storage and connection mechanism once Iroh gets to a usable state.

But I'm not exactly a zealot here. I didn't switch to lemmy until Reddit announced its third party API pricing change (I switched before my app of choice shut down), because I value convenience. Lemmy is now big enough for me, so I'm looking into the next step.

[-] Polar@lemmy.ca 2 points 1 year ago* (last edited 1 year ago)

No you're not a bad person.

You have to realize that Lemmy is currently hardcore nerds. Linux, Firefox, and FOSS only users.

People here are so hateful against Sync for Lemmy (previously one of the most beloved Reddit apps) because it's not FOSS.

It's actually quite annoying.

https://lemmy.world/comment/2195946

[-] jackpot@lemmy.ml 2 points 1 year ago

is langaugetool foss? it's an alternative

load more comments
view more: next ›
this post was submitted on 08 Aug 2023
478 points (100.0% liked)

Technology

34879 readers
43 users here now

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.


Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.


Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post naziped*gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)

6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: crypto related posts, unless essential, are disallowed

founded 5 years ago
MODERATORS