@OctopusNemeses

OctopusNemeses@lemmy.world · edit-2 1 day ago

I wonder if millennials will ever come to grips with getting older and out of touch. Will they be more like the boomer generation and clutch to their reality and force it upon the world into their geriatric years. I suspect it will be latter.

It’s like they’ve learned nothing.

Reddit was something a solid 15 years ago. Even 10 years ago it turned to shit when the_donald overran the platform as Huffman sat idly by. These days there is no intelligence on reddit. There’s petty social media bickering.

Which begs the question. Why has nobody been talking about how an ostensibly AGI like system will need AGI level data. No such thing exists. There’s no dataset that captures the entirety of humanity’s smartest minds. They’re feeding these models random social media comments. Like, what? Am I taking crazy pills.

OctopusNemeses@lemmy.world · 4 days ago

I bet they could measure a steep decline over time. Probably a sharp cliff around 2016. Along with discussion about any policy.

Furthermore, I wonder how many Americans realize their politics hasn’t been about politics all anymore.

OctopusNemeses@lemmy.world · 4 days ago

That was my experience playing with the older neural net models back in the day. Usually the initial models and datasets are nearly the best it will get. Trying to feed it more data or trying to tweak the model only gains marginal improvements. Unless you’ve made a critical error in the initial work or miraculously happen upon a much better dataset then not much will change. I mean the whole premise that the machine finds the optimal solution within the limits of its capabilities. Beyond that you’re, rolling the RNG until the output lands on a result you like better, but the capabilities remain the same.

That’s why I suspect LLMs have peaked and these companies must be applying smoke and mirrors to keep the this iteration of AI going. There’s not going to be another leap forward until scientists put out another revolutionary paper for everyone to copy. That seems to be the cadence of AI.

OctopusNemeses@lemmy.world · 7 days ago

The modern day banality of evil.

OctopusNemeses@lemmy.world · edit-2 16 days ago

That’s pretty accurate to my experiences. A notable point being that these projects are seemingly made by kids with zero experience in anything. It’s like investors gave 20 year olds billions of dollars and let them do whatever. Not just any 20 year old but tech bros who think they know everything.

Project requirements flip flop from day to day depending on their mood or tea leaves or the way the wind blows. One day you’re thinking you’ve followed all the project guidelines. The next day they are the complete opposite. Guess what. You’re on the hook for submitting work that no longer meets the spec.

Quality control reviewers are genuine workers or assholes or not paying attention at all. Your quality scores are what determines whether you stay on the project or get summarily removed. As the article said, you’re instantly dropped from everything without notice. You can be in the middle of working on a task and the webpage will vanish along with all your access to everything. It’s a bit of a jump scare that workers fear the most.

It’s all shrouded in much secrecy. There layers upon layers of obfuscations (read: clueless “managers” as the article said). They’re not really managers but messengers passing along the own interpretations of communications from the layer of obfuscation above them.

Still I’m 99% sure the top of the pyramid blame their shit results on us lowest peons. It becomes very obvious why LLMs are hilariously bad at times. The datasets must be so garbage given that they keep flip flopping on their requirements. Those guidelines have been passed through who knows how many layers of other workers like us. It’s a game of telephone.

The article is wrong about managers all being Gen Z. I don’t know why they put that there other than to clickbait and stoke the pointless generational war. The mid level managers are anybody from younger to middle age workers. It’s impossible to actually get contact with any of the actual engineers behind the projects.

There have been times where I’ve pointed out glaring contradictions with in project. Nobody had been paying attention at all. Not the managers. Not other workers. It’s people mindlessly reading off documents they made based on other communications they received from higher up. I was unceremoniously removed from that one project.

I was on one project where a reviewer was giving out a high rate of bad scores and entering random justifications. All of us workers spotted the trend. It took the longest time for anyone above to notice because of the layers of obfuscation.

It’s actually impossible to get in contact with anyone of consequence. The managers can only say they’ll pass along whatever concerns you might have. It the equivalent of putting your message into a suggestion box that never gets read.

I’m pretty sure most projects stack ranked. So you’re easily dropped from a project if you encounter problematic reviewers. There really isn’t any training or anything. It’s sort of a Black Mirror like world. Where workers are monkeys on keyboards sitting in individual pods. Every day a few of those pods are randomly dropped out of the system because that monkey pulled the wrong sequence of levers. Sometimes people say it’s like Severance. Nobody knows what’s really going on because we’re all kept in the dark. We just know the work is very important so we keep clicking the numbers on the screen.

Quite often the datasets are so bad, it’s impossible to annotate the data. The article pointed out the regional accent. Sometimes the images, videos, audios are so bad it’s impossible do anything with them. But it’s work that must be done so you try to invent some way to justify annotating the thing based on the guidelines. What even is the point of using that useless data but to create an LLM model for the sake of having something to show for the investors. It’s a sham.

The quality control process needs to be much better but as the article said, it’s other workers who were picked because they supposedly had better quality scores. The pay remains the same too like the article said. The whole system is a mess to begin with so they might as well be picking reviewers at random. Also keep in mind the project guidelines can change constantly. All things considered, it’s garbage in garbage out as someone already commented.

A lot of workers are grinding time to collect paychecks rather than actually care about a project. It’s a shitshow at both ends. I’ve heard workers remark on how they’ve lost faith in AI after working these jobs.

This is where a portion of the billions of investment money is going. They’re exploiting a work force to churn out datasets because they’ve already devoured the existing internet. The lowest paid I’ve seen so far is $1.50 USD for certain workers in India. On the same project, people in other countries were getting up to $20. The same projects are farmed out to different data brokers so the pay can be drastically different depending on the company.

There’s so many scammers too. Given the disparity in pay, there are people from India and Africa who will farm accounts from other regions of the world and work through VPNs. There have been speculations that some of these workers even get to be reviewers. That’s where people think is a source much of the bogus reviews that are written in broken English.

OctopusNemeses@lemmy.world · 16 days ago

Same. Internet Karens are enabling these authoritarian times.

OctopusNemeses@lemmy.world · 2 months ago

It makes more sense when viewed as a fancy autocomplete, not an intelligence. There’s no intelligence behind it that is reading your statement and understanding your meaning. It’s responding with text that mathematically likely matches some sort of reply that would fit your statement.

Your statement included Y and the algorithm landed on result that includes Y. There’s no intelligence that could understand that you meant no Y.

That bullshit about the model getting fine tuned just means they are data mining you. It doesn’t make the more LLM intelligent. All it does is add your data to their dataset of which the LLM can draw from for possible future replies. The fundamental limitations of the technology still exists.