They got the whole Twitter database. It’s kinda the same with Gemini. But somehow Meta isn’t catching up; maybe their Llama 4 architecture isn’t that stable to train.
Or maybe Facebook data is even worse than Twitter?
Llama 3.3 was good, tho. For multimodal, Llama 4 also uses the Llama 3.2 approach, where image and text are fused into a single model instead of using a separate CLIP or SigLIP encoder.
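Rough toy sketch of what that early-fusion idea looks like (all sizes and names here are made up, not Meta's actual code): image patches get linearly projected straight into the same embedding space as the text tokens and concatenated into one sequence for a single transformer, instead of going through a separately pretrained CLIP/SigLIP tower.

```python
import numpy as np

rng = np.random.default_rng(0)

d_model = 64              # shared embedding width (hypothetical)
patch_dim = 3 * 16 * 16   # one flattened 16x16 RGB patch
vocab = 1000              # toy vocabulary size

# Text path: an ordinary token embedding table.
tok_embed = rng.normal(size=(vocab, d_model))

# Image path: a plain linear projection maps flattened patches
# directly into the text embedding space -- no separate
# contrastively-trained CLIP/SigLIP encoder in between.
patch_proj = rng.normal(size=(patch_dim, d_model)) * 0.02

text_ids = np.array([5, 42, 7])            # toy prompt tokens
patches = rng.normal(size=(4, patch_dim))  # 4 toy image patches

text_tokens = tok_embed[text_ids]          # shape (3, d_model)
image_tokens = patches @ patch_proj        # shape (4, d_model)

# Early fusion: concatenate both modalities into ONE sequence
# that a single transformer would consume end to end.
sequence = np.concatenate([image_tokens, text_tokens], axis=0)
print(sequence.shape)  # (7, 64)
```

The contrast with the CLIP/SigLIP route is that there the image tower is trained separately (contrastively) and only its pooled features get bolted onto the language model; here the projection is trained jointly with everything else.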