If the fediverse had to deal with the amount of traffic and content from YouTube in its current state, it would collapse immediately.
The Fediverse would be a very different place if it was hosting anything remotely close to YouTube tier traffic. FFS, how much of the Fediverse is even outside English speaking countries? None of our systems are getting bombarded with hundreds of gigabytes of Good Morning messages like Whatsapp is dealing with in India, for instance.
So much of the content on these big services is both trivial in terms of audience and enormous in terms of relative file size. My sister-in-law sent me a thirty minute compilation video from their latest summer vacation, which she hosted to YouTube. That video is going to get maybe five views, unless one of us goes back to watch it a second time. How much is it costing YouTube to host and stream? Obviously far more than what they make from any of us.
Now scale that up to millions.
The Fediverse isn’t trying to do anything remotely like that.
This specific example is one thing that self hosting is arguably better for. I’ve made a few shitposting memes and the like that are five seconds long and uploaded unlisted just to share with friends that get immediately flagged and banned for DMCA that I’ve taken to just self hosting them. They’re getting like three views anyway because the world was never meant to see them.
People sharing videos with friends and family seems like a problem that’s already solved, if you really don’t want to use YouTube. Big channels that get millions of views are the real issue.
Self-hosting, certainly (to a degree anyway). But the Fediverse isn’t self-hosted. I’m not keeping a catalog of comments on my computer that you lose access to when I close my laptop.
Self-hosting also tends to require dedicated hardware. Less of a big deal as the real cost of your own personal little server setup has plummeted. But still something that’s predicated on always-on internet connectivity in a way that’s not always practical.
It’s not just file size either. Video basically has several different things going on, where improving on one aspect tends to require compromise on the others:
Resolution
Frame rate
Quality
Bit rate (file size)
Encoding complexity
Decoding complexity (which affects battery life of mobile devices viewing the content)
Robustness for dropped or corrupted data
Over time, the standards improve, but generally benefit from specialized hardware for decoding (thus making decoding complexity a bit more complicated when serving a lot of people with different hardware).
Netflix, for example, serves a small number of very large files to many, many people on demand. That means they benefit from high encoding complexity, even if it shaves off a tiny bit of file size, because spending a few extra hours on encoding a movie that’s 10mb smaller is worth it if 10 million people watch that movie, as that’s 100 terabytes of traffic saved.
But YouTube/Facebook and the others with a lot of user-submitted video, they’re ingesting hundreds of hours of content every minute, chopping it up into like 5 different resolutions/quality levels.
Then YouTube has a shitload of processes for determining which video gets which treatment. A random upload of a kid’s birthday party might get a few hundred views at most, so YouTube cares less about file size and more about saving that computational complexity up front. But if a video hits 1000 views in a few minutes, that means it’s on the cusp of going viral, and it might be worth re-encoding with the high cost encodings that save space/bandwidth.
If a service doesn’t scale, it won’t be necessary to have that kind of complexity in the service. But those videos will load a bit slower, use a little more battery and bandwidth to watch, be more prone to skipping/distortion, etc.
Video is hard. User submitted video is harder. Especially at scale.
The Fediverse would be a very different place if it was hosting anything remotely close to YouTube tier traffic. FFS, how much of the Fediverse is even outside English speaking countries? None of our systems are getting bombarded with hundreds of gigabytes of Good Morning messages like Whatsapp is dealing with in India, for instance.
So much of the content on these big services is both trivial in terms of audience and enormous in terms of relative file size. My sister-in-law sent me a thirty minute compilation video from their latest summer vacation, which she hosted to YouTube. That video is going to get maybe five views, unless one of us goes back to watch it a second time. How much is it costing YouTube to host and stream? Obviously far more than what they make from any of us.
Now scale that up to millions.
The Fediverse isn’t trying to do anything remotely like that.
This specific example is one thing that self hosting is arguably better for. I’ve made a few shitposting memes and the like that are five seconds long and uploaded unlisted just to share with friends that get immediately flagged and banned for DMCA that I’ve taken to just self hosting them. They’re getting like three views anyway because the world was never meant to see them.
People sharing videos with friends and family seems like a problem that’s already solved, if you really don’t want to use YouTube. Big channels that get millions of views are the real issue.
Self-hosting, certainly (to a degree anyway). But the Fediverse isn’t self-hosted. I’m not keeping a catalog of comments on my computer that you lose access to when I close my laptop.
Self-hosting also tends to require dedicated hardware. Less of a big deal as the real cost of your own personal little server setup has plummeted. But still something that’s predicated on always-on internet connectivity in a way that’s not always practical.
It’s not just file size either. Video basically has several different things going on, where improving on one aspect tends to require compromise on the others:
Over time, the standards improve, but generally benefit from specialized hardware for decoding (thus making decoding complexity a bit more complicated when serving a lot of people with different hardware).
Netflix, for example, serves a small number of very large files to many, many people on demand. That means they benefit from high encoding complexity, even if it shaves off a tiny bit of file size, because spending a few extra hours on encoding a movie that’s 10mb smaller is worth it if 10 million people watch that movie, as that’s 100 terabytes of traffic saved.
But YouTube/Facebook and the others with a lot of user-submitted video, they’re ingesting hundreds of hours of content every minute, chopping it up into like 5 different resolutions/quality levels.
Then YouTube has a shitload of processes for determining which video gets which treatment. A random upload of a kid’s birthday party might get a few hundred views at most, so YouTube cares less about file size and more about saving that computational complexity up front. But if a video hits 1000 views in a few minutes, that means it’s on the cusp of going viral, and it might be worth re-encoding with the high cost encodings that save space/bandwidth.
If a service doesn’t scale, it won’t be necessary to have that kind of complexity in the service. But those videos will load a bit slower, use a little more battery and bandwidth to watch, be more prone to skipping/distortion, etc.
Video is hard. User submitted video is harder. Especially at scale.
Great analysis