Recommendation algorithms determine what content we see next on many social media platforms. They have replaced the previous method, where posts were simply sorted from newest to oldest. Algorithmic sorting now pervades nearly all our social media apps: Instagram, TikTok, YouTube, X, and even LinkedIn have all transitioned to this “For You” type of experience. This is no accident; social media companies adopt recommendation algorithms because they are remarkably effective at keeping users on the app. Freshness sorting relies on luck to deliver a high-quality sequence of content. You could happen to log on when the last ten posts were all horribly boring, in which case you’d quit the app almost immediately. With a recommendation algorithm, the app can instead harvest the best content for you and present it at the top of your feed, giving you a curated stream of exactly the type of content you like.

How Do Recommendation Algorithms Work?

Recommending content is really hard. Think about what it takes to recommend a movie to a friend. It requires me to know a lot about you and a lot about movies, and when I say ‘a lot’ I mean ‘a lot’. Are you an analytical person? Do you like drama? Are you into older movies? Foreign films? Do you like long movies? Then I need to know a lot of movies, because the more movies I know, the more likely I am to find one that fits your criteria. This, to me, distills two of the core problems of recommending: 1. relevant data collection and 2. a robust content collection. If I know everything there is to know about your movie-watching habits but don’t know many movies, I won’t be a good recommender. And vice versa: if I am a movie buff but know nothing about you, I won’t give you anything good.

Social media recommendation algorithms therefore must have some way to deal with these challenges.

Two Requirements for Recommendations:

  1. Knowing You:
    An understanding of the user’s preferences

  2. Having Content:
    A library of content

How Social Media Knows You:

Social media algorithms solve the first requirement by collecting information about you. This includes basic information like age, gender, and location. It also includes your interaction history on the application: what content you like, view, search for, and click on. This interaction history ends up telling a significantly richer story about your preferences than your basic personal information does. For instance, watching recipe videos tells platforms you likely enjoy food and cooking. The specific political content you interact with may tell platforms your political views. Beyond broad interest segments, unique pockets of overlap begin to form a deeply rich profile of your preferences. The more you use the platform, the more data it collects on you, and the better it knows you.
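
To make this concrete, here is a toy sketch of how a weighted interaction history might be aggregated into an interest profile. The signal names and weights are invented for illustration; real platforms use far richer signals and learned weights, but the principle is the same:

```python
from collections import Counter

# Hypothetical signal weights: an active "like" or "search" says more
# about your preferences than a passive "view". These numbers are
# invented; real systems learn weights like these from data.
WEIGHTS = {"view": 1.0, "like": 3.0, "search": 4.0}

def interest_profile(interactions):
    """Aggregate (topic, signal) interaction events into topic scores."""
    profile = Counter()
    for topic, signal in interactions:
        profile[topic] += WEIGHTS.get(signal, 0.0)
    return profile

history = [("cooking", "view"), ("cooking", "like"),
           ("politics", "view"), ("cooking", "search")]
profile = interest_profile(history)
# cooking scores 1 + 3 + 4 = 8.0; politics scores 1.0
```

Even this crude tally already ranks cooking far above politics for this user, which is all a ranking system needs to decide what to serve next.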

Cold Start Problem:
When you join a platform for the first time, recommendations must be made with little data about who you are. This challenge is known as the Cold Start problem. Applications often have an onboarding experience where you can select certain interests like Sports, Cooking, or Programming. These help the platform know where to start.
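
A minimal sketch of how those onboarding picks might seed a brand-new profile. The blending strategy and the boost value here are assumptions for illustration, not any platform's actual method:

```python
def seed_profile(selected_interests, global_popularity, boost=10.0):
    """Bootstrap scores for a new user: start from what is globally
    popular (the only data available), then boost the topics the
    user explicitly selected during onboarding."""
    scores = dict(global_popularity)
    for topic in selected_interests:
        scores[topic] = scores.get(topic, 0.0) + boost
    return scores

popularity = {"sports": 5.0, "cooking": 3.0, "programming": 1.0}
scores = seed_profile(["programming"], popularity)
# programming jumps from 1.0 to 11.0 and now outranks everything
```

The fallback to global popularity is why a fresh account's feed looks like "whatever is trending" until your own interactions take over.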

It is clear that social media platforms can collect an inordinate amount of data on us, an amount that moves beyond any intuition we could use in normal human recommending. It is as if recommending movies were mapping preferences onto a three-dimensional field, while what social media platforms are doing is mapping onto a million-dimensional field.
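
The field metaphor maps directly onto how such systems compare people: each user becomes a vector of feature scores, and similarity is just geometry, computed the same way in three dimensions or a million. A sketch (the three taste dimensions here are invented):

```python
import math

def cosine_similarity(u, v):
    """Angle-based similarity between two preference vectors.
    The formula is dimension-agnostic: it works identically
    whether the vectors have 3 entries or 1,000,000."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm

# Three hypothetical "dimensions" of taste: action, drama, documentaries.
alice = [0.9, 0.1, 0.3]
bob   = [0.8, 0.2, 0.4]
carol = [0.1, 0.9, 0.1]
# alice sits far closer to bob than to carol in this space; a platform
# runs the same computation over a vastly higher-dimensional space.
```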

Solving Content Collection:

The knowledge of user preferences is only useful if you have the right content to fit those preferences. For instance, if I predict you will really enjoy vegan cat cooking videos, but I don’t have any, my prediction is useless. Early social media would only show you content created by your friends. This meant the platforms were severely supply-constrained for content. This is when the move toward discover pages and “For You” pages began, and the “social” in social media became lost. Suddenly algorithms didn’t just have to pull from the 20 posts your friends uploaded that day; they could pull from billions of clips, so the chances were good that the content you wanted could be found. This content still needed to be labeled, with the platform learning which features each video had. Social media platforms monitor which users interact with which videos, and from this they build an understanding of which kinds of people like which content. If no one watches a video, it will not be recommended. If only men who like car videos watch it, it will be shown only to those users. This process uses the preferences of people like you to predict what you might like.
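
That last step is the core of collaborative filtering. A toy sketch, assuming each user is just a set of liked video ids and user similarity is measured by overlap (real systems use learned embeddings, but the logic is recognizably this):

```python
def recommend(target, likes, top_n=3):
    """likes: {user: set of liked item ids}. Score each item the
    target hasn't seen by how similar the users who liked it are."""
    def jaccard(a, b):
        # overlap of two like-sets, 0.0 (disjoint) to 1.0 (identical)
        return len(a & b) / len(a | b) if (a | b) else 0.0

    seen = likes[target]
    scores = {}
    for user, items in likes.items():
        if user == target:
            continue
        sim = jaccard(seen, items)
        for item in items - seen:  # only recommend unseen items
            scores[item] = scores.get(item, 0.0) + sim
    return sorted(scores, key=scores.get, reverse=True)[:top_n]

likes = {
    "you":    {"cars_01", "cars_02"},
    "fellow": {"cars_01", "cars_02", "cars_03"},  # similar taste to you
    "other":  {"knitting_01"},                    # no overlap with you
}
picks = recommend("you", likes)
# "cars_03" ranks first: the user most like you liked it
```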

Reinforcement Learning:

The algorithm that does the recommending has one goal: to increase your time on the app. It does not care how much you enjoy the content, or even how long you watch a particular video; it only cares how long you are on the application. It uses this quantitative metric to assess its own quality. Many different iterations of the algorithm are tested, and the best-performing one ends up on the platform. These reinforcement learning algorithms are essentially massive mathematical functions that take in all the data about you, the content collection, and other users’ behaviors, then compute which content you are likely to view. They are opaque in the sense that no one person knows exactly how they work. They are essentially black boxes with free rein to use any data and draw any connections necessary to render good recommendations. They have no understanding of concepts or anything a human would think about when recommending. Adding rules like that would only limit the algorithm’s ability to be creative and find rules humans couldn’t have anticipated.
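
A drastically simplified sketch of that optimization loop, written as an epsilon-greedy bandit whose only reward signal is observed watch time. The items and numbers are invented, and production systems are vastly more complex, but the single-mindedness is the point:

```python
import random

def learn_best_item(true_watch_time, rounds=2000, eps=0.1, seed=0):
    """Toy watch-time maximizer: repeatedly serve an item, observe a
    noisy watch time, and keep a running average per item. It never
    models enjoyment or meaning -- only the metric."""
    rng = random.Random(seed)
    estimate = {item: 0.0 for item in true_watch_time}
    serves = {item: 0 for item in true_watch_time}
    for _ in range(rounds):
        if rng.random() < eps:                    # occasionally explore
            item = rng.choice(list(true_watch_time))
        else:                                     # otherwise exploit
            item = max(estimate, key=estimate.get)
        watched = true_watch_time[item] + rng.gauss(0, 1)  # noisy reward
        serves[item] += 1
        estimate[item] += (watched - estimate[item]) / serves[item]
    return max(estimate, key=estimate.get)

# Hypothetical average watch times (minutes) per item:
best = learn_best_item({"calm_video": 5.0, "outrage_clip": 20.0,
                        "tutorial": 10.0})
# the loop converges on whatever maximizes watch time: "outrage_clip"
```

Notice there is no term anywhere in this loop for whether the viewer is better off; if outrage happens to hold attention longest, outrage is what gets served.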

The Dual Effect of Recommendation Algorithms:

Recommendation algorithms essentially collect data about you, build a labeled content collection, and serve you the content you are most likely to watch. Most tech-savvy people who use these platforms are aware of this dynamic: the more they use the platform, the better the recommendations become, and the more it discovers the type of content they like to watch. You often hear people talk about how they discovered their passion for pimple-popping videos or hydraulic press videos. It is talked about as if the algorithms are simply getting better at anticipating us, as if we are fixed in our preferences and the model is being reinforced to learn us. Instead, I think we need to see this relationship as much more interactive. I believe that as much as the recommendations are reinforced by us, we are simultaneously being conditioned by them to like the type of content they have. We are being programmed into being the type of people who like watching content for long periods of time, scrolling and swiping. A black-box recommendation algorithm can learn the rules to curate exactly what you want in order to keep you on. Or it can learn what content it needs to show you, and in which order, to slowly shape you into a person who likes the content it has to share: the content that is far more sharable and bingeable. If you come to YouTube with an interest only in long-form conversations, it won’t be long until the algorithm begins serving you shorter and snappier content. We often think these platforms are just finding the content we were always looking for, but I think what they are doing is far more dangerous than showing us content we can’t look away from.

It is fundamentally changing each one of us to become the perfect viewer: to no longer want to spend time creating, but to spend more and more of our time consuming. Your hobbies only get in the way; how about you throw those away and just watch videos as your new hobby? I’m sure you’ll enjoy that just as much. And suppose the algorithm realizes that anxious, depressed, and addicted people are far easier to keep watching. Then it will begin to show you the content that makes you all of those things. When we talk about black boxes and reinforcement learning for recommendations, we are saying: get the watch time up by any means necessary. Worst of all, we have no idea what is even happening. Surely some work is done to isolate algorithmic features, but as neural networks get deeper and larger, they become increasingly impossible to understand. All we know is that our population has become increasingly and alarmingly anxious, isolated, depressed, addicted, and mentally sick. Attention disorders seem a dime a dozen, and focus has become increasingly rare. When we use AI algorithms like this, we have to admit we are engaging with an alien intelligence, one that has no morality, no defined methods, no common sense, except its goal of maximizing watch time by any means necessary. What else do you need to know? Delete these apps; you seriously have no idea what you are up against every time you open them.