On Wednesday I had lunch with Raph Levien, and came away with a picture of how a website that fostered the highest quality discussion might work.
Principles:
- It’s possible that the right thing is a quick fix to Less Wrong as it is; this is about exploring what could be done if we started anew.
- If we decided to start anew, what the software should do is only one part of what would need to be decided; that’s the part I address here.
- As Anna Salamon set out, the goal is to create a commons of knowledge, such that a great many people have read the same stuff. A system that tailored what you saw to your own preferences would have its own strengths but would work entirely against this goal.
- I therefore think the right goal is to build a website whose content reflects the preferences of one person, or a small set of people. In what follows I refer to those people as the “root set”.
- A commons needs a clear line between the content that’s in and the content that’s out. Much of the best discussion is on closed mailing lists; it will be easier to get the participation of time-limited contributors if there’s a clear line of what discussion we want them to have read, and it’s short.
- However this alone excludes a lot of people who might have good stuff to add; it would be good to find a way to get the best of both worlds between a closed list and an open forum.
- I want to structure discussion as a set of concentric circles.
- Discussion in the innermost circle forms part of the commons of knowledge all can be assumed to be familiar with; surrounding it are circles of discussion where the bar is progressively lower. With a slider, readers choose which circle they want to read.
- Content from rings further out may be pulled inwards by the votes of trusted people.
- Content never moves outwards except in the case of spam/abuse.
- Users can create top-level content in further-out rings and allow the votes of other users to move it closer to the centre. Users are encouraged to post whatever they want in the outermost rings, to treat it as one would an open thread or similar; the best content will be voted inwards.
- Trust in users flows through endorsements starting from the root set.
More specifics on what that vision might look like:
- The site gives all content (posts, top-level comments, and responses) a star rating from 0 to 5 where 0 means “spam/abuse/no-one should see”.
- The rating that content can receive is capped by the rating of the parent; the site will never rate a response higher than its parent, or a top-level comment higher than the post it replies to.
- Users control a “slider” a la Slashdot which controls the level of content that they see: set to 4, they see only 4 and 5-star content.
- By default, content from untrusted users gets two stars; this leaves a star for “unusually bad” (eg rude) and one for “actual spam or other abuse”.
- Content ratings above 2 never go down, except to 0; they only go up. Thus, the content in these circles can grow but not shrink, to create a stable commons.
- Since a parent’s rating acts as a cap on the highest rating a child can get, when a parent’s rating goes up, this can cause a child’s rating to go up too.
- Users rate content on this 0-5 scale, including their own content; the site aggregates these votes to generate content ratings.
- Users also rate other users on the same scale, for how much they are trusted to rate content.
- There is a small set of “root” users whose user ratings are wholly trusted. Trust flows from these users using some attack resistant trust metric.
- Trust in a particular user can always go down as well as up.
- Only votes from the most trusted users will suffice to bestow the highest ratings on content.
- The site may show more trusted users with high sliders lower-rated content specifically to ask them to vote on it, for instance if a comment is receiving high ratings from users who are one level below them in the trust ranking. This content will be displayed in a distinctive way to make this purpose clear.
- Votes from untrusted users never directly affect content ratings, only what is shown to more trusted users to ask for a rating. Downvoting sprees from untrusted users will thus be annoying but ineffective.
- The site may also suggest to more trusted users that they uprate or downrate particular users.
- The exact algorithms by which the site rates content, hands trust to users, or asks users for moderation would probably want plenty of tweaking. Machine learning could help here. However, for an MVP something pretty simple would likely get the site off the ground easily.
(I prefer to ignore the specific details of the implementation and only discuss the idea in general. Because debating the details feels like "too soon; I need to be sure about the higher level first".)
There is a profound difference between building "commons of knowledge" and a "discussion website". In some sense, they are the opposite of each other. Discussion websites, by their nature, attract people who have a preference for discussion. Curiosity seeks to annihilate itself, but social interaction seeks to grow and persist. And if people who feel that the debate is no longer productive start leaving, well, those who don't mind will become the new norm.
But how do you build the commons of knowledge without having the discussion first? How would people contribute? How would they point out the mistakes? How would they conclude whether the supposed "mistakes" are actual mistakes or not? How would they coordinate in time, so that the published piece of knowledge is reviewed now, and the author can read the responses and update the text now? (as opposed to e.g. someone randomly pointing out an error ten years later, and the original author no longer being active)
Seems to me that successful knowledge-oriented websites, such as Wikipedia and Stack Exchange, solve this problem by separating the knowledge from the discussion. There are different ways to do it.
On Wikipedia, the article presents the "final answer", and the whole discussion happens on the Talk page. (There is typically one Talk page per article, but if the discussion grows unusually long, the older parts are gradually moved into separate read-only Archive pages.) If someone believes they have found a mistake in the article, they edit the article, and optionally explain the reason on the Talk page. If someone else believes they were wrong, they revert the changes in the article, and optionally explain their reasons on the Talk page. This sometimes leads to an "edit war", when various solutions are implemented, for example the page can be "locked" so the ordinary users can't edit it anymore, and all they can do is argue for their case on the Talk page; only privileged remain able to edit the article.
On Stack Exchange, each user can provide their own answer to the question, and all answers are displayed below the question; the ones with the most votes are displayed first. There is further an option to write a short comment below the question or an answer, and to upvote the comment. (The comment structure is linear, not hierarchical.) To prevent the growth of the discussion, only a limited amount of text is displayed immediately; displaying the remaining text requires further user action. For example, if there are more than N answers, only the highest upvoted N answers are displayed below the question; the remaining ones are moved to page 2 or more. Similarly, if there are more than M comments below a question or an answer, only the highest upvoted M comments are displayed, unless the user explicitly clicks "show more comments". Together this means that regardless of the number of answers and the number of comments, the user clicking on the question still receives a relatively short page, containing the most relevant things. Furthermore, debate that is not strictly on-topic is discouraged, the content of such debate is frequently removed, and the users are advised to debate their opinions on a chat, outside the question-and-answers area.
We can't copy either model directly. Less Wrong is neither an encyclopedia nor a Q&A website. (We discuss things that sometimes don't have official names yet; and we often provide insights and opinions without being asked first.) We are more like a blog, or maybe a news site. But blogs are personal (worked well while Eliezer was writing the original Sequences) or for a predefined group of authors; they don't scale well. A the purpose of news is typically to maximize paperclips... ahem, pageviews... which is best achieved by techniques contrary to the goal of developing and spreading rationality. (I suspect that the next step in the online news business will be websites with fake news generated automatically using machine learning and A/B testing. Using thousands of different domain names, to make it impossible to write a domain-name filter; generating enough content to keep all domains active and seemingly different. Also, automatically generated comments below the articles. And automatically generated replies to humans who get fooled by the system.)
Seems to me these would be reasonable guidelines for a good solution:
Keep the "final product" and the "intermediate products" unambiguously separated, in a way that clearly communicates to the reader's System 1. Preferably, each of them on a different domain name, with different design. Separate "LW, the sacred tome of rationalist knowledge" from "LW, where highly intelligent people procrastinate".
The change from the "intermediate" to the "final" state must always be done by a conscious decision of a group of trusted people. (It is not necessary to make all of them approve each change, more like: any two or three of them may promote the content, and then any one of them can veto the change, and if that happens, then all of them will debate behind the closed doors until they reach a consensus.) No amount of votes is enough to automatically promote an article; and no amount of karma is enough to automatically promote a person to the trusted circle. On the other hand, members of the trusted circle are publicly known. And within the trusted circle, they can review all other other's actions and votes.
Changing the article from the "intermediate" to the "final" state may include asking the original author to rewrite it (or to consent with someone else's rewrite). Thus we would avoid the dilemma of "some part of this article are low-quality and lenghty, but here are a few really valuable points, so maybe the cost-benefit analysis suggests we promote it anyway". Nope, just rewrite it to keep the good parts and remove the bad parts; it will take you an afternoon, and the updated version will be there displayed to everyone for years.
Promoting the article to the "final" state does not mean promoting comments below it to the "final" states. First, no comments should get promoted automatically; and second, most comments should not be promoted at all. So, the "final" website should just display the article without the comments (but maybe with a link to the original discussion). If some comments are considered worthy enough to be canonized along with the article, they could become a part of the article itself, e.g. either by fixing the mistake, or adding a footnote. Paraphrasing Dijkstra, we should not regard LW comments as "lines produced" but as "lines spent".
New visitors to the website should be directed to the "final" version foremost, and to the "intermediate" version only as a secondary option. So unless they have a specific plan, they will read some parts of the "final" version before they start participating in the "intermediate" section. In other words, having people debating on LW without reading the Sequences first is a result of bad web design.
EDIT:
In some sense, having LW separated into forum and wiki is a partial step in this direction, but it differs from this proposal in many important ways:
The highest-value content stays in the forum section, along with the low-value content. (It is merely linked from the wiki.) Thus, at least to my System 1, the good articles and the bad articles are connected more than any of them is with the wiki. Also, all comments, whether good or bad, remain connected with the good articles.
When new people visit the page for the first time, the first thing they see is... a description that seems like a Wikipedia article stub; four recent articles; and four non-recent articles. Not good. The title page should contain at least 95% of nicely designed hand-picked best content, and only a small link towards the discussion somewhere in the corner.
On the other hand, a technically simple implementation of my suggestion would be to create another website -- which could be technically a wiki, but it absolutely shouldn't look like one; all the wiki-related buttons should be hidden from the average observer, and only available for the "inner circle" -- and copy the hand-picked best articles there. For the average visitor, there would be no way to post anything; only to click the links and read the contents. Occassionally, a link would point them towards a relevant discussion on LW, but to their System 1 that would be obviously a link to an external website, with a different design. (The "inner circle" would have a Slack debate somewhere, where they would talk about which articles deserve to be copied. All contributors to LW would have to consent in advance with the possibiity of having their article copied to the other website -- and removed from LW, to avoid having duplicates.)