Wikipedia:Thảo luận/Wiki labels & Revision Scoring as a Service for Vietnamese Wikipedia

Hello Vietnamese Wikipedia,

I apologize for my complete lack of Vietnamese skills. I would most welcome if my post is translated to Vietnamese.

So computers are very good at crunching numbers. Your average calculator can out smart you in arithmetic. However computers are terrible at pretty much in everything else. Programming computers to under take any task no matter how simple beyond computing tends to be very difficult. This is where Artificial Intelligence comes in. With Artificial Intelligence we teach computers how to solve problems without explicit programming for the solution. This is what we are doing.

We are working on a project called m:Research:Revision scoring as a service which aims to provide quality control Artificial Intelligence infrastructure for Mediawiki and Wikimedia projects. We already have our system implemented and running on Azerbaijani, English, French, Vietnamese, Persian, Portuguese, Spanish, Turkish and Vietnamese editions on Wikipedia. We are hoping to adapt our tool to serve Vietnamese language as well as a number of other languages.

We are currently mainly focusing on vandalism detection where we provide an API (m:ORES) that provides scores. We have made an effort to keep our system robust.

The examples I'll provide are based on a machine learning algorithm that was trained to use 20,000 reverted edits. This is kind of modelling is problematic for two reasons. First is, there are non-vandalism related reasons for edits to be reverted such as mistakes from new users, this would develop such an unproductive bias. Second problem would be it lacks the ability to distinguish good faith users from malicious ones. To demonstrate our system I will give three examples from English wikipedia. I have picked these three semi-random.

Score of 90% diff en:Moncef Mezghanni
- As visible in the diff, it is clearly something that shouldn't be welcome on English wikipedia. Algorithms confidence also matches my human assessment.
Score of 75% diff en:Monin
- When I look at the diff it isn't immediately clear to me if this should be reverted. Detailed look reveals that prior version had more neutral information, but new version at a glance isn't exactly clear cut vandalism, albeit spammy. Algorithms confidence drops just as my human assessment.
Score of 19% diff en:Curiosity killed the cat, but satisfaction brought it back
- As visible in the diff this edit clearly improves the article. The algorithms confidence plummets as well. Algorithm is more confident that this edit should NOT be reveted.

We are also working towards a system for article quality where we use existing assessment by en:Wikipedia:Version 1.0 Editorial Teamto train our system. We only have this system on English wikipedia at the moment but we would be more than happy to expand to other language editions. I am uncertain if Vietnamese Wikipedia has a similar quality assessment scale. I have picked 5 random articles to demonstrate this.

Predicted: Start class (not even assessed) Perm link en:Maidenhead Advertiser
Predicted: Stub class (actually marked Stub class) Perm link en:Joel Turrill
Predicted: C class (actually marked stub class) Perm link en:Kajaanin Haka
Predicted: C class (actually marked C class) Perm link en:Castell Arnallt
Predicted: Featured class (actually marked Featured article) Perm link en:Hurricane Diane

Typical problem is that humans typically do not re-asses articles over time or articles are never assessed in the first place. Our system circumvents this problem by automating this.

We have already gathered some language features such as bad words and informal words.

We need a localization of en:Wikipedia:Labels serving as our local landing page. After this is done, we would like to start an edit quality campaign where we request the local community to hand code/label ~2000 revisions labeling them productive/damaging and good faith/bad faith. This would be similar to the campaign on English Wikipedia en:Wikipedia:Labels/Edit quality.

After this we will be able to generate scores for revisions that is usable by gadgets such as ScoredRevisions as well as (potentially) tools like huggle. If community desires it, it can even be used to create a local vandalism reversion bot.

So in a nutshell our algorithm relies on community input to support the community. Feel free to ask any questions. Either here, on meta or on IRC on the freenode server and #wikimedia-ai channel where we hang out. You can also reach us at https://github.com/wiki-ai

-- とある白い猫 ^chi? 11:55, ngày 17 tháng 8 năm 2015 (UTC)[trả lời]

Wikilabels localization, final few things sửa

Hello all, we have concluded the older campaigns for English, Portuguese, Persian and Turkish which have concluded recently which was partially why we had this gap. So we are very close in launching the edit quality campaign for this wiki as well. All we need is the translation of the relevant entry on m:Wiki labels/Interface translation and m:Wiki labels/Interface translation/Edit quality. We are very excited to expand our work to include this wiki and can start the campaign as soon as we have the two pages translated. Thanks! -- とある白い猫 ^chi? 17:18, ngày 11 tháng 10 năm 2015 (UTC)[trả lời]

One last thing, one thing we do is we auto label revisions we think are likely good these include revisions that are not reverted in a while and revisions that were made by users with higher access (such as sysop). What user groups aside from sysop are "trusted" on this wiki? User groups I see are: autopatrolled, bot, bureaucrat, checkuser, eliminator, flood, flow-bot, import, ipblock-exempt, patroller, rollbacker, sysop. -- とある白い猫 ^chi? 22:41, ngày 22 tháng 10 năm 2015 (UTC)[trả lời]

Hi とある白い猫, the trusted user groups on this wiki including checkuser, bureaucrat, sysop and eliminator, which approved by community via consensus (see Wikipedia:Biểu quyết phong cấp). Your sincerely. --minhhuy ^{(thảo luận)} 15:11, ngày 22 tháng 11 năm 2015 (UTC)[trả lời]