Wikipedia talk:Bot policy/Archive 30
This is an archive of past discussions on Wikipedia:Bot policy. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page. |
Archive 25 | ← | Archive 28 | Archive 29 | Archive 30 |
Mass creation section
@BilledMammal: Would you mind explaining this revert further? I don't understand what you wrote in your edit summary. – Joe (talk) 09:05, 7 July 2024 (UTC)
- My understanding is that WP:MASSCREATE applies to all mass creations, even when automation is not used - for example, WP:MEATBOT mass creation.
- I also think it is a little redundant. I can't think of any circumstances where true mass creation can occur without some level of automation - for example, the use of boilerplate text. H BilledMammal (talk) 09:13, 7 July 2024 (UTC)
- Thanks. I think it pretty clearly does not apply to all mass creations though, for the following reasons:
- It's part of the bot policy, not the editing policy
- It says so:
any large-scale automated or semi-automated content page creation task [...]
, not any large-scale content page creation task [...] - The (only) requirement created by this section is to seek permission at WP:BRFA. If I went to BRFA and said I wanted to write a series of 50 articles on a subject without automation, I think they'd tell me to move along.
- A 2022 proposal to amend MASSCREATE to
clarify that mass-creation through repetitive editing by hand is not different for policy purposes to automated/semi-automated mass-creation
andmake getting consensus for creation prior to mass creation per WP:MASSCREATE mandatory
failed to gain consensus.
- It does apply to bot-like edits by humans, yes, but that is still within the limits of the bot policy. It is possible to create large numbers of articles without automation; I see editors doing it every day at NPP. For example, writing a stub on a species or location from scratch could take as little as ten minutes. So if you sit down and crank them out all day, you could break 50 and still have time for a long lunch. – Joe (talk) 09:47, 7 July 2024 (UTC)
- It also says
all mass-created articles
, and the final paragraph, as the exception that proves the rule, demonstrates that WP:MEATBOT applies to the creation of content pages and that such creations are required to go through BRFA. However, I also agree that BRFA isn't the right place for various reasons, not least that per WP:LOCALCONSENSUS they shouldn't be approving mass creation. I suggest we reword the policy to direct editors first to the village pump, and clarify that once consensus has been obtained there only bot operators need to go through BRFA. A 2022 proposal to amend MASSCREATE to
clarify that mass-creation through repetitive editing by hand is not different for policy purposes to automated/semi-automated mass-creation
andmake getting consensus for creation prior to mass creation per WP:MASSCREATE mandatory
failed to gain consensus.- It also failed to get a consensus against the proposal. Given that, I don't think it's appropriate to amend WP:MASSCREATE to exclude that interpretation when there wasn't a consensus that it is the wrong interpretation.
It is possible to create large numbers of articles without automation; I see editors doing it every day at NPP. For example, writing a stub on a species or location from scratch could take as little as ten minutes.
- I might be wrong, but I believe those tend to use boilerplate text - which I consider semi-automation as the boilerplate is a primitive tool. BilledMammal (talk) 10:09, 7 July 2024 (UTC)
- I think you are wrong, yes, but it's kind of beside the point. If you consider all forms of mass creation to be semi-automated, then what is the problem with amending the title of the section to read "Mass automated and semi-automated creation"? What's left out? – Joe (talk) 11:37, 7 July 2024 (UTC)
- It's redundant, and will make it harder to enforce the policy as editors have previously claimed that their mass creations are manual, even when there is clear evidence to the contrary such as them admitting to using scripts. BilledMammal (talk) 11:41, 7 July 2024 (UTC)
- I agree that it's redundant. I'm suggesting we add it anyway for clarity, because many people come here via a section link and do not realise that this section is part of the bot policy – that's all. I'm not sure I follow how that would make it harder to enforce the policy against people who are lying? – Joe (talk) 11:53, 7 July 2024 (UTC)
- It's redundant, and will make it harder to enforce the policy as editors have previously claimed that their mass creations are manual, even when there is clear evidence to the contrary such as them admitting to using scripts. BilledMammal (talk) 11:41, 7 July 2024 (UTC)
- I think you are wrong, yes, but it's kind of beside the point. If you consider all forms of mass creation to be semi-automated, then what is the problem with amending the title of the section to read "Mass automated and semi-automated creation"? What's left out? – Joe (talk) 11:37, 7 July 2024 (UTC)
It's part of the bot policy, not the editing policy
This always seems to get in the way when people start arguing over WP:MASSCREATE. It's fairly clear the community wants to consider mass creation in general, not just automated mass creation, but for historical raisins WP:MASSCREATE is in WP:Bot policy and so it has to be "about" bots in some manner. I suggested changing that in 2023, but few were interested in discussing it. Anomie⚔ 10:50, 7 July 2024 (UTC)It's fairly clear the community wants to consider mass creation in general, not just automated mass creation
– is it? How? As you said yourself, you didn't get support for that interpretation when you proposed it just last year. And as I said, in the 2022 RfC, a proposal to change MASSCREATE to say this explicitly failed, with the closing statement specifically noting opposition on the basis thathuman editing falls outside of the scope of bot policy
.- I think if you guys want MASSCREATE to apply to all articles you should obtain a consensus and then move it to the editing policy. In the mean time, what is wrong with clarifying in the title that a section of the bot policy applies to automated edits, using words copied verbatim from that section? – Joe (talk) 11:29, 7 July 2024 (UTC)
I think if you guys want MASSCREATE to apply to all articles you should obtain a consensus
- Doesn't that apply equally in the opposite direction? If you don't want it to apply to manual mass creations, you should obtain a consensus? BilledMammal (talk) 11:41, 7 July 2024 (UTC)
- I'm not proposing to change anything. You just said yourself that my edit was "redundant", i.e. it merely restates what is already there (verbatim). – Joe (talk) 11:43, 7 July 2024 (UTC)
- I think you have misinterpreted what I am saying. I see it as redundant because I see it as a tautology. The current text acknowledges that the policy applies to manual mass creation - regardless of my personal views on whether such a thing is possible - through the final paragraph which, as the exception that proves the rule, makes it clear that the WP:MEATBOT mass creation of content pages is required to go through BRFA. BilledMammal (talk) 11:44, 7 July 2024 (UTC)
- The final paragraph says that automated, semi-automated or bot-like creation of non-content pages do not need to go through BRFA. I don't see how that's relevant? – Joe (talk) 11:51, 7 July 2024 (UTC)
- I think you have misinterpreted what I am saying. I see it as redundant because I see it as a tautology. The current text acknowledges that the policy applies to manual mass creation - regardless of my personal views on whether such a thing is possible - through the final paragraph which, as the exception that proves the rule, makes it clear that the WP:MEATBOT mass creation of content pages is required to go through BRFA. BilledMammal (talk) 11:44, 7 July 2024 (UTC)
- WP:MASSCREATION applies to all mass creation of articles, both from bots and from WP:MEATBOTS. If you edit in a bot-like manner, it does not matter if you are actually a bot or just a random person making articles quickly from boilerplate text. Headbomb {t · c · p · b} 11:45, 7 July 2024 (UTC)
- @Headbomb: Yep, there's absolutely no disagreement on that point. My edit added the words "automated and semi-automated" to the section heading, and as WP:MEATBOT defines bot-like editing is equivalent to automated/semi-automated editing, the meaning remains unaltered. – Joe (talk) 11:47, 7 July 2024 (UTC)
- In that case, can I propose a compromise? Title the section
Mass automated, semi-automated, or meatbot page creation
. "meatbot" could perhaps be replaced with "bot-like". BilledMammal (talk) 11:52, 7 July 2024 (UTC)- Sure, I'm good with that. I'd avoid 'meatbot' – it's not the most dignified piece of wikislang. Although actually, since it's getting a bit of a mouthful, do we really need the word "mass"? Nobody's using bots, silicon or flesh, to create one or two articles, right? – Joe (talk) 11:55, 7 July 2024 (UTC)
- (edit conflict) Ugh. Please let's not make a long and confusing heading. Anomie⚔ 11:57, 7 July 2024 (UTC)
- Agreed. If the meaning remains unaltered, then there is no reason to make the change. Primefac (talk) 12:39, 7 July 2024 (UTC)
- The reason I've suggested above is that because many people come here via a section link, they don't realise that this section is part of the bot policy and so end up reading it out of context. Most other sections already have the word "bot" in their title or shortcut, which ameliorates that. – Joe (talk) 12:46, 7 July 2024 (UTC)
- Agreed. If the meaning remains unaltered, then there is no reason to make the change. Primefac (talk) 12:39, 7 July 2024 (UTC)
- In that case, can I propose a compromise? Title the section
- @Headbomb: Yep, there's absolutely no disagreement on that point. My edit added the words "automated and semi-automated" to the section heading, and as WP:MEATBOT defines bot-like editing is equivalent to automated/semi-automated editing, the meaning remains unaltered. – Joe (talk) 11:47, 7 July 2024 (UTC)
- I'm not proposing to change anything. You just said yourself that my edit was "redundant", i.e. it merely restates what is already there (verbatim). – Joe (talk) 11:43, 7 July 2024 (UTC)
- (edit conflict × 2)
As you said yourself, you didn't get support for that interpretation when you proposed it just last year
No, I said I didn't get much discussion at all. More specifically, xaosflux and BilledMammal supported, Rhododendrites refused to consider it outside the context of a full rewrite, and jc37 took it on a bit of a tangent. No one else replied.is it? How?
Have you read through the actual discussions, with an eye for how WP:MASSCREATE being in WP:Bot policy restricts how people can consider ways of handling mass creation? Look at the very close you linked as "failed to gain consensus", three of the seven oppose bullets hinge on "WP:BOTPOL can't regulate non-bot behavior".I think if you guys want MASSCREATE to apply to all articles
Personally I don't care. I'm just sick of WP:BOTPOL and WP:MEATBOT getting bent out of shape when people like you and BilledMammal argue over non-bot creations. Anomie⚔ 11:57, 7 July 2024 (UTC)- The main issue here is that people are trying to solve a problem that isn't a problem. If you've got a idiot on a stub-creating campaign using a boilerplate "X is a fictional small village in the Chronicles of Narnia.[1], you can block them under WP:MEATBOT, WP:MASSCREATE, WP:ENGAGE, WP:DISRUPT, WP:CONSENSUS... and per WP:MEATBOT the method they use to create these undesired stubs is irrelevant. If it's disruptive, it must stop. This should be straightforward to understand. Headbomb {t · c · p · b} 12:26, 7 July 2024 (UTC)
- The problem is enforcement. While it's clear that editors who wish to create significant numbers of nearly-identical articles are required to get approval from the community, it is difficult to determine an action when they fail to do so, and the articles they created are usually accepted as fait accompli - Lugnuts is the clearest example of this. We need a streamlined process to stop editors who are engaged in mass creation without approval, and to remove the articles created in violation of this policy. BilledMammal (talk) 12:35, 7 July 2024 (UTC)
- I think you'd probably want to start by make it more obvious where and how they're supposed to get that approval. Above you infer that people falsely claim to be creating articles by hand to evade this policy. Maybe that happens sometimes. But I think there's a larger group of editors to who are say, creating stubs on similar topics by copying and pasting the last stub and changing the details, who genuinely don't think that the "bot policy" has anything relevant to them. Even if they did find their way to making a BRFA, as directed by WP:MASSCREATE, they'd certainly conclude they were in the wrong place when asked to "create an account for your bot", specify "the computer language that this bot will be written in. E.g. Python, Java, C, VB, AutoWikiBrowser", provide "a link to the source code", and so on. The cruellest thing we do on this project is punish people for doing things that we never told them were forbidden. You have to set out the process before you can expect people follow it. – Joe (talk) 13:19, 7 July 2024 (UTC)
- "The problem is enforcement." If the problem is enforcement, fix enforcement. As for Lugnuts, he was banned in 2021 from created stubs under 500 words. And with Lugnuts, the problem never was policy, but WP:IDHT. And that's why he's now indef banned. Headbomb {t · c · p · b} 14:03, 7 July 2024 (UTC)
- I'm not sure that there's anything to enforce. When was the last time you saw someone creating more than 25 articles per day? It's unusual for anyone to even create 25 articles per week, and I don't think anyone has created 20–50 articles per day for any sustained, uninterrupted period of time. Even when Dr. Blofeld was creating 15,000+ articles per year, it was often 150 this day and 200 the next, but then nothing (or very little) for the next several days. WhatamIdoing (talk) 03:11, 8 July 2024 (UTC)
- The problem is enforcement. While it's clear that editors who wish to create significant numbers of nearly-identical articles are required to get approval from the community, it is difficult to determine an action when they fail to do so, and the articles they created are usually accepted as fait accompli - Lugnuts is the clearest example of this. We need a streamlined process to stop editors who are engaged in mass creation without approval, and to remove the articles created in violation of this policy. BilledMammal (talk) 12:35, 7 July 2024 (UTC)
- The main issue here is that people are trying to solve a problem that isn't a problem. If you've got a idiot on a stub-creating campaign using a boilerplate "X is a fictional small village in the Chronicles of Narnia.[1], you can block them under WP:MEATBOT, WP:MASSCREATE, WP:ENGAGE, WP:DISRUPT, WP:CONSENSUS... and per WP:MEATBOT the method they use to create these undesired stubs is irrelevant. If it's disruptive, it must stop. This should be straightforward to understand. Headbomb {t · c · p · b} 12:26, 7 July 2024 (UTC)
- It also says
- Thanks. I think it pretty clearly does not apply to all mass creations though, for the following reasons:
- @Anomie: You've reverted the compromise suggested by BilledMammal. Could you please explain why? This is not an RfC – nobody is being asked to !vote support/oppose. I can see that you said "Please let's not make a long and confusing heading", which I tried to do, and Primefac asked (after I had made the edit) what the reason for it would be, which I answered. – Joe (talk) 13:34, 7 July 2024 (UTC)
- Exactly because Primefac and I opposed the change, and I've made the counterproposal in the section below to address the concerns you have. Anomie⚔ 13:52, 7 July 2024 (UTC)
- As you know, consensus comes from reasoned discussion, and just saying you oppose something doesn't get us any closer to that. I think your idea to split the section is a good one but we needn't wait to see whether it consensus to fix this title. Do I understand correctly that your objection to "Automated, semi-automated or bot-like page creation" is that it's too long? In which case, how about "Bot or bot-like page creation", which aligns with other sections on this page and is no longer than most of them. – Joe (talk) 14:08, 7 July 2024 (UTC)
- The 'compromise' is bad and inaccurate. The issue is mass-creation, not "Automated, semi-automated or bot-like page creation" because that literally means any page creation whatsoever. Headbomb {t · c · p · b} 14:09, 7 July 2024 (UTC)
- BilledMammal's original suggestion was indeed "Mass automated, semi-automated or bot-like page creation", which I'm also fine with. I dropped the "mass" to try and address Anomie's complaint that it was too wordy, but he reverted anyway. – Joe (talk) 14:13, 7 July 2024 (UTC)
- FWIW I'm not so sure about that addition, having read the context. Too easy to make it seem like "automated and semi-automated" includes "bot-like editing", and I don't see why the addition has any benefit. — Rhododendrites talk \\ 14:22, 7 July 2024 (UTC)
- BilledMammal's original suggestion was indeed "Mass automated, semi-automated or bot-like page creation", which I'm also fine with. I dropped the "mass" to try and address Anomie's complaint that it was too wordy, but he reverted anyway. – Joe (talk) 14:13, 7 July 2024 (UTC)
- Exactly because Primefac and I opposed the change, and I've made the counterproposal in the section below to address the concerns you have. Anomie⚔ 13:52, 7 July 2024 (UTC)
- The policy is currently that automated/semi-automated creation has to get authorization, plus a line that MEATBOT applies. MEATBOT, in turn, is almost entirely about making mistakes while editing quickly. The only clue in MEATBOT that it could extend beyond holding people accountable for their mass-mistakes is
processes which operate at higher speeds, with a higher volume of edits, or with less human involvement are more likely to be treated as bots
. That seems reasonable to me. It doesn't prohibit any manual creation (and, BM, can we just stop with this argument that you alone make that "semi-automated editing tools" extends to include things like Microsoft Word or a boilerplate stored in notepad?), but if you go really fast and hard despite urges to slow down -- and especially if you make mistakes -- you may be asked to go through the bot authorization process. The problem is we seem to have a handful of "this 100% applies to everyone making more than a couple articles" folks on the front line, so it would help to have some additional clarity as to when going fast turns into bot-like editing of the sort that needs preauthorization. Unfortunately, we didn't do so well at figuring that out last time. :/ In light of all this, I don't quite understand the purpose of the heading change or its reversion. — Rhododendrites talk \\ 14:06, 7 July 2024 (UTC)- See the Narnia example. And if it's not clear, MEATBOT is clear. We don't care how you do it, if it's disruptive, stop. Headbomb {t · c · p · b} 14:11, 7 July 2024 (UTC)
if it's disruptive
- The problem is, some users consider any fast article creation disruptive. — Rhododendrites talk \\ 14:15, 7 July 2024 (UTC)- Exactly, that's why I changed the heading – sometimes you get people badgering other users for creating more than 25 articles at once manually, citing WP:MASSCREATE, and either missing or deliberately overlooking the fact that it's in the bot policy and therefore can only be read within the context of bot or bot-like editing. As for why it was reverted, I'm stumped too. First it was because it changed the meaning, then it was because it didn't change the meaning, then it was because it was too long, now it's because we should split it instead. I think. It's hard to keep up. – Joe (talk) 14:20, 7 July 2024 (UTC)
- @Joe Roe: Regarding
sometimes you get people badgering other users for creating more than 25 articles at once manually, citing WP:MASSCREATE
, can you give some examples? BilledMammal (talk) 02:56, 8 July 2024 (UTC)- Or even three articles: User talk:Markussep#WP:MASSCREATION. WhatamIdoing (talk) 03:08, 8 July 2024 (UTC)
- I think you miscounted, and looking at a few of those they weren’t manual - they were boilerplate. BilledMammal (talk) 03:20, 8 July 2024 (UTC)
- I specifically refer to the statement that "creating three articles between 18.44 and 18.47 is a much a higher frequency than 25-50 per day".
- Manual edits can be boilerplate, just like automated edits don't have to be boilerplate. WhatamIdoing (talk) 04:33, 8 July 2024 (UTC)
- I went through all of that editor's non-redirect article creations back to 2020. There was only one day in which that tool counts 25 articles (21 November 2022). They never exceeded that level, and rarely came close to it. However, Special:Contributions for that date finds only 22, and six of those are redirects. WhatamIdoing (talk) 04:53, 8 July 2024 (UTC)
- (edit conflict) I think their point with that statement was that if you are creating three articles in three minutes, you're obviously not doing it manually.
- However, we're getting off topic here. Examples of editors being badgered for genuine manual creations would be helpful to see, if you have them. BilledMammal (talk) 04:57, 8 July 2024 (UTC)
- Is your definition of "genuine manual creations" approximately "using completely different wording and sources in each article"? WhatamIdoing (talk) 05:05, 8 July 2024 (UTC)
- No, but I don't think us discussing this is going to be productive, so I will step back now. If editors like Joe have examples or want to discuss further, I will happily do so. BilledMammal (talk) 05:25, 8 July 2024 (UTC)
- Is your definition of "genuine manual creations" approximately "using completely different wording and sources in each article"? WhatamIdoing (talk) 05:05, 8 July 2024 (UTC)
- I think you miscounted, and looking at a few of those they weren’t manual - they were boilerplate. BilledMammal (talk) 03:20, 8 July 2024 (UTC)
- Or even three articles: User talk:Markussep#WP:MASSCREATION. WhatamIdoing (talk) 03:08, 8 July 2024 (UTC)
- @Joe Roe: Regarding
- Exactly, that's why I changed the heading – sometimes you get people badgering other users for creating more than 25 articles at once manually, citing WP:MASSCREATE, and either missing or deliberately overlooking the fact that it's in the bot policy and therefore can only be read within the context of bot or bot-like editing. As for why it was reverted, I'm stumped too. First it was because it changed the meaning, then it was because it didn't change the meaning, then it was because it was too long, now it's because we should split it instead. I think. It's hard to keep up. – Joe (talk) 14:20, 7 July 2024 (UTC)
- See the Narnia example. And if it's not clear, MEATBOT is clear. We don't care how you do it, if it's disruptive, stop. Headbomb {t · c · p · b} 14:11, 7 July 2024 (UTC)
References
- ^ CS LEWIS "The Chronicles of Narnia"
Kicking it out of botpol?
- The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section. A summary of the conclusions reached follows.
- RFC posted below. Anomie⚔ 23:16, 9 July 2024 (UTC)
I've drafted an RFC at User:Anomie/Sandbox2. Anyone have comments before I post it somewhere? Opinions as to whether we should do it here or WP:VPP? Anomie⚔ 13:26, 7 July 2024 (UTC)
- I think that's a good idea. Even if the content doesn't change, this discussion illustrates of the difficulty of relying on a local consensus of technically-focused editors to manage a policy on article creation. I don't think you need to do an RfC, though. A consensus of editors on this page that the section is no longer in scope would be sufficient, since we're only moving accepted policy around, not significantly changing it. – Joe (talk) 13:42, 7 July 2024 (UTC)
- I don't see what's to be gained by separating it from botpol. It's clearly bot-related. Headbomb {t · c · p · b} 13:59, 7 July 2024 (UTC)
- Except that people above are insisting that it also applies to creations that don't involve bots or bot-like edits, and therefore accurately titling it as part of the bot policy is unacceptable. – Joe (talk) 14:15, 7 July 2024 (UTC)
- It's not always bot related. Various discussions have wanted to consider more manual mass creations as well, but have had to struggle against it being part of WP:BOTPOL. So some have used that as an objection, and others try to stretch WP:MEATBOT to somehow make it apply. Even in the original proposal that created this section there was concern over restricting it to bots. Anomie⚔ 16:38, 7 July 2024 (UTC)
- I don't see what's to be gained by separating it from botpol. It's clearly bot-related. Headbomb {t · c · p · b} 13:59, 7 July 2024 (UTC)
- Thanks for getting it started. Not opposed to this in principle, but in this draft while the moved policy seems like it retains the same meaning, the summary text left behind says something different. Mainly, you've created a new page that applies to "automated and semiautomated content page creation" and summarized it with a line saying
[all] Mass page creation requires approval by the community
. Probably an assumption is built in because of the scope of the BOTPOL, but it would be good to spell out. — Rhododendrites talk \\ 14:20, 7 July 2024 (UTC)- The controlling policy on that is the new mass-creation page rather than botpol anyway, the point is to note the mass creation policy exists rather than to restate it in every particular. I'm wary of putting too much in here that may easily become obsolete once people have the opportunity to discuss just how much they want it to cover non-bot mass creations, but if others want to nitpick it to that extent too then 🤷. Anomie⚔ 16:38, 7 July 2024 (UTC)
- Regardless, right now the summary left behind in the draft changes policy. If your intent isn't to run a such an RfC, that line would need to change. — Rhododendrites talk \\ 12:37, 8 July 2024 (UTC)
- I still think you're over-interpreting it, but I adjusted the wording slightly to try to make you happy. Anomie⚔ 23:43, 8 July 2024 (UTC)
- Regardless, right now the summary left behind in the draft changes policy. If your intent isn't to run a such an RfC, that line would need to change. — Rhododendrites talk \\ 12:37, 8 July 2024 (UTC)
- The controlling policy on that is the new mass-creation page rather than botpol anyway, the point is to note the mass creation policy exists rather than to restate it in every particular. I'm wary of putting too much in here that may easily become obsolete once people have the opportunity to discuss just how much they want it to cover non-bot mass creations, but if others want to nitpick it to that extent too then 🤷. Anomie⚔ 16:38, 7 July 2024 (UTC)
- It’s a good idea. Let’s get this done, and then we can discuss other changes, such as the one proposed below. BilledMammal (talk) 02:14, 8 July 2024 (UTC)
- @Anomie, I think that the RFC question may be so long that many editors won't read it. WhatamIdoing (talk) 01:11, 8 July 2024 (UTC)
- I know you think no one will read more than the headline of anything, although why you think even "Should WP:MASSCREATE be severed from WP:Bot policy?" is too long I have no idea. That's the question, which I bolded to make it easy to pick out. The part before is background and everything after is defining what exactly that means because experience tells me that otherwise people will start arguing over how to rewrite the whole thing and we'll wind up with no consensus for anything. Anomie⚔ 11:10, 8 July 2024 (UTC)
- Your sandbox contains 730 words, which is more than most editors will read.
- Even if you place your signature (everything before the timestamp is "the RFC question") after the bold-face question, that's 138 words. I'd rate that as being possible, but still being longer than the average RFC question. WhatamIdoing (talk) 17:09, 8 July 2024 (UTC)
- I know you think no one will read more than the headline of anything, although why you think even "Should WP:MASSCREATE be severed from WP:Bot policy?" is too long I have no idea. That's the question, which I bolded to make it easy to pick out. The part before is background and everything after is defining what exactly that means because experience tells me that otherwise people will start arguing over how to rewrite the whole thing and we'll wind up with no consensus for anything. Anomie⚔ 11:10, 8 July 2024 (UTC)
Defining mass creation as >50 articles per day
- Separately, I've been wondering whether the way to address BilledMammal's (specifically his) ongoing concerns about MASSCREATE is to explain it in specific, unambiguous detail. When you picked that quotation from @Xeno out of the 2009 RFC, there were other options:
- "anything more than 25 or 50"
- "rapid creation"
- "in a rapid manner"
- "25-50 articles per day"
- "25–50
+
articles per day" - "clicking "save" every 5-10 seconds"
- "more than 50 articles in a short period"
- "more than 50 articles in a short amount of time".
- Thinking back at BilledMammal's multiple attempts to get rid of articles or prevent future creations, then general themes seem (to me) to be:
- He interprets "25 to 50" as having no time limit whatsoever. If you create one article a week, a year from now, you may be guilty of "mass creating" articles.
- He is primarily concerned about very short, very similar fill-in-the-blank articles, especially if it cites the same source as all the others, and most especially if that source is a database. For example, "_____ is a British cricket player" or "_____ is a fungus in the genus ______".
- I think if we replaced the quotation with a more detailed summary, that would resolve quite a lot of this.
− | While no specific definition of "large-scale" was decided, | + | While no specific definition of "large-scale" was decided, editors who want to create more than 50 articles in any 24-hour period should obtain prior approval.
|
- I do not expect this to make the anti-stub editors happy, but it would provide clarity about when creating a lot of articles is actually a WP:MASSCREATION matter, and when it's just creating a lot of articles.
- BTW, to the best of my knowledge, there have never been any actual mass creation attempts that were not automated or semi-automated. The idea that someone could manually write 50+ articles per day is not realistic. WhatamIdoing (talk) 01:11, 8 July 2024 (UTC)
- More than 50 per day would be more than 18,250 per year. For context, it was very rare for Lugnuts to exceed fifty articles per day.
- This change would result in the policy endorsing mass creation, not requiring it to get community approval. You’ve also misunderstood my interpretation of this; only similar articles created using mass creation techniques count towards the limit.
- I’ve also split this into a seperate section, to avoid derailing Anomie’s proposal. BilledMammal (talk) 01:25, 8 July 2024 (UTC)
- Lugnut's problem was IDHT, not that MASSCREATION was unclear. And 50 a day is too high a limit. 50 in a short term is better. 25 in a short term is also OK by me. We can leave that part undefined per "you know it when you see it" because as soon as you set a precise number, someone will go "but I made sure to edit at "X-1/time period", so MASSCREATION doesn't apply!" Headbomb {t · c · p · b} 01:51, 8 July 2024 (UTC)
- The Lugnuts situation came from two issues; WP:IDHT, and because the lack of clarity in WP:MASSCREATE made it hard for the community to enforce and thus address the IDHT issue.
- Largely agree on leaving it undefined; if an editor is creating 30 boilerplate articles a week for many months, then that’s obviously mass creation that requires community review and approval. BilledMammal (talk) 01:57, 8 July 2024 (UTC)
- MASSCREATE doesn't have anything to with "boilerplate articles". That's your idea. It's never been part of the policy. WhatamIdoing (talk) 02:01, 8 July 2024 (UTC)
- Falls under MEATBOT and/or semi-automated. BilledMammal (talk) 02:08, 8 July 2024 (UTC)
- Automated and semi-automated article creation does not have to use a boilerplate. (See also Wikipedia:Large language models.)
- MEATBOT applies to "high-speed or large-scale edits that a) are contrary to consensus or b) cause errors an attentive human would not make". It has nothing to do with the edits being "boilerplate" or repetitive in any way. WhatamIdoing (talk) 02:57, 8 July 2024 (UTC)
- If you're creating boilerplate articles, you're behaving like a bot. MASSCREATE doesn't prohibit boilerplate articles, but it does says that if you want to do that on a large scale, i.e. more than 25-50, you need consensus to do so. Headbomb {t · c · p · b} 08:02, 8 July 2024 (UTC)
- No, you're behaving like a human who used a boilerplate. Ditto if I take the time to write 20 totally different articles but publish them all at the same time. Even if, against the odds, you found consensus for calling the use of a boilerplate to manually create articles without errors a WP:MEATBOT issue, it still doesn't fall under that 25-50 rule, which is specifically about automated or semiautomated editing. — Rhododendrites talk \\ 12:42, 8 July 2024 (UTC)
- "behaving like a human who used a boilerplate", that's exactly what a WP:MEATBOT is. Again, it does not matter if you use an actual bot, semi-automation, or do things fully manually, if what you are doing is disruptive, you must stop. Headbomb {t · c · p · b} 14:38, 8 July 2024 (UTC)
- Headbomb, I quoted the relevant sentence from MEATBOT for you. MEATBOT does not actually say anything about boilerplates or repetitive tasks. It might be typical to interpret it that way, but it does not actually say that.
- It does say that editing against consensus (e.g., being disruptive) is unacceptable regardless of the method used to edit against consensus. WhatamIdoing (talk) 17:18, 8 July 2024 (UTC)
- Meatbot: A human (made of meat, unlike a robot) editor that makes a large amount of repetitive edits from their own account, often with semi-automated tools, much like a bot would. For the purpose of dispute resolution, it is irrelevant if edits are made by actual bots or by meatbots. See also WP:MEATBOT.
- Boilerplate editing is bot-like editing. Which, again, for the purpose of dispute resolution, is irrelevant, because if what you're doing is disruptive, you must stop and discuss and get consensus for what you're donig. I don't know why that's so hard to understand. Headbomb {t · c · p · b} 17:24, 8 July 2024 (UTC)
- Wikipedia:Bots/Dictionary#meatbot is not the policy.
- Nobody claims MEATBOT if you find and fix the same typo once a day for a year, because that's not bot-like editing.
- Nobody claims MEATBOT if you use the same format ("a boilerplate") to write a single article each day for a year, because that's not bot-like editing. WhatamIdoing (talk) 17:27, 8 July 2024 (UTC)
- "behaving like a human who used a boilerplate", that's exactly what a WP:MEATBOT is. Again, it does not matter if you use an actual bot, semi-automation, or do things fully manually, if what you are doing is disruptive, you must stop. Headbomb {t · c · p · b} 14:38, 8 July 2024 (UTC)
- No, you're behaving like a human who used a boilerplate. Ditto if I take the time to write 20 totally different articles but publish them all at the same time. Even if, against the odds, you found consensus for calling the use of a boilerplate to manually create articles without errors a WP:MEATBOT issue, it still doesn't fall under that 25-50 rule, which is specifically about automated or semiautomated editing. — Rhododendrites talk \\ 12:42, 8 July 2024 (UTC)
- WP:MEATBOT was created specifically because we trouble with an editor trying to claim that the bot policy didn't apply to their bot-like editing because they were completely manually filling in a boilerplate with no automation at all. The intent was to cut the knot with a duck test, and "or large-scale" was specifically included to apply to "slow and steady" bot-like editing. See WT:Bot policy/Archive 24#Clarification regarding high-speed human editing for the original discussion. Possibly some of the arguments in here have gotten things reversed or taken it too far (I'm not feeling up to reading through all of it in enough detail to work that out), but
It has nothing to do with the edits being "boilerplate" or repetitive in any way
is wrong. Anomie⚔ 23:39, 8 July 2024 (UTC)
- If you're creating boilerplate articles, you're behaving like a bot. MASSCREATE doesn't prohibit boilerplate articles, but it does says that if you want to do that on a large scale, i.e. more than 25-50, you need consensus to do so. Headbomb {t · c · p · b} 08:02, 8 July 2024 (UTC)
- Falls under MEATBOT and/or semi-automated. BilledMammal (talk) 02:08, 8 July 2024 (UTC)
- MASSCREATE doesn't have anything to with "boilerplate articles". That's your idea. It's never been part of the policy. WhatamIdoing (talk) 02:01, 8 July 2024 (UTC)
- BilledMammal, this change would result in the policy more precisely representing what the 2009 RFC (the one that eventually resulted in its creation) actually said. I grant that this would make it more difficult for editors to make up their own claims about what it says (e.g., that it was intended to prevent editors from creating more than 50 articles ever – a limit you're coming up on, by the way).
- It would be hardly surprising if Lugnuts usually complied with MASSCREATE in at least some minimal fashion, since about 90% of his article creations were after the RFC that led to the MASSCREATE rule. MASSCREATE was not about Lugnuts; it was primarily about an editor who was creating more than a thousand articles per month, and sometimes hundreds per day, with only a few seconds in between each article, and the effect that this volume had on review processes. Also, he's written more FAs than you've written articles of any kind, so please don't assume that he's a bad editor or doesn't know what he's doing.
- If you want to make MASSCREATE stricter, then you could make such a proposal, but a sound basis for that future discussion would first be understanding what the long-standing rule actually says (25–50 per day, not per month/year/lifetime), what it was supposed to do (avoid overwhelming review processes and give admins a chance to stop CSD-worthy problems before there were hundreds or thousands of articles to deal with), and how it has or hasn't worked for us (e.g., it has stopped flooding review queues, but it hasn't stopped the creation of low-quality articles).. WhatamIdoing (talk) 01:54, 8 July 2024 (UTC)
- I also highly question the need to do anything with our mass creation policy if the primary objective is to retroactively prevent an indef banned editor from IDHT behaviour. Headbomb {t · c · p · b} 01:57, 8 July 2024 (UTC)
- I use Lugnuts as a convenient benchmark to determine whether a proposal is non-viable; because the community considers his creations to be mass creations, any proposal that would redefine MASSCREATE in such a way that his creations would not be covered is very likely to be rejected. BilledMammal (talk) 02:06, 8 July 2024 (UTC)
(e.g., that it was intended to prevent editors from creating more than 50 articles ever – a limit you're coming up on, by the way)
- I don’t think anyone - who isn’t making a WP:POINT - interprets it that way, so can we please stop using that interpretation as a reason it’s problematic? It’s a straw man.
- In any case, the 2009 RfC was 15 years ago. It is too late to contest that close; if you think the wording is wrong, then open a new RfC. BilledMammal (talk) 02:00, 8 July 2024 (UTC)
- We don't need an RFC to choose a different quotation from the 2009 RFC, or to re-word it so that it accurately represents the 2009 RFC without using a direct quotation. WhatamIdoing (talk) 02:03, 8 July 2024 (UTC)
- We do, because any substantial WP:BOLD change to MASSCREATE is certain to be reverted - and what you propose is going to be seen by many editors as substantial, even if you disagree. BilledMammal (talk) 02:06, 8 July 2024 (UTC)
- There's a stage in between WP:PGBOLD and an RFC, called "forming consensus on the talk page".
- BTW, if you want to talk about strawman arguments, I suggest looking at the one implying that if the MASSCREATE approval process kicks in at 50 per day, then someone might actually create 50 articles per day, 365 days per year, and that the community would be helpless to stop them (assuming we wanted to, which is not always the case). WhatamIdoing (talk) 02:10, 8 July 2024 (UTC)
- We do, because any substantial WP:BOLD change to MASSCREATE is certain to be reverted - and what you propose is going to be seen by many editors as substantial, even if you disagree. BilledMammal (talk) 02:06, 8 July 2024 (UTC)
- We don't need an RFC to choose a different quotation from the 2009 RFC, or to re-word it so that it accurately represents the 2009 RFC without using a direct quotation. WhatamIdoing (talk) 02:03, 8 July 2024 (UTC)
- I also highly question the need to do anything with our mass creation policy if the primary objective is to retroactively prevent an indef banned editor from IDHT behaviour. Headbomb {t · c · p · b} 01:57, 8 July 2024 (UTC)
- Lugnut's problem was IDHT, not that MASSCREATION was unclear. And 50 a day is too high a limit. 50 in a short term is better. 25 in a short term is also OK by me. We can leave that part undefined per "you know it when you see it" because as soon as you set a precise number, someone will go "but I made sure to edit at "X-1/time period", so MASSCREATION doesn't apply!" Headbomb {t · c · p · b} 01:51, 8 July 2024 (UTC)
- I think you're misinterpreting the 2009 RFC. As I read it, the "anything more than 25 or 50" suggestion was not strictly time limited, but was limited to a "task". Extremely fast creation is one problem, but "slow and steady" can also add up to a problem. Some of the replies focused on speed, while others did not. An advantage of choosing a quote from the proposal rather than some other comment is that it was the proposal that everyone should have read (even if they didn't).Also you probably shouldn't be ignoring the 2022 RFC, where Question 3 focusing on rate limits (with much more nuance than this proposal) was rejected. Anomie⚔ 11:35, 8 July 2024 (UTC)
- @Anomie, that quote wasn't from the proposal. The proposal was very short: "Proposal: Any large-scale semi-/automated article creation task require BRFA".
- You quoted part of the OP's vote in favor of his own proposal, from a sentence that said ""Large-scale" is up for discussion, but I would say anything more than 25 or 50." Every single !vote that mentioned those numbers afterwards specified that this was to be interpreted per day or in a short time period. I think this was specified precisely because "slow and steady" does not cause the problems that they were trying to solve (e.g., too many articles for the review processes to handle in a single day or bot-like editing).
- Nobody minds if someone creates one article a day for a month. That never overwhelms review processes. That is never considered bot-like editing. WhatamIdoing (talk) 17:25, 8 July 2024 (UTC)
You quoted part of the OP's vote in favor of his own proposal
You see it as the vote. I see it as a clarification of the proposal. Not everything has to be in the headline. 🤷Every single !vote that mentioned those numbers afterwards specified that this was to be interpreted per day or in a short time period
Considering there were only three such !votes, two of which were opposes, I don't find that a very convincing argument. Meanwhile, a not insignificant number of the comments talk about mass creation needing review without reference to the rate of that creation.[one article a day for a month] is never considered bot-like editing.
Did I say it was? Anomie⚔ 23:09, 8 July 2024 (UTC)- Considering there were only three such !votes, I count four:
- "25-50 articles per day"
- "25–50
+
articles per day" - "more than 50 articles in a short period"
- "more than 50 articles in a short amount of time".
- plus several more specifically talking about "rapid" editing (e.g., "clicking "save" every 5-10 seconds"). One article per day does not involve clicking "save" every 5 or 10 seconds, even if @Headbomb says above that something as small and slow as fixing one typo each day is exactly what bot-like editing is. That it's happening slowly is irrelevant.
- Note, too, the section heading you linked above, which was "Clarification regarding high-speed human editing", not "Clarification of slow and steady human editing" or even "Clarification of human editing that is repetitive but happening at an average manual speed". The problem with high-speed editing, no matter what method is being used for it, is that someone can make a huge mess before anyone has a chance to notice. Slow and steady, no matter what method is being used for it, does not have that same risk. WhatamIdoing (talk) 00:09, 10 July 2024 (UTC)
- Your #2 is not a !vote, it's a comment in reply to someone else's !vote. You're the one who limited it to !votes mentioning those specific numbers. AnomieBOT has several tasks that only edit once per day, and in the past had one that only edited once per month, if that can illustrate for you that bot editing doesn't have to be fast. As for the section heading, are you trying to prove that you're one of the people who only reads the heading and not the discussion? And I note people can also make a huge mess while editing slowly to fly under the radar. Anomie⚔ 00:22, 10 July 2024 (UTC)
- The purpose of MEATBOT is not to prevent people from editing. It's to prevent people from editing so quickly, in such enormous volume, that the rest of us are at risk of having a huge mess to clean up later. One edit per day does not have that risk. Hundreds of edits per hour does. WhatamIdoing (talk) 00:32, 10 July 2024 (UTC)
- 🤷 Well, you're free to believe whatever you want, no matter how wrong it may be. Anomie⚔ 01:15, 10 July 2024 (UTC)
- The purpose of MEATBOT is not to prevent people from editing. It's to prevent people from editing so quickly, in such enormous volume, that the rest of us are at risk of having a huge mess to clean up later. One edit per day does not have that risk. Hundreds of edits per hour does. WhatamIdoing (talk) 00:32, 10 July 2024 (UTC)
- Your #2 is not a !vote, it's a comment in reply to someone else's !vote. You're the one who limited it to !votes mentioning those specific numbers. AnomieBOT has several tasks that only edit once per day, and in the past had one that only edited once per month, if that can illustrate for you that bot editing doesn't have to be fast. As for the section heading, are you trying to prove that you're one of the people who only reads the heading and not the discussion? And I note people can also make a huge mess while editing slowly to fly under the radar. Anomie⚔ 00:22, 10 July 2024 (UTC)
- Considering there were only three such !votes, I count four:
- IMO it's also factored in that if editors are doing some real work on each article that they create, we don't want to discourage that, and vice versa. No exact minimum, but something more than a stub from a database or database-like source. North8000 (talk) 19:51, 8 July 2024 (UTC)
- This factor is not mentioned in MEATBOT or MASSCREATE, but I assume it would be considered by the community if someone made a proposal under either policy provision. WhatamIdoing (talk) 00:37, 10 July 2024 (UTC)
- I think that two reasons for attention this are:
- Even though it's on the bot page, it's our main or only real rules regarding even non-bot mass creation. (unless I'm a real dummy in that area)
- There have been many discussions at different wp:notability pages where a common sentiment was avoiding mass creation, but it then gets said "but that is covered elsewhere", so it's important that it really is effectively "covered elsewhere". So another reason is that it's really needed to enable evolution of wp:notability guidelines.
- North8000 (talk) 19:51, 8 July 2024 (UTC)
- It doesn't cover mass creation that doesn't involve a bot or bot-like editing, because it's in the bot policy, and because it explicitly says so (
Any large-scale automated or semi-automated content page creation task [...]
). This has been discussed at great length recently; see above. – Joe (talk) 11:08, 9 July 2024 (UTC)
- It doesn't cover mass creation that doesn't involve a bot or bot-like editing, because it's in the bot policy, and because it explicitly says so (
- I'm also don't think that a hard limit is the way to go here. It's practically guaranteed to encourage gaming, i.e. posting pregenerated articles exactly every 28 minutes. It also doesn't address what I think Headbomb is trying to get at, which is that it's the disruptive outcome that is the problem, not exactly how it happened.
- If anything, I'd go in the other direction. Instead of trying to define mass creation, identify the problems it causes, and shift the guideline to address those. So you'd say that we don't want people to create articles so fast that they overwhelm the ability of other editors to patrol them, or create articles without checking that the individual contents and formatting is correct, or create articles from a single source without a strong expectation of notability, that kind of thing.
- That also fully detaches it from the bot policy, which I'm more and more convinced it should be. Using bots without approval is already forbidden, for anything. Why do we need to restate that it is extra forbidden for creating lots of articles? – Joe (talk) 11:27, 9 July 2024 (UTC)
- The problems are:
- The articles might be bad (e.g., non-notable, even hoaxes).
- And while we get bad articles every hour of the day, we don't want hundreds more bad articles all at once.
- Posting one article every 28 minutes would actually be great. We would actually prefer to have someone post one new article every 28.8 minutes round the clock than 49 articles at 12:01 a.m. and then disappear. Why? Because if the first few turn out to be really bad, we can block you before you've posted any more. We'd have to clean up (e.g., delete) the handful you've already posted, but we wouldn't have to delete dozens or hundreds.
- The goal with MEATBOT is to give the community a chance to intervene. Yes, please, be bold and post some articles. But don't dump hundreds on us; trickle them in at a rate that we can actually manage – and by "manage", we mean "determine whether you're making a mess and we need to stop you". Also, if you want to dump hundreds in one go, then please determine whether there's consensus first. In theory, if you're going to dump hundreds of articles in one go, and we both want those topics and approve of your content, we'll approve those. Rambot created a lot of articles with the consent of the community; here's a typical example of the fully automated output from its first day: the lead plus two sections amounts to 374 words, and every single fact taken from a single database. WhatamIdoing (talk) 00:30, 10 July 2024 (UTC)
- The problems are:
- I agree (and I think few would disagree ) that there must be guidance on non-bot mass creation of articles. Whether we acknowledge that this section in the bot policy also applies to non-bot activity, or have a separate guideline or policy for that. North8000 (talk) 13:22, 9 July 2024 (UTC)