Revision as of 13:12, 17 April 2008 editAKAF (talk | contribs)Extended confirmed users872 edits →BetacommandBot Task 9: edit comment← Previous edit | Revision as of 13:17, 17 April 2008 edit undoBetacommand (talk | contribs)86,927 edits →DiscussionNext edit → | ||
Line 49: | Line 49: | ||
There is reasonable consensus that another account should be created, for a multitude of reasons. My approval is conditional on a new account for that part of the bot. — ''']''' '']'' 12:48, 17 April 2008 (UTC) | There is reasonable consensus that another account should be created, for a multitude of reasons. My approval is conditional on a new account for that part of the bot. — ''']''' '']'' 12:48, 17 April 2008 (UTC) | ||
:And since you've already been running the bot from your user account (I count about 700(?) edits in your user contributions, but since you're also running the defaultsort bot and the crosswiki link removal bot, it's a bit hard to be sure), I don't see the difference between migrating the code to betacommandbot or to betacommandbot_t9. ] (]) 13:09, 17 April 2008 (UTC) | :And since you've already been running the bot from your user account (I count about 700(?) edits in your user contributions, but since you're also running the defaultsort bot and the crosswiki link removal bot, it's a bit hard to be sure), I don't see the difference between migrating the code to betacommandbot or to betacommandbot_t9. ] (]) 13:09, 17 April 2008 (UTC) | ||
::Please stop making unfounded attacks against me. you have no clue what your talking about. instead of making assumptions why not ask me, I do use some python tools but who says they are bot? there is no proof so shut up. ] 13:17, 17 April 2008 (UTC) |
Revision as of 13:17, 17 April 2008
BetacommandBot Task 9
Function: Replacing all images on en.wiki with commons versions that have the same SHA1 hashes, (gathered from toolserver queries) and then double checked with MD5 hash checks when running. once all en.wiki usages are converted to the commons name it tags it as a commons dupe.
Discussion
Will this end up tagging a PNG image if the commons version is an SVG? And will the SHA1 and MD5 hashes ensure pixel by pixel similarity under all circumstances? Also if the description is different, what will happen? MBisanz 03:25, 17 April 2008 (UTC)
- images of different file types dont have the same hashes, only exact copies have the same hashes. β 03:27, 17 April 2008 (UTC)
- I have an article for you to test this on, if you want. In general, how do you know a local copy isn't needed for some reason, for instance as an anti-vandalism measure (eg. DYK)? In some cases the local copy may not have the same name. Gimmetrow 03:37, 17 April 2008 (UTC)
- if there are templates used for DYK I can add them to the bots ignore list, but in general {{NoCommons}} works. As for different names the bot replaces them with the new name. β 03:39, 17 April 2008 (UTC)
- Will it then correct any articles that may have include the renamed picture? Q 03:41, 17 April 2008 (UTC)
- Define renamed image. what Gimmetrow was saying was that images on commons may not have the same name as the version on en. what I said was that I would update en with the commons name if they were different. β 03:43, 17 April 2008 (UTC)
- Well say a picture is uploaded to commons as ILoveBeans.jpg, and then some user uploads it here as I_Love_Beans.jpg. From what I can tell from your bot will say "Hey, these two pictures are the same, lets delete this local one and use the commons pic". So what happens to articles that were including I_Love_Beans.jpg? Q 03:46, 17 April 2008 (UTC)
- I replace I_Love_Beans.jpg with the commons name (ILoveBeans.jpg). (I thought I said that :/) β 03:47, 17 April 2008 (UTC)
- Oh, ok, ignore me, I was just equating replace with something else :) Q 03:53, 17 April 2008 (UTC)
- I replace I_Love_Beans.jpg with the commons name (ILoveBeans.jpg). (I thought I said that :/) β 03:47, 17 April 2008 (UTC)
- Well say a picture is uploaded to commons as ILoveBeans.jpg, and then some user uploads it here as I_Love_Beans.jpg. From what I can tell from your bot will say "Hey, these two pictures are the same, lets delete this local one and use the commons pic". So what happens to articles that were including I_Love_Beans.jpg? Q 03:46, 17 April 2008 (UTC)
- Define renamed image. what Gimmetrow was saying was that images on commons may not have the same name as the version on en. what I said was that I would update en with the commons name if they were different. β 03:43, 17 April 2008 (UTC)
- If an image from commons is used for DYK, a local copy is uploaded to avoid vandalism. It's supposed to have {{c-uploaded}} and usually has the same name, but not always. Such images shouldn't be replaced by the commons name. Gimmetrow 03:50, 17 April 2008 (UTC)
- Images with {{c-uploaded}} are now skipped. β 03:52, 17 April 2008 (UTC)
- Presumably such images would also not be tagged as dupes because they are linked from a protected page and the bot therefore couldn't replace all links with the commons link. Would it get halfway through replacing the links before it discovered the protection? Bovlb (talk) 07:30, 17 April 2008 (UTC)
- Images with {{c-uploaded}} are now skipped. β 03:52, 17 April 2008 (UTC)
- Will it then correct any articles that may have include the renamed picture? Q 03:41, 17 April 2008 (UTC)
- if there are templates used for DYK I can add them to the bots ignore list, but in general {{NoCommons}} works. As for different names the bot replaces them with the new name. β 03:39, 17 April 2008 (UTC)
- I have an article for you to test this on, if you want. In general, how do you know a local copy isn't needed for some reason, for instance as an anti-vandalism measure (eg. DYK)? In some cases the local copy may not have the same name. Gimmetrow 03:37, 17 April 2008 (UTC)
Looks pretty solid to me. You might want to look around a bit for other templates like {{NoCommons}}. How common is it's use, in the described situation? SQL 03:53, 17 April 2008 (UTC)
Another question, will it tag the old enwiki images for CSD? SQL 03:54, 17 April 2008 (UTC)Reading the function summary twice should be required. SQL 03:55, 17 April 2008 (UTC)
It seems to be that the request function seems to be missing the second half of the process. As I read the request, it relates to replacing images that are found in the Image name space and renaming them, if necessary, to match the name used in Commons. No mention seems to be made of the articles in the Article name space that make use of the images. Can the function request be expanded to include replacing (updating) all image links where the image name has been changed to match that used in Commons? Dbiel 04:11, 17 April 2008 (UTC)
- it will replace all usages of en.wiki's copy with the commons name regardless of the namespace. (a complete replacement) β 04:14, 17 April 2008 (UTC)
- Thanks for the reply; I thought it would, but it just was not clearly stated in the request or in the discussion that followed. So thanks for clearing that up. It sounds like a good use for the bot. Dbiel 04:59, 17 April 2008 (UTC)
- Ok, my questions are answered. As long as commons will be able to get deleted image page stuff on request, then I have no issue. MBisanz 04:31, 17 April 2008 (UTC)
Could the bot also do duplicates on only en.wiki? Example: Image:Tunday_Akintan.jpg and Image:Tunday.jpg. My count has us at about 4,000 duplicates on en.wiki alone. Perhaps simply give favor to the one with more image links? --MZMcBride (talk) 04:39, 17 April 2008 (UTC)
- that would be a separate request that is down the road. β 04:43, 17 April 2008 (UTC)
I fundamentally disagree with adding yet another task which will produce a large number of edits to this already function-heavy bot. I would request that this new task be run under a new bot account. AKAF (talk) 06:44, 17 April 2008 (UTC)
Sounds like a good task, but I echo AKAF in that there shouldn't be a big deal with moving it to another account. — Werdna talk 06:58, 17 April 2008 (UTC)
This sounds like a good and helpful task. Please create a new bot account for it instead of adding yet another task to BetacommandBot. rspeer / ɹəədsɹ 07:22, 17 April 2008 (UTC)
Do you have an estimate on how many images are affected, and how many articles? Bovlb (talk) 07:30, 17 April 2008 (UTC)
Suppose an image on en.wiki is being validly used under fair use. Someone (wrongly) copies it to Commons. The bot then deletes the original image. Then, on Commons, the copied image is deleted because fair use images are not permitted. The image is then lost from its original fair use article. What is the best approach? Should the bot not move images for which fair use is being claimed? Thincat (talk) 09:32, 17 April 2008 (UTC)
- Reading all this again (and I am not familiar with these sort of operations) will the bot merely edit articles to point to identical commons images, or will it additionally flag en.wiki images for deletion, or delete identical en.wiki images? Thincat (talk) 10:58, 17 April 2008 (UTC)
As stated in the second sentence of the summary, it will tag them (i.e. flag them for deletion). — Werdna talk 10:59, 17 April 2008 (UTC)
- Thank you.
So, regarding my "fair use" issue, I'll try and find how this is dealt with for images tagged "as a commons dupe". Any pointers? Of course, the bot wouldn't create a new problem but it might amplify any existing problem. Thincat (talk) 11:11, 17 April 2008 (UTC)Referring to Category:Images on Wikimedia Commons and Category:Images with the same name on Wikimedia Commons as of 17 April 2008 (are these the best references?), there seem to be no warnings about not deleting images with an apparently valid fair use claim (and possibly deleting the commons duplicate). The instructions should be improved before the bot is run or the bot should not handle images with a fair use claim. Thincat (talk) 11:30, 17 April 2008 (UTC)
- Ill add in to skip pages with {{non-free or {{non free which should filter out all fair use media. but as for creating a new account I dont really see a need. it just makes my life harder. β 12:44, 17 April 2008 (UTC)
- Thank you, I think this is the best way to deal with fair use claims. There is clearly a problem with duplicated images where one has a fair use claim on en-wiki. Perhaps a bot listing of such images might help towards manual investigation? Thincat (talk) 12:54, 17 April 2008 (UTC)
There is reasonable consensus that another account should be created, for a multitude of reasons. My approval is conditional on a new account for that part of the bot. — Werdna talk 12:48, 17 April 2008 (UTC)
- And since you've already been running the bot from your user account (I count about 700(?) edits in your user contributions, but since you're also running the defaultsort bot and the crosswiki link removal bot, it's a bit hard to be sure), I don't see the difference between migrating the code to betacommandbot or to betacommandbot_t9. AKAF (talk) 13:09, 17 April 2008 (UTC)
- Please stop making unfounded attacks against me. you have no clue what your talking about. instead of making assumptions why not ask me, I do use some python tools but who says they are bot? there is no proof so shut up. β 13:17, 17 April 2008 (UTC)