Commons:Bots/Requests
If you want to run a bot on Commons, you must get permission first. To do so, file a request following the instructions below.
Please read Commons:Bots before making a request for bot permission.
I | Create a user account (while logged in to your normal account) and user page for the bot
On the bot's userpage, add {{Bot}}, which automatically adds the page to Category:Commons bots. Then add the following information to the bot's userpage (all this is mandatory):
|
---|---|
II | Create your bot request:
Add your bot request to the list here:
|
III | Test run
You can be demanded to make a short test run with your bot account (30–50 edits/uploads) to allow other users to review your bot's tasks. Unauthorized test run is not allowed. |
IV | Waiting for approval.
You now need to wait for community approval. A bureaucrat will close the request and will also grant a bot flag, where necessary. Closed requests are moved to Commons:Bots/Archive. |
|
Requests made on this page are automatically transcluded in Commons:Requests and votes for wider comment.
Requests for permission to run a bot[edit]
Before making a bot request, please read the new version of the Commons:Bots page. Read Commons:Bots#Information on bots and make sure you have added the required details to the bot's page. A good example can be found here.
When complete, pages listed here should be archived to Commons:Bots/Archive.
Any user may comment on the merits of the request to run a bot. Please give reasons, as that makes it easier for the closing bureaucrat. Read Commons:Bots before commenting.
OsmappBot (talk · contribs)[edit]
Operator: Zbytovsky (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
Bot's tasks for which permission is being sought: upload images, update their description and structured data
Automatic or manually assisted: automatically triggered, when upload happens on OsmAPP.org
Edit type (e.g. Continuous, daily, one time run): when user uploads an image on OsmAPP.org
Maximum edit rate (e.g. edits per minute): 20 ?
Bot flag requested: (Y/N): N
Programming language(s): TypeScript
Note: Bot is currently in development, asking to be able to test it. Zbytovsky (talk) 19:20, 19 November 2023 (UTC)
- Discussion
- Do I get it right? Someone adds an image to osmapp and your bot will transfer it to Commons. If so, how is ensured that the license of that image is a) valid and b) meets Commons' requirements? --Achim55 (talk) 19:50, 19 November 2023 (UTC)
- @Achim55 Yes, i think you got it right. As OpenStreetMap is also a open data project, we aim to have open licenses. Please see the design of upload dialog. I tried to write it the best I can, but I welcome any suggestions. It will add images with direct link to OSM feature, which means also proper map coordinates and category (eg. castle, guidepost, school, bridge etc.) Zbytovsky (talk) 07:35, 20 November 2023 (UTC)
- And what about rejecting photos that violates Commons:Freedom of panorama? Wiki Loves Monument did not fix this obvious problem for many years. --EugeneZelenko (talk) 15:51, 21 November 2023 (UTC)
- @EugeneZelenko Well, I didn't think of that, thanks for bringing it to my attention :-) As we are a map application, it is pretty easy to inform users based on country of the object. I created a mockup here - it would show up for the "NO" countries. Do you think it is sufficient for the beginning? I don't expect many users soon, but if it turns out to be an issue, it is quite easy to be more restrictive, or eg. check if there is a building in 1km, etc. Zbytovsky (talk) 20:22, 21 November 2023 (UTC)
- It'll be OK for beginning, but will be good idea to extend database (Wikidata is perfect place to share with WLM if organizers will finally comprehend the need to do so) to include information about sculptress/architects, so it'll be possible to allow what is in public domain. There are also countries with partial freedom of panorama, where photos of buildings are allowed, but not of works of art. --EugeneZelenko (talk) 16:00, 22 November 2023 (UTC)
- @EugeneZelenko Well, I didn't think of that, thanks for bringing it to my attention :-) As we are a map application, it is pretty easy to inform users based on country of the object. I created a mockup here - it would show up for the "NO" countries. Do you think it is sufficient for the beginning? I don't expect many users soon, but if it turns out to be an issue, it is quite easy to be more restrictive, or eg. check if there is a building in 1km, etc. Zbytovsky (talk) 20:22, 21 November 2023 (UTC)
- And what about rejecting photos that violates Commons:Freedom of panorama? Wiki Loves Monument did not fix this obvious problem for many years. --EugeneZelenko (talk) 15:51, 21 November 2023 (UTC)
- @Achim55 Yes, i think you got it right. As OpenStreetMap is also a open data project, we aim to have open licenses. Please see the design of upload dialog. I tried to write it the best I can, but I welcome any suggestions. It will add images with direct link to OSM feature, which means also proper map coordinates and category (eg. castle, guidepost, school, bridge etc.) Zbytovsky (talk) 07:35, 20 November 2023 (UTC)
FlickypediaBackfillrBot (talk · contribs)[edit]
Operator: Alexwlchan (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information) , working for the Flickr Foundation
Bot's tasks for which permission is being sought:
- Improving structured data for Flickr photos which have been uploaded to Wikimedia Commons, e.g. adding creator, license metadata.
- Adding the new Flickr photo ID (P12120) property to all files, to make it easier for other tools to work with Flickr photos
Automatic or manually assisted: unsupervised
Edit type (e.g. Continuous, daily, one time run): manually triggered
Maximum edit rate (e.g. edits per minute): tbc, probably 5–10 edits per second
Bot flag requested: (Y/N): Y
Programming language(s): Python
- Discussion
- Please don't make manual edits with the bot account. Please make few test edits. --Krd 14:31, 1 November 2023 (UTC)
- Interesting proposal. I made one edit to Commons:Flickypedia/Data Modeling, otherwise this looks good. Curious how you will handle conflicting existing SDC claims? --Schlurcher (talk) 17:53, 1 November 2023 (UTC)
- Good question! My general approach with these things is to be extremely conservative – imo the V1 bot should be purely additive, and any conflicts should be flagged for manual inspection.
- Then a couple of things might happen:
- The existing SDC looks wrong, so I make a manual edit from my account to fix it. e.g. I’ve already been looking at the use of source of file (P7482) for Flickr photos in the SDC snapshots, and I found ~200 cases where the URL points to the Flickr URL’s profile (
/photos/{username}
) rather than the photo itself (/photos/{username}/{photo_id}
). Those got dropped on a queue and I’ve been gradually tidying them up by hand – opening the files in question and making a manual edit from my account to point to the more specific URL. - The existing SDC looks right, so I work out why the bot is disagreeing. Is it a bug in my code, have I interpreted the data mapping wrong, is the data mapping at odds with the community approach to SDC, is the bot missing some bit of info on the Flickr photo. But the bot won't do anything on its own.
- The existing SDC looks wrong, so I make a manual edit from my account to fix it. e.g. I’ve already been looking at the use of source of file (P7482) for Flickr photos in the SDC snapshots, and I found ~200 cases where the URL points to the Flickr URL’s profile (
- There might also be cases where the existing SDC is wrong in large numbers and we'd want to write an automated fix, but that's somewhat risky and I’d want to be extremely careful before doing that. Two possible examples spring to mind:
- License versions. Flickr photos use CC 2.0 licenses, so that's what the bot will write into the SDC. But what if it finds a Wiki Commons file which links to the 4.0 version of the CC license? That sounds like an easy candidate for a fix buuuut I think there are Flickr users who leave descriptions on their photos saying "I license this as CC 4.0". A human copying their photo across would notice that; the bot might not. So in this case the bot would likely leave it as-is to avoid deleting info.
- Date granularity. Flickr has different levels of granularity for "date taken". Most photos are DDMMYY, but there are some which are MMYY or YY or "Circa YY". If there are lots of cases where there's an imprecise data but the SDC claims it's a full DDMMYY, we might consider automating that. (It's pretty obvious when this has happened – Flickr always returns a full timestamp from its API, but it sets all the unknown values to 0/1. So a YYYY becomes
taken="1950-01-01 00:00:00" takengranularity="6"
.) The bot could be written to fix these. But I don't know if that's a widespread issue in practice.
- If/when the bot does start editing existing SDC claims, I'll make sure we document those with examples – and if there are cases that seem contentious, I'll bring them back for community discussion before actually implementing them. Alexwlchan (talk) 08:13, 2 November 2023 (UTC)
- Interesting proposal. I made one edit to Commons:Flickypedia/Data Modeling, otherwise this looks good. Curious how you will handle conflicting existing SDC claims? --Schlurcher (talk) 17:53, 1 November 2023 (UTC)
- 👍 I’ll probably get to making some test edits early next week, and I’ll link them here for inspection when they’re done. Alexwlchan (talk) 07:46, 2 November 2023 (UTC)
- I know it’s been a couple of weeks and nothing has happened on this.
- I am planning to get back to this bot eventually, but right now I’m prioritising getting the “uploader” part of Flickypedia working. Once that’s done, I’ll come back to the Backfillr bot. Alexwlchan (talk) 09:47, 23 November 2023 (UTC)
- I left some comments about the data model at Commons_talk:Flickypedia/Data_Modeling#Some_feedback_based_on_User:GeographBot. Where can we find the source code? The bot I mentioned is at https://github.com/multichill/toollabs/blob/master/bot/commons/geograph_uploader.py . Multichill (talk) 20:52, 16 November 2023 (UTC)
- Thanks for your feedback on the model; I’ll address that there.
- The source code isn’t public yet, but by the time we run the bot properly (and probably before I have time for test edits) it’ll be available here: https://github.com/Flickr-Foundation/flickypedia Alexwlchan (talk) 09:49, 23 November 2023 (UTC)
MappeComuniBot (talk · contribs)[edit]
Operator: Sette-quattro (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
Tasks: Upload up-to-date maps of Italian municipalities. There are about 8000 municipalities, boundaries are updated yearly by the national statistical agency. Here is an example of image that will be uploaded, the description will be automatically updated for each municipality: [1]
Bot's tasks for which permission is being sought: upload several images automatically
Automatic or manually assisted: automatic supervised (through OpenRefine)
Edit type (e.g. Continuous, daily, one time run): One time run each year
Maximum edit rate (e.g. edits per minute): I don't know the edit rate allowed using the new OpenRefine function for uploading images. As fast as it can, i suppose.
Bot flag requested: (Y/N): Y
Programming language(s): used through OpenRefine Batch Uploading https://www.youtube.com/watch?v=sc6aNNmsNCI
Sette-quattro (talk) 16:03, 20 October 2023 (UTC)
- Discussion
- Please make test run. --EugeneZelenko (talk) 15:34, 22 October 2023 (UTC)
- Done a test run Sette-quattro (talk) 21:10, 1 November 2023 (UTC)
- Looks OK for me, but please fix space in Category: Maps of municipalities of the province of Monza and Brianza. Will be good idea to create templates for legend and data source. --EugeneZelenko (talk) 15:01, 3 November 2023 (UTC)
- Ok, I will create the two templates. Thank you and best regards. Sette-quattro (talk) 09:16, 8 November 2023 (UTC)
- Looks OK for me, but please fix space in Category: Maps of municipalities of the province of Monza and Brianza. Will be good idea to create templates for legend and data source. --EugeneZelenko (talk) 15:01, 3 November 2023 (UTC)
GlaMainBot (talk · contribs)[edit]
Operator: Beao (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
Bot's tasks for which permission is being sought: I've automated lossless crops for Category:Images from the German Federal Archive with borders and need permission to start uploading the results.
Automatic or manually assisted: Manually assisted to start.
Edit type (e.g. Continuous, daily, one time run): One time run for uploads, otherwise daily for my listing of User:GlaMainBot/Most_used_images_for_cleanup
Maximum edit rate (e.g. edits per minute): At most ten uploads per minute.
Bot flag requested: (Y/N): Y
Programming language(s): TypeScript (using mwn)
Beao 07:49, 23 September 2023 (UTC)
- Discussion
- Just to clarify: Is this to upload as a new version, or to overwrite? If the latter, is there a consensus to do so? I see that those borders include photo credits to the individual photographers, and these are from a respected archive, so I'd just want to make sure that there is agreement that this is desired; I've seen similar situations go either way. Clearly more useful in Wikipedia articles without the borders, but it's not clear to me that we don't want also to host a version with the credit line on the image. - Jmabel ! talk 23:58, 1 October 2023 (UTC)
- My thought is to overwrite. I've not seen any written consensus on the matter, but in practice that's what has been done for years in this category. I think that implies a silent consensus, considering these captions have been digitally added by the archive and provide no additional information not already in the description. Beao (talk) 08:36, 2 October 2023 (UTC)
- Please make some example edits. Krd 17:06, 6 October 2023 (UTC)
- All right, here are three examples:
- File:Bundesarchiv Bild 137-068842, Sonderzug der Einwandererzentralstelle.jpg
- File:Bundesarchiv Bild 137-068843, Sonderzug der Einwandererzentralstelle.jpg
- File:Bundesarchiv Bild 137-068852, Sonderzug der Einwandererzentralstelle.jpg Beao (talk) 10:55, 7 October 2023 (UTC)
- Looks good to me. Krd 13:49, 11 October 2023 (UTC)
- Please make some example edits. Krd 17:06, 6 October 2023 (UTC)
- Any more information or discussion needed? Beao (talk) 12:22, 14 October 2023 (UTC)
- [2] Why is this updated so often? Krd 03:04, 19 October 2023 (UTC)
- The "Images with watermarks" category is very big, so the retrieval of file usage statistics is batched to a fixed number of images every hour to avoid performance spikes, and I update the gallery after every batch. Is updating gallery pages too often problematic? I could do it less often (I'm thinking if images are not removed from the category), and also avoid doing it when nothing changes. Beao (talk) 15:53, 20 October 2023 (UTC)
- Please at least don't update when nothing significant changes. Krd 07:42, 21 October 2023 (UTC)
- The "Images with watermarks" category is very big, so the retrieval of file usage statistics is batched to a fixed number of images every hour to avoid performance spikes, and I update the gallery after every batch. Is updating gallery pages too often problematic? I could do it less often (I'm thinking if images are not removed from the category), and also avoid doing it when nothing changes. Beao (talk) 15:53, 20 October 2023 (UTC)
- [2] Why is this updated so often? Krd 03:04, 19 October 2023 (UTC)