Extension talk:Replace Text/Archive 2008
This page is an archive. Do not edit the contents of this page. Please direct any additional comments to the current talk page. |
Large Wikis
For wikis with numerous pages, an approach based on jobs might be more appropriate. Just a thought. Jean-Lou Dupont 15:48, 29 April 2008 (UTC)
- Asking administrators to set up a cron job might be overkill - what do you think about the idea of including a command-line script instead, for the case of large wikis? Yaron Koren 19:25, 7 May 2008 (UTC)
- I was thinking about a MediaWiki custom job written as an extension: each page request (from any user) would trigger X number of replace tasks. This is the general way jobs under MediaWiki work. Jean-Lou Dupont 19:38, 7 May 2008 (UTC)
- Oh, I wasn't even aware of that functionality. I just looked it up; that definitely clarifies some things about how templates and categories work... do you know if there's documentation somewhere on how to add jobs? Yaron Koren 19:44, 7 May 2008 (UTC)
- I usually just look at the code in those cases ;-) Look at JobQueue.php. Jean-Lou Dupont 19:49, 7 May 2008 (UTC)
- Oh, I wasn't even aware of that functionality. I just looked it up; that definitely clarifies some things about how templates and categories work... do you know if there's documentation somewhere on how to add jobs? Yaron Koren 19:44, 7 May 2008 (UTC)
- Okay, thanks; that looks ideal. It could be useful not just for this extension, but for functionality like SMW's refresh-semantic-data action, which currently runs only as a command-line script. Is there any code of yours I can look at that uses the job queue? Yaron Koren 19:55, 7 May 2008 (UTC)
- Unfortunately no but it looks straight forward enough. Jean-Lou Dupont 20:02, 7 May 2008 (UTC)
- Okay, thanks; that looks ideal. It could be useful not just for this extension, but for functionality like SMW's refresh-semantic-data action, which currently runs only as a command-line script. Is there any code of yours I can look at that uses the job queue? Yaron Koren 19:55, 7 May 2008 (UTC)
[RESOLVED] Error
I have this error: Fatal error:
Maximum execution time of 30 seconds exceeded in path\includes\Database.php on line 681
--85.18.14.29 10:53, 4 May 2008 (UTC)
Regexp and others
Hi Yaron.
Could you make this excellent extension use Regexp and make some check box on giving it a preview mode or a list of data which will be changed. :)--Roc michael 20:19, 8 May 2008 (UTC)
- Hi Roc, I totally agree with you. +: I think we could have a way to select in which namespaces or categories we want to do the search, and could be a multiselect box, something like this: (first field) http://lab.arc90.com/tools/multiselect/
[RESOLVED] Finally!
Was looking for something like this as this would be the only reason to use bots what I'm too dumb to. Thanks!
Still I get memory problems: On localhost I get this error after ~3 seconds:
Fatal error: Allowed memory size of 33554432 bytes exhausted (tried to allocate 3672 bytes) in C:\xampp\htdocs\wiki12clean\includes\Title.php on line 142
On my sever where I have a php memory limit of 30MB it gives me a blank page. Error log:
PHP Fatal error: Maximum execution time of 30 seconds exceeded in includes/Database.php on line 818
(MW 1.12.0 / PHP 5.1) --Subfader 21:47, 10 May 2008 (UTC)
- And I also support the "per namespace" feature. --Subfader 15:56, 11 May 2008 (UTC)
[RESOLVED] Version 0.2 released
Hi, to everyone who's had problems with the extension before, I just released a new version, 0.2. This version has two big changes: the replacement of text is a two-step process, with the user first shown a list of pages to be replaced so they can choose which ones to replace in; and actual replacement is done through MediaWiki jobs, as Jean-Lou Dupont suggested above. Both of these changes should combine to deal with many of the problems people have experienced, like server timeout, memory overload, and lack of control over replacements. Please try out this new version when you can, and let me know if it improves things for you. Yaron Koren 20:53, 12 May 2008 (UTC)
[RESOLVED] Still my memory problem
Thanks for the fast work. But still: Fatal error: Allowed memory size of 33554432 bytes exhausted (tried to allocate 3672 bytes) in C:\xampp\htdocs\wiki12\includes\Title.php on line 142
I use MW 1.12.0rc and I saw heavy changes are done to Title.php on SVN these days. --Subfader 21:53, 12 May 2008 (UTC)
- At which stage of the process do you see this error? Yaron Koren 22:59, 12 May 2008 (UTC)
- Right after hitting Continue --Subfader 05:16, 13 May 2008 (UTC)
- Okay, I think I see what's going on - the way in which the extension currently finds the list of pages is extremely inefficient. I think there's an easy fix, which I'll add to the next version, which should speed things up, and reduce memory usage, considerably. Yaron Koren 14:54, 13 May 2008 (UTC)
- Maybe use a form of iterator to solve the issue? Insert a job in the queue with 1st article as target. When a job concludes, insert a new job with the next article id and so on. With this method, no more long-lead-time-fetching. Of course, you could fetch a handful of article IDs per-job and process them as well. I would also suggest adding an entry in the log when the whole process is finished. Jean-Lou Dupont 15:01, 13 May 2008 (UTC)
- Actually, this comes before jobs are used - when the list of pages containing the search string is found. Thanks for the previous jobs tip, though. Yaron Koren 15:50, 13 May 2008 (UTC)
- Well, you could have the job go through all pages one by one also; no more searching first and post replace job after. Jean-Lou Dupont 15:54, 13 May 2008 (UTC)
- Actually, this comes before jobs are used - when the list of pages containing the search string is found. Thanks for the previous jobs tip, though. Yaron Koren 15:50, 13 May 2008 (UTC)
- Maybe use a form of iterator to solve the issue? Insert a job in the queue with 1st article as target. When a job concludes, insert a new job with the next article id and so on. With this method, no more long-lead-time-fetching. Of course, you could fetch a handful of article IDs per-job and process them as well. I would also suggest adding an entry in the log when the whole process is finished. Jean-Lou Dupont 15:01, 13 May 2008 (UTC)
- Okay, I think I see what's going on - the way in which the extension currently finds the list of pages is extremely inefficient. I think there's an easy fix, which I'll add to the next version, which should speed things up, and reduce memory usage, considerably. Yaron Koren 14:54, 13 May 2008 (UTC)
- Right after hitting Continue --Subfader 05:16, 13 May 2008 (UTC)
- I don't have much clue but would a preview still be possible job-by-job? Speed is no issue with this extension imo since a bot isn't fast either. So if the preview would take 10 minutes to load; fine for me. At least it would give me a better feeling than bashing everything without undo. Btw: there are ~8500 articles and 16,112 total pages in my local wiki i tested this on. --Subfader 19:28, 13 May 2008 (UTC)
- If the time and/or memory limits are still being exceeded on the preview page, you could paginate it, fetch like 100 previews at a time. To get around these limits and still have all the checkboxes on one page, you could use an AJAX function to call back to the server and pull the next page of results - might take several minutes to display all the previews, but it won't time out because its done in multiple requests.
- Of course, there's still 2 places a large wiki could have problems - the time taken by the database to produce a result set (which you can't really work around - but its a high enough limit to not really matter), and memory on the client side (making your browser run really slowly - solve by splitting the preview into pages) --MrAngel 23:33, 4 July 2008 (UTC)
[RESOLVED] Works fine
Hi, it works smooth now. The "Job" speed varies a lot (on my local wiki). 1-15 actions per minute or even 10 minutes pause. Is it ok this way?
Preview loads but is a bit buggy when div's are included in the previewed section where the "term to be replaced" is included. This broke the layout. The source code of the section is
<div class='comment'><div class='commenttop'><div class='username'>MDB</div><div class='date'>Â Â 00:59, 6 January 2008 (CET)</div></div><div class='commentpost'>
<div class='commenttextfield'>more HMC, esp MayDay ~95 (''from Main Page'')</div>
</div></div>
Thanks anyway, will use it a lot I think :) --Subfader 17:29, 14 May 2008 (UTC)
- Okay, that's great to hear. I have no idea how long jobs are supposed to take - for me it works almost instantly, on a small wiki, but in any case, I doubt if the process can be sped up. Thanks for the bug report - that's a new bug in the current version; I'll fix it in the next version. Yaron Koren 19:25, 14 May 2008 (UTC)
- Just wanna let you know that divs or <> seem to make trouble genereally. I tried these:
- find "<div class='username'>MDB" | replace with "<div class='username'>NEW" | >> MYSQL error
- same for "class='username'>MDB" | ">MDB<" gave me a broken preview similar to above example. Subfader 21:47, 15 May 2008 (UTC)
- 0.2.2 works fine in the preview now, just found another "preview breaker". Center:
==<center>This is a list of mix pages with new comments</center>==
(source of "MixesDB:Latest Comment Pages")- Not bugging me, just to let you know. --Subfader 16:28, 27 May 2008 (UTC)
- Oh, oops, I was planning to fix that in this version but then I forgot. I just fixed it now, so if you get the "new" version 0.2.2, it should work. :) Thanks for the bug report. Yaron Koren 16:55, 27 May 2008 (UTC)
- Update on my above note "1-15 actions per minute or even 10 minutes pause": That was on my localhost wiki. On my real server I have constantly ~30 actions per minute. Already did ~2500 replacements and it's just smoof. --Subfader 17:02, 27 May 2008 (UTC)
- Smoof, awesome. Yaron Koren 17:09, 27 May 2008 (UTC)
- When someone views a page, it uses the same PHP process to run a Job. So if you get no pageviews for 10 minutes, you'll also have no Jobs run in that time. I suspect this is only likely to happen on very small sites, or on your local/testing servers. --MrAngel 16:35, 1 July 2008 (UTC)
[RESOLVED] Using Bots / Recent Changes
You could insert $wgNewUserSupressRC into the code in order to enable hiding bot flagged users from recent changes (as Extension:NewUserMessage does), but better world be to instert only 1 entry to the normal recent changes (User Bla: Global Text Replace 'old text' to 'new text'. Still all details should show up in the user's contribs --Subfader 07:54, 15 May 2008 (UTC)
- Sorry, what's a bot-flagged user? Yaron Koren 15:12, 15 May 2008 (UTC)
- Could be the term is wrong :) A user which you flag as bot / give bot rights. Such aren't displayed in Recent Changes. But mine is with the replacing actions. --Subfader 15:54, 15 May 2008 (UTC)
- Oh, I see. Well, maybe having a bot as the user doing the text replace is a bit misleading, since it's not a script but a special page making the changes? Or maybe that's too subtle a distinction to worry about. Yaron Koren 16:31, 15 May 2008 (UTC)
- My users have no clue about wiki, better than letting them think it was done manually by a "user". That's why I ment a single entry in Recent Changes describing it was a bunch of changes would be useful. If you see ne need for it, no problem. --Subfader 16:34, 15 May 2008 (UTC)
[RESOLVED] Replace text in title (move)
Why is it not possible? I can imagine I'm not the only one who needs it this way. Either soft by additionally moving or hard (if it's possible and easier). Categories won't need to be moved though (that's just one manual change and losing all revisions for the category is ok). So there could be a tickbox before the preview [_] Change article names as well (move)? --Subfader 16:06, 15 May 2008 (UTC)
- Well, my thought was that (a) there are challenges when moving a page, if a page already exists at the new location, and (b) the number of pages that would need to be moved is too small to justify the effort. How many pages would you want renamed, anyway? Yaron Koren 17:59, 15 May 2008 (UTC)
- I don't plan to use this for one job "that needs to finally be done" only. I (want to) use this regulary, as mentioned I'm too dumb for setting up real bots and this replace text tasks would be the only reason. I waited long to find something like this (see first post by me). And the terms I'd rename are e.g. artist names which in all cases both exist in the article title and in the article content (as category link). Deleting the old and creating the new category is no effort then but moving 5-30 pages per replace task is. Subfader 19:46, 15 May 2008 (UTC)
I'd really appreciate such an addition. You could easily seperate it into 2 different tasks. On the Special page with the input fields you could make check boxes for:
"Replace Text in: [x]Article [_]Title (move)" with article beeing checked by default. Please let me know what you think about it. --Subfader 09:09, 30 May 2008 (UTC)
- It does make sense to allow it, and there's probably no need for a separate checkbox, since the user gets to choose exactly what gets replaced anyway. The big complication is when the new title matches some page that already exists. My guess is that it's impossible for the extension to do anything reasonable in that case, so it's best to just ignore those, maybe with some text to the user listing the pages that can't be moved. Otherwise, I guess it's a reasonable idea. Yaron Koren 16:39, 30 May 2008 (UTC)
- Checkbox: Ok, from the point of view that you want to replace misspellings it makes sense to not split them up. But as soon as you want to differ it will be unhandy to go through a long list unchecking the title replacements. Grouping article and title replacements would help. New title already exists: Yep a list afterwards with the (linked) articles not being moved cos the new title already exists (linked as well for a quick cehck) would be good. --Subfader 20:45, 30 May 2008 (UTC)
- I still dream about it :) Only problem is the "if exists"? I see that displaying a report afterwards is crap as you mainly close the window or start another replacement. So what about a report of not changes titles via email? --Subfader 17:23, 27 October 2008 (UTC)
- Okay, I still want to add this feature in. I don't think the reporting afterwards is a big deal - the system should know even before the user clicks "Replace" which pages can be moved and which can't, so on the page with the "Replace" button, there can be a section stating, "The following pages cannot have their titles changed because a page with what would be the new title already exists." Yaron Koren 13:06, 29 October 2008 (UTC)
- lol, indeed. that's even more easy & handy :) --18:02, 30 October 2008 (UTC)
- Okay, it's in the new version, 0.3. Check it out and let me know what you think. Yaron Koren 21:06, 5 November 2008 (UTC)
Buddy, you did some great work. SO HAPPYÂ :) First I thought it would be good to have a second checkbox "Only move titles" but that would break article links to moved pages and wouldn't fix redirects. On the preview page I would highlight the sections a bit more so it's
Please select the pages for which you want to replace 'Abc' with 'Xyz':
<h2>In page text (including redirects)</h2>
check ...
<h2>>In page title (move)</h2>
check ...
Invert selection
<h2>No move possible (target exists already)</h2>
list ...
Speed was fine, nothing else noticed by now. Below some "issues". Thanks a lot! --Subfader 10:48, 6 November 2008 (UTC)
- Cool, I'm glad it worked for you. Thanks for the suggestions; I have some comments below... Yaron Koren 22:29, 6 November 2008 (UTC)
Link to existing target
In the "target exists already" section it would be good to have the existing target page linked (to see why and maybe to do some manual action on Xyz):
The following pages cannot be moved:
- Abc because Xyz already exists
--Subfader 11:41, 6 November 2008 (UTC)
- That's a good idea. Yaron Koren 22:29, 6 November 2008 (UTC)
[RESOLVED] Move action in Recent Changes
Just a minor thing: the move actions don't show up in Recent Changes when the user is a bot. But I can track it via the Move log though. --Subfader 11:41, 6 November 2008 (UTC)
- Isn't that true for all bot actions - doesn't it also happen for regular text replacements? Yaron Koren 22:29, 6 November 2008 (UTC)
- The user is a bot and article text replacements appear (as minor changes). In a single replace action where i replace something in 2 article texts and in 10 article titles, I will only see the 2 text replacements in recent changes, the moves only in the move log. --Subfader 08:53, 7 November 2008 (UTC)
- I had the same problem, page moves done by my "pseudobot" were flagged correctly as "bot" and don't appear on Recent Changes, but the minor edits were not flagged as "bot" and do appear on the RC list. You'll need to check if the user is a bot during page edits and set the flag rc_bot to 1 in table recentchanges. Capmo 07:13, 11 November 2008 (UTC)
- (To Subfader: it is possible to also see edits by bots on the Recent Changes page, just click on "Show Bots" on top of that page.)
- Ah the minor thing is causing it. Ok, thanks. --Subfader 12:47, 11 November 2008 (UTC)
Moving Redirects
Redirects shouldn't be moved? I don't get it, can't concentrate. Anyway I ended up with redirct pages redirecting to non-existing pages. --Subfader 11:41, 6 November 2008 (UTC)
- I don't understand - can you give an example of what's going on? Yaron Koren 22:29, 6 November 2008 (UTC)
Ok I was confused by too many pages at once. Let's nail it down: If a redirect page with the "to be replaced" word exists, a new and useless redirect page will be created including the new word and redirecting to the content page:
Create a page 1 X
and move it to 2 X
. We have:
2 X
as only content page and1 X
redirecting to it.
Now replace "X" with "Z".
Text replacements:
1 X
: #REDIRECT [[2 X]]
>> #REDIRECT [[2 Z]]
Move actions:
1 X
(redirect page) >> 1 Z
2 X
(content page) >> 2 Z
Afterwards we have:
1 X
redirecting to2 Z
(correct)2 X
redirecting to2 Z
(correct)2 Z
as only content page (correct)1 Z
redirecting to2 Z
but was created by the move action for no usage, cos that can't be linked to.
--Subfader 13:04, 9 November 2008 (UTC)
- This sounds like a case where the user just shouldn't have replaced the text within a redirect, no? Yaron Koren 23:58, 9 November 2008 (UTC)
- I don't understand? Within? No. The extension shouldn't create new redirect page names. --Subfader 11:10, 10 November 2008 (UTC)
- What do you mean that "1 Z" can't be linked to? Yaron Koren 13:38, 10 November 2008 (UTC)
- Page "1 Z" didn't exist before = there are no links to "1 Z" anywhere in the wiki or internet = no need to move "1 X" which creates a redirect page "1 Z" redirecting to "2 Z". --Subfader 17:23, 10 November 2008 (UTC)
Image moves
If an image "1.jpg" is linked on a page to be displayed and you replace and move "1.jpg" to "2.jpg" then the file page can't be moved but on the page it will be "2.jpg" which doesn't exist. So you must enable File moves via $wgAllowImageMoving (experimental), which I didn't check yet. Otherwise you end up with broken image links you don't even know about, esp when the preview for page replacement doesn't display the replacement in red for the file cos there's a replacemnt before on the page. --Subfader 11:41, 6 November 2008 (UTC)
[RESOLVED] Special Characters
I get an MYSQL error trying to replace â or â with ' or "n´t" to "n't":
1064: You have an error in your SQL syntax; check the manual that corresponds to your MySQL server version for the right syntax to use near 't%' AND p.page_namespace != 1 AND p.page_namespace != 3 AND p.page_namespace ' at line 5 (localhost)
Any idea? --Subfader 19:27, 22 May 2008 (UTC)
- Thanks for the bug report; this will be fixed in the next version. Yaron Koren 20:01, 22 May 2008 (UTC)
- Also magic words like __NOEDITSECTION__ aren't found. --Subfader 19:23, 24 May 2008 (UTC)
- How interesting: I was able to reproduce that bug, but it took me a while to figure out what was causing it. The problem isn't magic words, it's that the extension can't handle strings at the very beginning of pages (as I assume __NOEDITSECTION__ is for you). This, too, will be fixed in the next version; thanks. Yaron Koren 20:18, 25 May 2008 (UTC)
- Yep, I always added __NOEDITSECTION__ at the very beginning of the article. --Subfader 06:48, 26 May 2008 (UTC)
- How interesting: I was able to reproduce that bug, but it took me a while to figure out what was causing it. The problem isn't magic words, it's that the extension can't handle strings at the very beginning of pages (as I assume __NOEDITSECTION__ is for you). This, too, will be fixed in the next version; thanks. Yaron Koren 20:18, 25 May 2008 (UTC)
- Works fine now in 0.2.2. Thanks! --Subfader 16:31, 27 May 2008 (UTC)
- It also doesn't find [[foo]] eg for replacing [[Category:Foo]] --Subfader 18:14, 2 June 2008 (UTC)
- I don't understand - are you trying to replace "Foo" or "[[Foo]]"? Yaron Koren 18:39, 2 June 2008 (UTC)
- My ver. 0.2.2 replaced hundreds of cases of "[[Category:abc123]]" with "{{abc123}}". Brian7632416 18:47, 2 June 2008 (UTC)
- I tried to replace "[[Category:Foo]]" but it wasn't found although it's there. Of course I could replace "Category:Foo" which would work most of the cases if one wants to change the category. Although the problem I wanted to fix was solved otherwise now, I just wanted to let you know. --Subfader 19:05, 2 June 2008 (UTC)
- Ok that was on my localhost. On my server it does the trick. Sorry for the confusion. --Subfader 07:21, 3 June 2008 (UTC)
- What's causing the problem on your localhost, do you know? Yaron Koren 13:45, 3 June 2008 (UTC)
- Nope, dunno. Found another bug. when replace "foo" with "<!--foo" or "<includeonly>foo" the following page is cut off by the tag. --Subfader 21:34, 11 July 2008 (UTC)
[RESOLVED] Error with Apostrophes in Find String in 0.2.7
A database query syntax error has occurred. This may indicate a bug in the software. The last attempted database query was: (SQL query hidden) from within function "". MySQL returned error "1064: You have an error in your SQL syntax; check the manual that corresponds to your MySQL server version for the right syntax to use near 'M %' AND p.page_namespace != 1 AND p.page_namespace != 3 AND p.page_namespace' at line 5 (localhost)".
Tried to replace |I'M | with |I'm | Found out it's about apostrophes in the find string. E.g. you can replace |Im | to |I'm | but not |I'm | with something else. It was working fine in version 0.2.3 --Subfader 16:19, 26 September 2008 (UTC)
- Thanks for the bug report - I just fixed this (again :) ) in the new version, 0.2.8. Yaron Koren 20:07, 1 October 2008 (UTC)
[RESOLVED] Run as minor edits
It should be possible to let it run as minor edits, since people won't be amused to get lots of watchlist notifications for minor changes on lots of pages. Extension:NewUserMessage uses it if you wan tto check. --Subfader 19:32, 22 May 2008 (UTC)
- Hm... that's a good idea, but do you think it should be up to the admin? Or is that too much effort, and it's better to just always make them minor edits? Yaron Koren 20:06, 22 May 2008 (UTC)
- If possible I think it should be optional minor/not-minor. The replaced text could be a simple change or it could be a significant change occuring across multiple pages. Smaug 20:09, 22 May 2008 (UTC)
- A checkbox on teh special page would be the best, if not fro admin only in the php files would be enough for me as I would run all as minor edit as I cannot think of a case where the watching users should be informed about the change --Subfader 20:24, 22 May 2008 (UTC)
- Minor Edit works fine in 0.2.2. Thanks a lot! --Subfader 16:35, 27 May 2008 (UTC)
- Cool. I set it so that every edit is a minor edit, because that's the simplest solution, and because text replacements are a minor change pretty much by definition. But if anyone still thinks that should be a user option, let me know. Yaron Koren 17:14, 27 May 2008 (UTC)
Multiple lines
Thanks Yaron for another stellar extension. Any plans for handling multiple lines? --Tosfos 03:49, 25 May 2008 (UTC)
- Hi, I'm glad you find it useful. Regular expressions might be added soon for finding strings, which should allow for replacing multiple lines (replacing something with multiple lines might be trickier, but we'll see.) Yaron Koren 20:20, 25 May 2008 (UTC)
- I'm thinking it might be easier for non-techies if they were presented with two text areas so that they can represent a new line by simply going to the next line. Perhaps this can be implemented in addition to regular expressions at some point. --Tosfos 01:32, 3 June 2008 (UTC)
[RESOLVED] Automated Routines
Would be useful to define routines that are run regulary for replacements that need no verification preview but need to be done every now and then (like obvious mispellings) to keep the data clean. A routine could include those settings:
- define user unhder which the actions are run
- define timespan, e.g. weekly, monthly (where to start?)
- define replacements 'X' with 'Y'
Only implent if it doesn't take you too much coding. --Subfader 14:38, 25 May 2008 (UTC)
- I don't believe MediaWiki has any mechanism for scheduling regular tasks; let me know if you know of something. Yaron Koren 20:22, 25 May 2008 (UTC)
- No I don't know about schedules either. The point is I have already 20 standard replacements noted I would run every 2 weeks to keep my data clean. Maybe it's easier to forget the schedule but to combine many replacements in 1 job without preview and warnings? Again, if it's too much coding, no problem. I can still run them manually every now and then. doesn't take too much time though. --Subfader 20:39, 25 May 2008 (UTC)
- CHanged to resolved, would be too much potential for unseen mistakes. --Subfader 17:31, 10 November 2008 (UTC)
[RESOLVED] Log Out in Job Queue
Whoa, took me a while to realize that you cannot log out while the job queue is running. I logged out like 20 times, cleared the cache, tried to log in as a different user, nothing. I stayed logged in as the bot user I use for the text replacements. Maybe make a note somewhere to avoid confusion for others. ;) --Subfader 19:41, 25 May 2008 (UTC)
- Thanks for that note. I still don't fully understand how jobs work in MediaWiki, but your observation clarifies some things I've noticed. I'll add something to the documentation. Yaron Koren 20:34, 25 May 2008 (UTC)
- Good if I can help. Another thing i realized: While the job queue was running (+1 hour) I changed my own talk page. And everytime I logged out the system logged me in again. I can tell because it always displayed me the "you got a new message" thingy for the talk page change. I watched my talk page, logged out, again: "you got a new message". Cache? I wonder why the system didn't notice I've read my talk page already since my last edit of it. Not buggy, just may be helpful to understand the job queue. --Subfader 20:36, 25 May 2008 (UTC)
- Ok this is weird now but maybe the solution. I just started replacement actions as "User B" (bot grants) on a different machine(!) (a windows VPS server). On my local PC(!) where I'm logged in as "User A" I can see "User B" on the upper right User Links section. Although I'm not logged in as "User B" it makes me think so (see above). {{CURRENTUSER}} gives me "User A" also when I edit etc.
Can anyone confirm thsi behaviour? This is highly confusing for all other users being logged in seeing a different user name on top while job queue is running, what can take hours for my replacements. --Subfader 07:03, 28 May 2008 (UTC)
- Ok this is weird now but maybe the solution. I just started replacement actions as "User B" (bot grants) on a different machine(!) (a windows VPS server). On my local PC(!) where I'm logged in as "User A" I can see "User B" on the upper right User Links section. Although I'm not logged in as "User B" it makes me think so (see above). {{CURRENTUSER}} gives me "User A" also when I edit etc.
- Re-checked. When I run a job queue as User X, everyone sees the UserLinks on top as if he would be logged in as User X (but actually isn't). This also the case for all not logged in visitors. Very confusing for the users. I also tried adding the user to different user groups but no joy ;) --Subfader 15:47, 20 June 2008 (UTC)
- In ReplaceTextJob.php, you have (lines 43-49)
if ($num_matches > 0) { global $wgUser; $wgUser = User::newFromId($this->params['user_id']); $edit_summary = $this->params['edit_summary']; $flags = EDIT_MINOR; $article->doEdit($new_text, $edit_summary, $flags); }
- Note that a Job is run in the same process as a normal page request, after parsing but before output generation. So any code in the Skin file will see the $wgUser that you have changed here. As well as the effects you mention, this may also confuse some other extensions. I'd suggest putting the user back when you've finished with it. Something like:
if ($num_matches > 0) { global $wgUser; $realUser = $wgUser; $wgUser = User::newFromId($this->params['user_id']); $edit_summary = $this->params['edit_summary']; $flags = EDIT_MINOR; $article->doEdit($new_text, $edit_summary, $flags); $wgUser = $realUser; }
- Hope that helps some --MrAngel 16:48, 1 July 2008 (UTC)
- Oh, oops... that problem makes sense now. Thanks for finding the bug, and explaining a solution. This will go into the next version. Yaron Koren 20:34, 1 July 2008 (UTC)
- Yay :) --Subfader 10:39, 2 July 2008 (UTC)
- It's finally fixed now, in version 0.2.4. Yaron Koren 17:15, 6 August 2008 (UTC)
- From the extension page 'Note': "Even more importantly, it unfortunately means that replacements will not happen if the user who initiated them is logged out; if you have started a large set of text replacements, please stay logged in as that user to ensure that these replacements get finished as soon as possible." That is and never was the case.? The replacements are done, no matter if I stay logged in or just log out. --Subfader 09:08, 21 July 2008 (UTC)
- Oh, I see - I misinterpreted what you had said before; I guess the issue was just that "real user" bug. I'll fix the documentation. Yaron Koren 13:16, 22 July 2008 (UTC)
[RESOLVED] Version 0.2.1 Working Well - and THANKS Yaron Koren
- MediaWiki: 1.11.1
- PHP: 5.2.3 (apache2handler)
- MySQL: 5.0.51a
- Extensions:
- MassEmail 0.3.5
- RSS Reader 0.2.4
- AWC's MediaWiki Forum 2.3.4
- Google News Bar 071007
Replace Text extension (0.2.1) works very well for me on the above. The extension page and instructions are easily understandable and a pleasure to read. The installation itself was quick and easy.
I'm a document/text-oriented perfectionist who needed something to tidy up the sloppy submissions on my recipe wiki (DishiWiki). I'm new to MediaWiki and had just begun to investigate bot-editing but wasn't even able to find where to start, even though MediaWiki is full of references to bot-edits. All along, though, all I needed to do was to make changes such as ½ to 1/2 (for legibility, in that case), and similar types of "search and replace" operations. Since I'm new to MediaWiki, I did what I knew until I found a better way, so Replace Text enables me to go back and change what I did poorly when I didn't know any better.
I think a paragraph about what is happening in the background (is it a true cron job?) and how logging in and out (or attempting to do so) affects the "cron job" would be helpful, but Replace Text is running right now over on my wiki, and I'm filling the time by sending this congratulations to you. My experience was similart to Subfader's (above). I get about 10 pages edited per minute (average about 5 replaces per page), with about a 10-minute pause between distinct replace requests that were entered in rapid succession. (This is NOT a deal breaker, IMO.)
I hope somebody who knows how to do so will give you a star (or whatever they do here) to recognize a worthy contribution. Thanks again, Yaron. -- Brian7632416 02:19, 26 May 2008 (UTC)
- Thanks for the compliments, and I'm glad the extension has been useful to you. I must admit that my similar lack of understanding of bots is partly why I created the extension in the first place. :) I'll definitely try to improve the documentation. Yaron Koren 23:43, 26 May 2008 (UTC)
Will Wild Cards Work in Replace Text?
- MediaWiki: 1.11.1
- PHP: 5.2.3 (apache2handler)
- MySQL: 5.0.51a
- Extensions:
- MassEmail 0.3.5
- RSS Reader 0.2.4
- AWC's MediaWiki Forum 2.3.4
- Google News Bar 071007
- Replace Text 0.2.2
Replace Text 0.2.2 works great. What about wild cards? How could I replace these:
- [[Onions|green onion]] with "green onion," and
- [[Turnips|Turnip greens]] with "Turnip greens," and
- [[Avocados|Haas avocado]] with "Haas avocado"
all at once to reduce to plain text all the intra-wiki links that appear on pages I have imported? Maybe, e.g.,:
Original text = "[["*"|"**"]]" and Replacement text = **
--72.220.151.42 18:41, 27 May 2008 (UTC)
- There are no plans right now to add wildcard handling, but it could happen. Yaron Koren 19:25, 27 May 2008 (UTC)
- I add lots of external links. It would be great to know I am able to delete tham all at once. E.g. when a site is down I'd like to be able to remove all links where [http://somesite.com ...]. I guess it's hard to define the string from tag start to tag end if a tag is given at all and not just http://somesite.com... Maybe a feature for some 1.x version :) --Subfader 16:21, 1 February 2009 (UTC)
Restrict to categories
It would be useful to only let the replacement be applied on pages in a defined category (additional box) so unticking many pages in the preview for some standard text isn't neccessary. Combined with the multiple lines this would make it extremely powerful and handy. --Subfader 21:00, 5 June 2008 (UTC)
[RESOLVED] Handling Quotes
Is there anyway to replace a string with quotes in it ie;
style="background:#ffdead;"
Is what I want to change but it only finds upto before the quote so it only finds
style=
Am I doing something wrong?
- If you simply want to change the color find
#ffdead
/#FFDEAD
and replace.? --Subfader 05:02, 10 June 2008 (UTC)- Thanks for the bug report; there was a bug in the handling of strings with quotation marks. This should be fixed in the latest version, 0.2.3. Yaron Koren 14:11, 24 June 2008 (UTC)
[RESOLVED] Problems
I have an article where i have this words: Mamoeiro and
[[Logradouros:Bairro Mamoeiro|Mamoeiro]]
. I have a lot of other articles that have Mamoeiro word on the text too. I wanted to replace all words Mamoeiro by:
[[Logradouros:Bairro Mamoeiro|Mamoeiro]]
. When i do it on replace text, the articles that already have
[[Logradouros:Bairro Mamoeiro|Mamoeiro]]
will be replaced by this wrong long string:
[[Logradouros:Bairro [[Logradouros:Bairro Mamoeiro|Mamoeiro]]|bairro [[Logradouros:Bairro Mamoeiro|Mamoeiro]]]]
. So i really think to solve this problem, Replace text should add the check box per word and not per article.
- Here's how I handle that situation in any type of search-and-replace operation, be it a word-processor document, HTML, XML, etc.:
- Temporarily replace [[Logradouros:Bairro Mamoeiro|Mamoeiro]] with a string that is sure not to appear elsewhere. (I like "%%").
- Replace all the remaining (standalone) "Mamoeiro"s with [[Logradouros:Bairro Mamoeiro|Mamoeiro]].
- Replace (restore) all the "%%"s with [[Logradouros:Bairro Mamoeiro|Mamoeiro]].
- You will want to think through each instance of using this technique to prevent conflicts, and be sure to let the job queue completely finish each replace operation before invoking the next one. Hope this helps. -Brian7632416 00:23, 14 June 2008 (UTC)
- Hi Brian, It's a workaround that can help, but i think will be great if we have a real solution.
- It's just a "replace text" extension. It's helpful to us non-programmers. I don't want to have to check hundreds of check-boxes (in a per-word paradigm). I'll stick to the Replace Text version that does not make that change, and I'll use the "workaround" (?) mentioned above. -Brian7632416 01:04, 14 June 2008 (UTC)
- Way easier. Use brain! :) Just find the blank after Mamoeiro (cos this is not included in the code you already have added). FIND "Mameiro " REPLACE WITH "[[Logradouros:Bairro Mamoeiro|Mamoeiro]] ". --Subfader 15:05, 14 June 2008 (UTC)
- But there may be one of any number of punctuation marks after a freestanding/non-piped/non-linked "Mameiro," so multiple swipes would be needed. OP said he has "a lot of other articles with the word on [sic] the text." That's why I like to work backwards, starting by "disguising" the text that should remain unchanged, in the end. -Brian7632416 16:13, 14 June 2008 (UTC)
- Way easier. Use brain! :) Just find the blank after Mamoeiro (cos this is not included in the code you already have added). FIND "Mameiro " REPLACE WITH "[[Logradouros:Bairro Mamoeiro|Mamoeiro]] ". --Subfader 15:05, 14 June 2008 (UTC)
- It's just a "replace text" extension. It's helpful to us non-programmers. I don't want to have to check hundreds of check-boxes (in a per-word paradigm). I'll stick to the Replace Text version that does not make that change, and I'll use the "workaround" (?) mentioned above. -Brian7632416 01:04, 14 June 2008 (UTC)
- In this case, I suspect the best fix would be to add a "don't replace text within links" checkbox - easy enough to do (actually, I just did it - need to test thoroughly before submitting a patch, though) --MrAngel 16:14, 5 July 2008 (UTC)
- Hi MrAngel It's really a good feature, thanks. As i can see by your comments it's not a good idea have per-word replace. If per-word replace really can't be added, I think we should have an option to see all words being replaced. I say this because if i have for example five words that will be replaced on the same article, replacetext doesn't show all words "lines". Understand? Is it possible? Lleoliveirabr 20:03, 5 July 2008 (UTC)
[RESOLVED] Installation issue
I've tried to install this extension into my wiki. I added
include_once('extensions/ReplaceText/ReplaceText.php');
$wgGroupPermissions['sysop']['replacetext'] = true;
to LocalSettings.php and the form appears on the wiki - but when I press Continue, it takes me back to the main page of the wiki and nothing happens. I don't see the confirmation page (and no text replace happens)
After I press Continue, the address bar shows:
http://robertinventor.com/wiki/index.php?target_str=aaa&replacement_str=aaabbbb which just brings up the main page of the wiki.
What am I missing? Robertwalker 11:07, 15 July 2008 (UTC)
- Odd. http://robertinventor.com/wiki/index.php?title=Special:ReplaceText is working correct but http://robertinventor.com/wiki/index.php/Special:ReplaceText takes you to Main Page. index.php/ should work teh same as index.php?title=. You changes some Apache settings on your server? --Subfader 14:33, 15 July 2008 (UTC)
- No, nothing like that, only the LocalSettings.php for MediaWiki. The site is hosted by bluehost so they set everything up. BTW I have secure shell access so can run commands interactively on the server, but don't have root access. Robertwalker 18:03, 15 July 2008 (UTC)
- Could this be anything to do with why the extension isn't working on my wiki? Robertwalker 23:11, 15 July 2008 (UTC)
- That's a bug in Replace Text - it should add 'title' as a hidden field in the form, just to handle cases like yours. I'll add it to the next version - till then, I guess use the workaround that Subfader found. Yaron Koren 10:34, 16 July 2008 (UTC)
- Okay, thanks, what's the work around? I did a search of this page for subfader but can't seem to find anything related?Robertwalker 12:03, 16 July 2008 (UTC)
- Yaron ment my post above where it looks like http://robertinventor.com/wiki/index.php?title=Special:ReplaceText is working. If not, wait for the update. He's quick with new versions :) --Subfader 21:38, 16 July 2008 (UTC)
- Oh I see, no that's not working. The page shows up okay, but if you enter a replacement string e..g just testing, aaa replaced by aaabbbb then the url changes to http://robertinventor.com/wiki/index.php?target_str=aaa&replacement_str=aaabbbb (no "title=" in the url) and it takes you back to the main page of the wiki. I never see the confirmation page, and no replacement is made. I look forward to trying out this extension, sounds very useful from the things people have been saying about it. Happy to wait for the update but want to make sure my report is accurate Robertwalker 00:39, 17 July 2008 (UTC)
- Okay, this is fixed in the latest version, 0.2.4. Yaron Koren 17:14, 6 August 2008 (UTC)
No admin rights ?
Hello !
Have installed this extension but going to Special:ReplaceText gives me an error : you don't have enough right ? I'm the only user and administrator of my test wiki... any idea ? Thanks :) Draky 20:50, 20 July 2008 (UTC)
- Are you a sysop? See Special:Userrights. If you can't access Special:Userrights you're no sysop. Change in the DB table directly then. --Subfader 08:31, 21 July 2008 (UTC)
- Yes, I am. Draky 08:42, 21 July 2008 (UTC)
- I got a similar error - I was able to access the page by adding:
- Yes, I am. Draky 08:42, 21 July 2008 (UTC)
include_once('extensions/ReplaceText/ReplaceText.php');
$wgGroupPermissions['sysop']['replacetext'] = true;
- to LocalSettings.php - like you I was sysop but seems that I didn't have replacetext permission. - I didn't get it to work but that was probably something to do with the title=... url my wiki uses (not yet sorted out unfortunately) - don't know if this helps Robertwalker 00:35, 25 July 2008 (UTC)
- Yeah, someone added this new permission type to the extension a few weeks ago and didn't tell me... but the documentation now reflects it, thankfully. Yaron Koren 12:38, 25 July 2008 (UTC)
- Not yet: the README file that accompanies the extension package says nothing about setting permission rights. BTW, thanks for this useful tool. Capmo 15:16, 10 November 2008 (UTC)
Expanding the options?
So, since this baby is technically working fine, what about expanding the options a bit?
- Would be nice to be able to "Select all - Deselect all - Invert selection" of the checkboxes.
- Furthermore it would be useful to be able to restrict the affected (previewed and ticked) pages for the replacement to only a few pages related to a certain category. I.e. add a new input box on Special:ReplaceText where you can (optionally) add a category name (without Category: prefix). If filled, the process will first check which pages are added to this category, then check for the text to be replaced. WOuld it be possible? --Subfader 14:05, 25 July 2008 (UTC)
- I added an "invert selections" link into the form in version 0.2.4. Let me know what you think. Yaron Koren 17:14, 6 August 2008 (UTC)
- The invert is fine and enough (no Select / Deselect needed). But I still think it would be great to restrict the replacement action to a batch of pages.
Perfect would be a batch of search results since my wiki uses an advanced search with accurate results but technically it would be hard to combine those two.
Good for a start would be the mentioned category input box so the replacement action is only done on pages in those categories. --Subfader 12:41, 8 January 2009 (UTC)
- The invert is fine and enough (no Select / Deselect needed). But I still think it would be great to restrict the replacement action to a batch of pages.
Extension cannot find copy with \ in it
I am trying to do a search and replace for \SERVER\DEV_AVP\ and replace with \server\avp\avpdev\
This search turns up no results.
If I search for DEV_AVP it finds the instances fine, but it is not sufficient to replace using that as there are other instances of DEV_AVP than do not have the \SERVER\ prefix that I do not want to touch. Any ideas?
- Hi, this was a bug; thanks for the bug report. It's been fixed in the latest version, 0.2.5. Yaron Koren 17:43, 2 September 2008 (UTC)
[RESOLVED] Bug when running on MW 1.9.1
See: Another report on this same problem and the fix
The function wfLoadExtensionMessages() on line 12 of SpecialReplaceText.php does not work on MW 1.9.1. This may have been introduced in version 1.11.0.
I ran into the same problem on the extension "WhoIsWatching" which was fixed by re-coding by the author. (See reference above). I don't know how to do this recoding myself, and so I have commented out the line, which makes for an unattractive special page.
Any idea on how I can fix this problem?
--C4duser 00:20, 23 September 2008 (UTC)
- Forgive me for sounding facetious, but why not upgrade your copy of MediaWiki? Yaron Koren 02:03, 23 September 2008 (UTC)
- If the intent is to have this extension available only on MW 1.11 or higher, perhaps it would be useful to change the box on the extension page that currently suggests it works on 1.8 or greater. That's the reason I posted this comment. --C4duser 19:22, 23 September 2008 (UTC)
- Ah, right you are. :) Sorry about that - I changed the description. Yaron Koren 22:02, 23 September 2008 (UTC)
Select namespace(s)
It would be useful to have a box with the available namespaces on the start page to (un)tick namespace(s) which the find/repace process should affect. Currently I change the code or untick a lot in the preview to do that but it's unhandy. --Subfader 15:38, 28 September 2008 (UTC)
Error when trying to replace a URL
I need to replace absolute links in my project by relative ones, and was trying the following find/replace:
Find: http://www.mysite.org
Repl: {{SERVER}}
When I clicked on the Continue button, the following error was returned:
Forbidden You don't have permission to access /wiki/index.php/Special:ReplaceText on this server. Additionally, a 404 Not Found error was encountered while trying to use an ErrorDocument to handle the request. -------------------------------------------------------------------------------- Apache/2.2.10 (Unix) mod_ssl/2.2.10 OpenSSL/0.9.8i DAV/2 mod_auth_passthrough/2.1 mod_bwlimited/1.4 FrontPage/5.0.2.2635
After a few tests I came to the conclusion that the problem lies in the string &target_str=http%3A%2F%2Fwww.mysite.org appearing on the URL address. I imagine you are using a GET to pass the parameters target_str and replacement_str, and it's resulting in a link that may resemble a stealth URL forwarding to Apache, that blocks it. Would it be possible for you to use POST instead of GET? I believe it would solve this problem. Thanks Capmo 16:41, 11 November 2008 (UTC)