ISFDB talk:Data Consistency/Disallowed URLs
Revision as of 15:44, 21 October 2010 by Mhhutchins (talk | contribs) (→Looking for offending records)
It looks like bestsf.net has moved its images? I've found several broken ones for Interzone covers for instance. BLongley 17:36, 3 September 2010 (UTC)
- I've replaced the bestsf.net images with Galactic Central ones, and most of the SJGames ones with Amazon links. BLongley 22:05, 13 October 2010 (UTC)
This table is far out of date. The scan that created it should be re-run when possible and then every few months if possible. -DES Talk 16:44, 13 October 2010 (UTC)
- I'm not sure what the original script was. This is my closest, for the counts:
select SUBSTR(p.pub_frontimage,8,locate('/',p.pub_frontimage,8)-8) , COUNT(*) from pubs p where p.pub_frontimage is not null and LENGTH(p.pub_frontimage) > 8 and p.pub_frontimage LIKE 'http%' group by SUBSTR(p.pub_frontimage,8,locate('/',p.pub_frontimage,8)-8)
- then plug the dodgy prefixes found into a query like
select pub_id from pubs where pub_frontimage LIKE '%prefix%'
- to find the actual pubs. There seem to be a lot more now. :-/ BLongley 20:19, 13 October 2010 (UTC)
Rerun of the script
Ahasuerus ran the original script, and I've just left him a note to see if he could do it again when he gets the chance. Mhhutchins 20:33, 13 October 2010 (UTC)
Looking for offending records
How would someone look for records that link to sjgames, cox.net and photobucket? Mhhutchins 22:19, 13 October 2010 (UTC)
- I think I've fixed the sjgames and photobucket ones, but here are the cox.net Pub IDs: 3251, 34297, 35399, 40674, 45990, 169051, 196833. BLongley 22:34, 13 October 2010 (UTC)
- Thanks. I've changed all links to cover images on approved servers. Mhhutchins 00:35, 14 October 2010 (UTC)
- I've removed (most) and replaced (about 75%) of the links from the offending records based on the latest posted list, but there are a couple of servers for which the links weren't provided. Can someone who has a local copy of the database check to see what records are linked to "albin-michel.fr" and "eclipse.co.uk", and the three that have "nohttp"? Thanks. Mhhutchins 18:53, 20 October 2010 (UTC)
- I was able to find those for albin-michel by going through the publisher records. That leaves eclipse.co.uk. Mhhutchins 19:06, 20 October 2010 (UTC)
- If they are for Sweet Despise, then we've got implicit permission from Unapersson. BLongley 23:38, 20 October 2010 (UTC)
- Is there a way to make sure that they're all on that URL? Mhhutchins 00:41, 21 October 2010 (UTC)
- Yes, as of the last backup I loaded all 76 are for Sweet Despise. BLongley 19:34, 21 October 2010 (UTC)
- Great. Thanks for checking. Mhhutchins 19:44, 21 October 2010 (UTC)