|
Welcome to the Database Correction page. This page is for letting me or the other editors know of corrections that need to be made. Please read the posting instructions carefully.
|
|
Monday August 04 02:58:13 2003 Re: Titles |
Brian R wrote:
> it seems that some entries still > have "Episode Title:" in them, Frig, when went through before thought got all them? > The error messages at the top are because the tool > currently treats anything in () as a date. Some are AKAs, > some are notes that will eventually need to be moved to > the Description, and a bunch more (?). Unfortunately a > lot of those, we'll either need to check each of those > titles or just erase the ?s and assume the title is > correct. Opinions? Okay wait here Buddy. The (?) in the title means unawares of the date. In some cases have confirmed the film exists and just haven't tracked down a date yet. > There are some funky ones because the Titles look like: 7 > Days, AKA Seven Days Oh frig yes. Ongoing common Sqwack Plan was to line up all these series same, in IMDB movie form. Problem being some posters still insist on entering their own way, and it gets by the boy here. Then when that missed, hear all about it. So aware of this <G> *Eventually* I'll get on top of that Buddy |
Jay L |
kdnpr@yahoo.com |
Monday August 04 03:16:24 2003 Re: Titles |
Brian R wrote:
> There are some funky ones > 7 Days, AKA Seven Days. Okay, the 7 Days series all sorted proper now. I'll get after the others when get time. |
Jay L |
kdnpr@yahoo.com |
Monday August 04 03:33:47 2003 Re: Brian's Post- Titles |
"and a bunch more (?). Unfortunately a lot of those, we'll either need to check each of those titles or just erase the ?s and assume the title is correct. Opinions?" Ang Tipo Kong Lalake: Illegal date (?) Eseng Ng Tondo: Illegal date (?) Okay, just looking at these two, both Philippine Islands films we couldn't find a date for as suspected. My opinion is the date mark not needed if coming up Illegal and causing hassles, so's these will be sorted right now. |
Jay L |
kdnpr@yahoo.com |
Monday August 04 11:43:20 2003 Re: Titles |
Jay L wrote:
> Okay wait here Buddy. > The (?) in the title means unawares of the date. Ah, ok then no problem. I'll set the tool to handle that. > > There are some funky ones because the Titles look > > like: 7 Days, AKA Seven Days > > Oh frig yes. Ongoing common Sqwack > *Eventually* I'll get on top of that Buddy I wouldn't bother, like I said 162 to change. It'll be easier for me to set the tool to handle it, the new database will store things differently anyway. |
Brian R |
Monday August 04 11:45:21 2003 Re: Brian's Post- Titles |
Jay L wrote:
> My opinion is the date mark not needed if coming up > Illegal and causing hassles, so's these will be sorted > right now. Let's discuss things before making decisions based on the output from the tool. I'm many cases, once I understand what the cause is, it's easier for me to change the tool. |
Brian R |
Monday August 04 13:53:13 2003 Latest on titles |
I'm home today because the sales people are getting special training and they figure the engineers wouldn't get any work done anyway, so they gave us a day off. Yay.
Anyway, I've folded in some rule changes. Most of the "illegal dates" are now AKAs without saying AKA, notes, and one thing that looks like an episode name. The only problem really is "Randall and Hopkirk: (Deceased)" where the parenthetical information is part of the title. I can just add a special rule for that. If we weed out all the comments and episode information, then I can make a default rule that anything left must be an AKA. That will cut down on the amount of fixup to do. Scanning down the list of titles, there are some obvious inconsistencies and what look to be some duplicates. Many are minor inconsistencies in the capitalization of words in the title: Amazing Spider-Man, The (1978) Amazing Spider-man, The (1978) Others are missing articles: A-Team A-Team, The |
Brian R |
Monday August 04 13:56:41 2003 Tool plans |
I'm going to start on the Episode extractor next. I may also make a version of the Title extractor that looks for what seems to be duplicate Movie titles. That would be a Media == Movie, Title identical, Date identical or not present. This would be purely to check for errors.
|
Brian R |
Monday August 04 14:51:05 2003 Re: Tool plans |
Brian R wrote:
> I may > also make a version of the Title extractor that looks for > what seems to be duplicate Movie titles. That would be a > Media == Movie, Title identical, Date identical or not > present. This would be purely to check for errors. I ran this, there weren't that many hits. Not all of these are for sure duplicates, but many are. Also understand that there may be still dupes uncaught because of inexact title matches. Results: Possible duplicate: Price, The / Price, The (1985) Possible duplicate: Nightwatch (1994) / Nightwatch (1994) Possible duplicate: Hell Hath No Fury (1991) / Hell Hath No Fury (1991) Possible duplicate: Hell Mountain (1998) / Hell Mountain (1998) Possible duplicate: Tarzan (1999) / Tarzan Possible duplicate: Night Before, The (1988) / Night Before, The (1988) Possible duplicate: Pukar (2000) / Pukar (2000) Possible duplicate: Maachis (1996) / Maachis (1996) Possible duplicate: Texas Across the River (1966) / Texas Across the River (1966) Possible duplicate: Captive Women (1952) / Captive Women (1952) Possible duplicate: Betaabi (1997) / Betaabi (1997) |
Brian R |
Monday August 04 18:20:21 2003 Re: Brian's Post- Titles |
Brian R wrote:
> Let's discuss things before making decisions based on > the output from the tool. I'm many cases, once I > understand what the cause is, it's easier for me to > change the tool. On the (?) thing, that was something I doing to mark both suspect entries and to work on eventually getting dates for others, etc. They've all been sorted to "Not in IMDB" etc comments in the bottom of description now. Btw: Randall & Hopkirk (Deceased) is how entered in IMDB. Any idea how to sort that one? The 'aka' thing was an ongoing thing anyway, Last one sorted from "Sqwack-List": Simon & Simon, aka Simon and Simon Simon and Simon, aka Simon & Simon Simon & Simon Simon and Simon The "Moderator" had a post about searching on sub-strings or whatever? After awhile, hard for the boy to find. Idea being, to gather up them all, and sort them exact same as IMDB, So that no Simon and Simon episodes "would be F*%#ing missed" off ABC, or ABE, whatever US channel they airing on? Now that the others brought up like 7 Days aka Seven Days, etc, figured great time to mark and sort instead of waiting to come across or "be alerted" to them. Perhaps I'm underestimating your abilities with these tools? |
Jay L |
kdnpr@yahoo.com |
Monday August 04 18:26:14 2003 Re: Tool plans |
Brian R wrote:
> I ran this, there weren't that many hits. Not all of > these > are for sure duplicates, but many are. Also understand > that there may be still dupes uncaught because of inexact > > title matches. > > Results: > > Possible duplicate: Price, The / Price, The (1985) > Possible duplicate: Nightwatch (1994) / Nightwatch > (1994) > Possible duplicate: Hell Hath No Fury (1991) / Hell Hath > No Fury (1991) > Possible duplicate: Hell Mountain (1998) / Hell Mountain > (1998) > Possible duplicate: Tarzan (1999) / Tarzan > Possible duplicate: Night Before, The (1988) / Night > Before, The (1988) > Possible duplicate: Pukar (2000) / Pukar (2000) > Possible duplicate: Maachis (1996) / Maachis (1996) > Possible duplicate: Texas Across the River (1966) / Texas > Across the River (1966) > Possible duplicate: Captive Women (1952) / Captive Women > (1952) > Possible duplicate: Betaabi (1997) / Betaabi (1997) I went over these, and determined eight out of 11 duplicates. Deleted one of each, sometimes after a little merging. The ones that are no duplicates: The Price - most probably two different shows Nightwatch - one of these is actually the 1998 remake of the 1994 one Tarzan - one of these is an animated one, the other isn't. In case one of you is short of work, he may go through the Tarzan entries anyway some time, there are quite a few. |
Peter de K |
Monday August 04 18:27:00 2003 Re: Brian One Step Ahead Of The Boy Here |
Brian R wrote:
> "Randall and Hopkirk: (Deceased)" > I can just add a special rule for that. Forget asked about that previous then. |
Jay L |
kdnpr@yahoo.com |
Monday August 04 19:34:07 2003 Re: Brian's Post- Titles |
Jay L wrote:
> On the (?) thing, > that was something I doing to mark both suspect entries > and to work on eventually getting dates for others, etc. Ok, so those did need to be clarified. > The 'aka' thing was an ongoing thing anyway, > Last one sorted from "Sqwack-List": > Simon & Simon, aka Simon and Simon > Simon and Simon, aka Simon & Simon > Simon & Simon > Simon and Simon > The "Moderator" had a post about searching on > sub-strings or whatever? Yeah, that's how the search tool works. So in the above examples, searching for "simon and simon" when the title is "Simon & Simon" will fail. Just "simon" would hit all. > Idea being, to gather up them all, and sort them exact > same as IMDB, That's probably best, if not too much work. > Now that the others brought up like 7 Days aka Seven > Days, etc, figured great time to mark and sort instead of > waiting to come across or "be alerted" to > them. That's another one. > Perhaps I'm underestimating your abilities with these > tools? If I can find a pattern, I can have the tool handle it. If it requires human smarts to figure it out, then not. Titles and AKAs in variable order are very hard. The pattern is not there, it would require special cases (I think). You and I know that "Seven Days" and "7 Days" are the same, but how does the tool know? Parse every leading numeral? Also, the tool can't know which is the real way it belongs. |
Brian R |
Monday August 04 21:35:44 2003 Re: Brian's Post- Titles |
> That's probably best, if not too much work.
Shouldn't be too much work as been sorting these uniform as come across for last while. Going by your tool, actually closer than figured we'd be > 7 Days aka Seven Days, > That's another one. 7 Days series has all been sorted now. > Titles and AKAs in variable order are very hard. The > pattern is not there, I see, as mentioned earlier, your tool has brought them all out in open and not as bad as figured. We should have them all uniform in next few days. |
Jay L |
kdnpr@yahoo.com |
|