Welcome to dbForumz.com!
FAQFAQ    SearchSearch      ProfileProfile    Private MessagesPrivate Messages   Log inLog in

unknown unique key count variance

 
   Database Forums (Home) -> Full Text RSS
Next:  Trying to exit a module  
Author Message
PhantomRick

External


Since: Feb 06, 2008
Posts: 2



(Msg. 1) Posted: Wed Feb 06, 2008 11:20 pm
Post subject: unknown unique key count variance
Archived from groups: microsoft>public>sqlserver>fulltext (more info?)

I am pretty new to FTS but found I am stuck with a problem that
doesn't seem common. In all my searching, I have only come across one
post that had the same issue and that was from 2002.

http://groups.google.com.au/group/microsoft.public.sqlserver.fulltext/...wse_thr

The only suggestion from that post (answered by Hilary Cotter) was the
problem could be related to using change tracking with background
indexing or very high cpu activity on my server. Neither are the
case.

To relate my problem:

I am trying to use FTS to perform content searching on only the pdf
files stored in my database.

While trying to discover why I was getting inaccuracies, I found that
if I rebuild the catalog then do a full population, the unique key
count is different each time.

I tested it 12 times in a row, & the unique key counts results:

Min 10,530
Max 42,731
Average 24,888
Median 21,455

I am running SQL Server 2000 and Adobe iFilter 6 on XP SP2. This is a
development machine, so there is no external interference to alter my
testing. I was also able to duplicate the issue on a Windows 2000
server running SQL Server 2000 and iFilter 6.

I am indexing only one table with a single catalog. I used the wizard
to create this & checked on Hilary's site (http://
www.indexserverfaq.com/SQLFTIWizard.htm) to ensure I was doing it
correctly.

The table holding the files contains a total of 1261 files of which
747 are pdfs. The rest are a variety
of .wpd, .doc, .zip, .tif, .exe, .jpeg, & .dwg files.

The table has the following layout.

3 gVfsId uniqueidentifier 16 0
0 sExt varchar 10 1
0 imgTarget image 16 1
0 sCrc varchar 100 1
0 iFilesize int 4 1
0 bEncrypted bit 1 1
0 bRebuild bit 1 1
0 dteCatIncrement timestamp 8 1


If someone could at least point me in a direction to rectify this
issue, I will be eternally grateful.


Cheers,
Rick.

 >> Stay informed about: unknown unique key count variance 
Back to top
Login to vote
Hilary Cotter

External


Since: Oct 09, 2005
Posts: 78



(Msg. 2) Posted: Thu Feb 07, 2008 8:48 am
Post subject: Re: unknown unique key count variance [Login to view extended thread Info.]
Archived from groups: per prev. post (more info?)

There was a problem in one of the earlier RTM and service packs of SQL
Server 2000 which had a behavior like this. It was solved in one of
the later SP's - SP 4 IIRC.

Are you doing a full population each time?


On Feb 7, 2:20 am, PhantomRick wrote:
> I am pretty new to FTS but found I am stuck with a problem that
> doesn't seem common.  In all my searching, I have only come across one
> post that had the same issue and that was from 2002.
>
> http://groups.google.com.au/group/microsoft.public.sqlserver.fulltext...
>
> The only suggestion from that post (answered by Hilary Cotter) was the
> problem could be related to using change tracking with background
> indexing or very high cpu activity on my server.  Neither are the
> case.
>
> To relate my problem:
>
> I am trying to use FTS to perform content searching on only the pdf
> files stored in my database.
>
> While trying to discover why I was getting inaccuracies, I found that
> if I rebuild the catalog then do a full population, the unique key
> count is different each time.
>
> I tested it 12 times in a row, & the unique key counts results:
>
> Min     10,530
> Max     42,731
> Average 24,888
> Median  21,455
>
> I am running SQL Server 2000 and Adobe iFilter 6 on XP SP2.  This is a
> development machine, so there is no external interference to alter my
> testing.  I was also able to duplicate the issue on a Windows 2000
> server running SQL Server 2000 and iFilter 6.
>
> I am indexing only one table with a single catalog.  I used the wizard
> to create this & checked on Hilary's site (http://www.indexserverfaq.com/SQLFTIWizard.htm) to ensure I was doing it
> correctly.
>
> The table holding the files contains a total of 1261 files of which
> 747 are pdfs.  The rest are a variety
> of .wpd, .doc, .zip, .tif, .exe, .jpeg, & .dwg files.
>
> The table has the following layout.
>
> 3       gVfsId  uniqueidentifier        16      0
> 0       sExt    varchar 10      1
> 0       imgTarget       image   16      1
> 0       sCrc    varchar 100     1
> 0       iFilesize       int     4       1
> 0       bEncrypted      bit     1       1
> 0       bRebuild        bit     1       1
> 0       dteCatIncrement timestamp       8       1
>
> If someone could at least point me in a direction to rectify this
> issue, I will be eternally grateful.
>
> Cheers,
> Rick.

 >> Stay informed about: unknown unique key count variance 
Back to top
Login to vote
PhantomRick

External


Since: Feb 06, 2008
Posts: 2



(Msg. 3) Posted: Thu Feb 07, 2008 5:53 pm
Post subject: Re: unknown unique key count variance [Login to view extended thread Info.]
Archived from groups: per prev. post (more info?)

Although I had applied SP4 originally, I re-applied it this morning.

I have then re-tested the catalog, each time rebuilding the catalog
then running a full population through the enterprise manager. The
results were:

Item count / unique word count
1260 46997
1260 36005
1260 38896
1261 32451
1260 31806
1261 33381

Oh, it's now 1262 files - I added another this morning from other
testing, but before running this test.

I aslo noticed while checking through the pdf files, I seem to get a
lot of false positives using CONTAINS. I was presuming this is a
related matter.

Do you have any suggestions as to what else I could try?


On Feb 8, 3:48 am, Hilary Cotter wrote:
> There was a problem in one of the earlier RTM and service packs of SQL
> Server 2000 which had a behavior like this. It was solved in one of
> the later SP's - SP 4 IIRC.
>
> Are you doing a full population each time?
>
> On Feb 7, 2:20 am, PhantomRick wrote:
>
>
>
> > I am pretty new to FTS but found I am stuck with a problem that
> > doesn't seem common.  In all my searching, I have only come across one
> > post that had the same issue and that was from 2002.
>
> >http://groups.google.com.au/group/microsoft.public.sqlserver.fulltext...
>
> > The only suggestion from that post (answered by Hilary Cotter) was the
> > problem could be related to using change tracking with background
> > indexing or very high cpu activity on my server.  Neither are the
> > case.
>
> > To relate my problem:
>
> > I am trying to use FTS to perform content searching on only the pdf
> > files stored in my database.
>
> > While trying to discover why I was getting inaccuracies, I found that
> > if I rebuild the catalog then do a full population, the unique key
> > count is different each time.
>
> > I tested it 12 times in a row, & the unique key counts results:
>
> > Min     10,530
> > Max     42,731
> > Average 24,888
> > Median  21,455
>
> > I am running SQL Server 2000 and Adobe iFilter 6 on XP SP2.  This is a
> > development machine, so there is no external interference to alter my
> > testing.  I was also able to duplicate the issue on a Windows 2000
> > server running SQL Server 2000 and iFilter 6.
>
> > I am indexing only one table with a single catalog.  I used the wizard
> > to create this & checked on Hilary's site (http://www.indexserverfaq.com/SQLFTIWizard.htm) to ensure I was doing it
> > correctly.
>
> > The table holding the files contains a total of 1261 files of which
> > 747 are pdfs.  The rest are a variety
> > of .wpd, .doc, .zip, .tif, .exe, .jpeg, & .dwg files.
>
> > The table has the following layout.
>
> > 3       gVfsId  uniqueidentifier        16      0
> > 0       sExt    varchar 10      1
> > 0       imgTarget       image   16      1
> > 0       sCrc    varchar 100     1
> > 0       iFilesize       int     4       1
> > 0       bEncrypted      bit     1       1
> > 0       bRebuild        bit     1       1
> > 0       dteCatIncrement timestamp       8       1
>
> > If someone could at least point me in a direction to rectify this
> > issue, I will be eternally grateful.
>
> > Cheers,
> > Rick.- Hide quoted text -
>
> - Show quoted text -
 >> Stay informed about: unknown unique key count variance 
Back to top
Login to vote
Hilary Cotter

External


Since: Oct 09, 2005
Posts: 78



(Msg. 4) Posted: Fri Feb 08, 2008 6:17 am
Post subject: Re: unknown unique key count variance [Login to view extended thread Info.]
Archived from groups: per prev. post (more info?)

This seems abnormal. The way it works is that there are temporay
memory resident indexes which are merged into shadow indexes and then
into a single master catalog.

You might be getting this discrepancy from there.

Also are you getting these same false positives when you are using a
contains/containstable query.

On Feb 7, 8:53 pm, PhantomRick wrote:
> Although I had applied SP4 originally, I re-applied it this morning.
>
> I have then re-tested the catalog, each time rebuilding the catalog
> then running a full population through the enterprise manager.  The
> results were:
>
> Item count / unique word count
> 1260    46997
> 1260    36005
> 1260    38896
> 1261    32451
> 1260    31806
> 1261    33381
>
> Oh, it's now 1262 files - I added another this morning from other
> testing, but before running this test.
>
> I aslo noticed while checking through the pdf files, I seem to get a
> lot of false positives using CONTAINS.  I was presuming this is a
> related matter.
>
> Do you have any suggestions as to what else I could try?
>
> On Feb 8, 3:48 am, Hilary Cotter wrote:
>
>
>
> > There was a problem in one of the earlier RTM and service packs of SQL
> > Server 2000 which had a behavior like this. It was solved in one of
> > the later SP's - SP 4 IIRC.
>
> > Are you doing a full population each time?
>
> > On Feb 7, 2:20 am, PhantomRick wrote:
>
> > > I am pretty new to FTS but found I am stuck with a problem that
> > > doesn't seem common.  In all my searching, I have only come across one
> > > post that had the same issue and that was from 2002.
>
> > >http://groups.google.com.au/group/microsoft.public.sqlserver.fulltext....
>
> > > The only suggestion from that post (answered by Hilary Cotter) was the
> > > problem could be related to using change tracking with background
> > > indexing or very high cpu activity on my server.  Neither are the
> > > case.
>
> > > To relate my problem:
>
> > > I am trying to use FTS to perform content searching on only the pdf
> > > files stored in my database.
>
> > > While trying to discover why I was getting inaccuracies, I found that
> > > if I rebuild the catalog then do a full population, the unique key
> > > count is different each time.
>
> > > I tested it 12 times in a row, & the unique key counts results:
>
> > > Min     10,530
> > > Max     42,731
> > > Average 24,888
> > > Median  21,455
>
> > > I am running SQL Server 2000 and Adobe iFilter 6 on XP SP2.  This is a
> > > development machine, so there is no external interference to alter my
> > > testing.  I was also able to duplicate the issue on a Windows 2000
> > > server running SQL Server 2000 and iFilter 6.
>
> > > I am indexing only one table with a single catalog.  I used the wizard
> > > to create this & checked on Hilary's site (http://www.indexserverfaq.com/SQLFTIWizard.htm) to ensure I was doing it
> > > correctly.
>
> > > The table holding the files contains a total of 1261 files of which
> > > 747 are pdfs.  The rest are a variety
> > > of .wpd, .doc, .zip, .tif, .exe, .jpeg, & .dwg files.
>
> > > The table has the following layout.
>
> > > 3       gVfsId  uniqueidentifier        16      0
> > > 0       sExt    varchar 10      1
> > > 0       imgTarget       image   16      1
> > > 0       sCrc    varchar 100     1
> > > 0       iFilesize       int     4       1
> > > 0       bEncrypted      bit     1       1
> > > 0       bRebuild        bit     1       1
> > > 0       dteCatIncrement timestamp       8       1
>
> > > If someone could at least point me in a direction to rectify this
> > > issue, I will be eternally grateful.
>
> > > Cheers,
> > > Rick.- Hide quoted text -
>
> > - Show quoted text -- Hide quoted text -
>
> - Show quoted text -
 >> Stay informed about: unknown unique key count variance 
Back to top
Login to vote
Display posts from previous:   
Related Topics:
Newbie question: unique index across nvarchar fields - Surely an easy question for those in the know... I have a contacts table, and I'd like to ensure uniqueness across first/middle/last name fields. Each field is an nvarchar(255) - as such, I can't create a regular index because the combined index size i...

Is There a Quick Way to Get a Total Match Count (estimate .. - Hi, I am using the June CTP release of SQL Server 2005 on Windows Sever 2003. Is there a quick way to get a total natch count (estimate is OK) of a SQL FullText query? I am working with about 10 million r ows of simple character data. The following SQ...

Item Count Limitations Full Text Catalogs - Hello Does anyone know if there is a limit to how many items(item count) a single catalog can hold? I have a table with over 30 million records and it only gets up to a little over 14 million in the item count and never finishes the population. It..

help to define a search criteria with FTS - Hi! I'm using FTS in MSSQL2000. 1. i have a string "bcd" and i want the results : "abcd" or "1bcd" but not "bcda" or "aabcd" (always from the second letter). 2. the search column is a long string. the ...

CONTAINSTABLE - weird results - using "and not" - Hello everyone, I use full text search using containstable for search on my intranet site. Its been working wonderfully. However, I have recently been working on an upgrade to my search page to allow users to exclude words. When excluding words I use...
   Database Forums (Home) -> Full Text All times are: Pacific Time (US & Canada)
Page 1 of 1

 
You can post new topics in this forum
You can reply to topics in this forum
You can edit your posts in this forum
You can delete your posts in this forum
You can vote in polls in this forum



[ Contact us | Terms of Service/Privacy Policy ]