Kinook Software Forum

Go Back   Kinook Software Forum > Ultra Recall > [UR] General Discussion

Reply
 
Thread Tools Rate Thread Display Modes
  #1  
Old 07-05-2008, 07:54 PM
tfjern tfjern is online now
Registered User
 
Join Date: 10-09-2007
Posts: 132
URD is Bloating

Not much activity on the forum these days, so here is a minor contribution. I recently created a UR database, and I decided to take someone's advice (which I read on the forum) that it is better to link and not store material (esp. pdfs and the like). But I noticed that the file already has 80 megs, and I've barely started. Kinook cautions that things start to slow down ("degrade" is the word he uses) after 200 megs, though the maximum for a UR file is much larger.

My question: how can my UR database be so large already when it consists mostly of links? At this rate of bloat I expect things such as searches will start to suffer major degradation.
Reply With Quote
  #2  
Old 07-06-2008, 03:21 AM
quant's Avatar
quant quant is online now
Registered User
 
Join Date: 11-30-2006
Posts: 967
Did you delete large amount of data? If YES, go to Tools->Compact and Repair and choose "Compact Database":

http://www.kinook.com/UltraRecall/Ma...pairdialog.htm

"Compact database: If this check box is checked when OK is clicked, the current Info Database will be shrunk, removing any free space.

Note: Ultra Recall uses a highly efficient, binary format for Info Database files. As Info Items are deleted, empty space remains within the file, and is reused when new data is added to the Info Database. If you delete a large amount of data, you can immediately shrink the Info Database, removing this free space by using the provided Compact functionality."
Reply With Quote
  #3  
Old 07-06-2008, 05:31 AM
tfjern tfjern is online now
Registered User
 
Join Date: 10-09-2007
Posts: 132
Been There, Done That!

Thanks, but I had already looked into that. I'll just have to be sure I don't store but link whenever possible. Be nice if UR had global searches across databases. Maybe someday.

I just noticed something. In Database Properties my URD reads:

1. File Size: 74,345,688
2. Stored Document Size: 387,992
3. Item Rich Text Text: 27,203,470
4. Icon Data: 235,373

Now 1. indicates the total of both stored and linked files. I was wondering if this item is the one to start being concerned about when the file size exceeds 200 megs?

Also, how can 3. (Rich Text Stored) exceed 2. (Stored Doc. Size)? Something wrong here.

Last edited by tfjern; 07-06-2008 at 06:04 AM.
Reply With Quote
  #4  
Old 07-06-2008, 06:02 AM
quant's Avatar
quant quant is online now
Registered User
 
Join Date: 11-30-2006
Posts: 967
hmmm, ok. So when you go to file->properties what is the size of Stored Documents?

PS:
I don't know if you were referring to my comments about storing/linking preferences. I think the search speed will not deteriorate that much when file size increases (as everything is indexed), i.e. that's not the reason I personally prefer linking to storing. My main concern is the quality of the search results. When I link the files, they are still available to other applications, for example, in my case Archivarius3000, which has much superior searching capabilities to UR (relevance, showing highlighted fragments of found terms, morphology, searching for keywords close to each other, ...)
Reply With Quote
  #5  
Old 07-06-2008, 06:05 AM
tfjern tfjern is online now
Registered User
 
Join Date: 10-09-2007
Posts: 132
A Few Edits

Quant, could you look at the edited post please?
Reply With Quote
  #6  
Old 07-06-2008, 06:20 AM
quant's Avatar
quant quant is online now
Registered User
 
Join Date: 11-30-2006
Posts: 967
Re: Been There, Done That!

Quote:
Originally posted by tfjern

Now 1. indicates the total of both stored and linked files.
no, 1. is the URD file size (linked files are not included)

Quote:
Originally posted by tfjern
Also, how can 3. (Rich Text) exceed 2. (Stored Doc. Size)?
3. is the text in your Text, Folders, ... etc. all those derived from Text core template. Remember, even if your item has few words, the rtf is huge!



I'm still wondering why is your file size so big ... what was your urd file size before compacting? When you did compact, did it show sth like "80% shrunk"?
Reply With Quote
  #7  
Old 07-06-2008, 06:53 AM
tfjern tfjern is online now
Registered User
 
Join Date: 10-09-2007
Posts: 132
Conundrum

First, you said that 1. File Size does NOT include linked files, but when I tested this by linking five pdf files, the only change in the four sizes occurred in the first, 1. File Size.

Second, how can Stored Document Size be smaller than Item Rich Text? This doesn't make sense.

Third, upon compacting no percentage change was indicated, since, as I said, almost all the data in the database is linked, not stored.
Reply With Quote
  #8  
Old 07-06-2008, 07:12 AM
quant's Avatar
quant quant is online now
Registered User
 
Join Date: 11-30-2006
Posts: 967
Re: Conundrum

Quote:
Originally posted by tfjern
First, you said that 1. File Size does NOT include linked files, but when I tested this by linking five pdf files, the only change in the four sizes occurred in the first, 1. File Size.
you probably index them that's why the urd increases, but it certainly doesn't include the file sizes themselves. How many keywords has your urd file (file->properties)? This is probably answer to your 80MB urd file size.

Quote:
Originally posted by tfjern
Second, how can Stored Document Size be smaller than Item Rich Text? This doesn't make sense.
because they are most probably zipped

Quote:
Originally posted by tfjern
Third, upon compacting no percentage change was indicated, since, as I said, almost all the data in the database is linked, not stored.
ok, I thought you were changing your urd file, when you first removed those that were stored, and then you linked them.
Reply With Quote
  #9  
Old 07-06-2008, 11:11 AM
Jon Polish Jon Polish is online now
Registered User
 
Join Date: 07-21-2006
Posts: 391
I think the bloat you are observing is due to keywords. I have observed my databases are at minimum twice the expected size.

I would try this experiment. Before attempting this, backup your database!

1. In the explorer pane, select all items.

2, Go to keywords and delete all of them.

3. Compact this database.

4. Compare the sizes of the newly created, keyword-less database with your backup copy.

I would be interested if you could post your results.

Jon
Reply With Quote
  #10  
Old 07-06-2008, 06:40 PM
tfjern tfjern is online now
Registered User
 
Join Date: 10-09-2007
Posts: 132
Frustrating, to Put It Mildly

OK, I selected everything in Data Explorer. Then went to Item / Keywords, but both panes -- user-defined keywords and auto-generated keywords -- were empty. Delete what?

On the other hand, if I Control + K on a particular item (linked, by the way), I get a list of auto-generated items (which seems to vary, item by item, though not always!). Am I supposed to go through the entire database and delete the item keywords? Please say no.

Also, in Properties it says I have 1.1 million keywords! Nice.

So how, pray tell, am I supposed to delete these keywords?

You would think Kinook would address this problem in a more serious manner, but I get the feeling that after creating a great piece of software, UR, they feel their job is done, and the users should be saavy enough to figure out things on their own. And they do, albeit usually serendipidously, or more often on the forum. I have a sinking feeling keywords is a flawed concept or at least a work in progress.

Last edited by tfjern; 07-06-2008 at 07:56 PM.
Reply With Quote
  #11  
Old 07-07-2008, 02:37 AM
quant's Avatar
quant quant is online now
Registered User
 
Join Date: 11-30-2006
Posts: 967
Re: Frustrating, to Put It Mildly

Quote:
Originally posted by tfjern
You would think Kinook would address this problem in a more serious manner, but I get the feeling that after creating a great piece of software, UR, they feel their job is done, and the users should be saavy enough to figure out things on their own. And they do, albeit usually serendipidously, or more often on the forum. I have a sinking feeling keywords is a flawed concept or at least a work in progress.
You set to keyword your documents, have 1 million of them, so no wonder your file is 80MB, and it will make lightning fast searches.

Unfortunately, Jon Polish was not right cause it's not the way to delete keywords. To delete them, first set which kind of file extensions should not be keyworded, then select the items, and resynchronize ...

Everything is in the help file!!! And it's right where you'd first look at, auto-generated keywords. So if by "saavy enough" you mean someone who knows how to use help file, then I share your frustration ;-)

"Auto-generated Keywords can't be manually added (see User-Defined Keywords) but can be deleted (they are automatically replaced when the Info Item is Synchronized)."
Reply With Quote
  #12  
Old 07-07-2008, 03:46 AM
tfjern tfjern is online now
Registered User
 
Join Date: 10-09-2007
Posts: 132
au contraire

The UR help file stinks -- and this is the consensus, excluding the outliers. Too often the explanations there are as clear as mud.

For a simple example, why is my Stored Document Size: 387,992 smaller than my Item Rich Text Text: 27,203,470?

You wrote, "Everything is in the help file!!!" Really? Where?
Reply With Quote
  #13  
Old 07-07-2008, 04:08 AM
quant's Avatar
quant quant is online now
Registered User
 
Join Date: 11-30-2006
Posts: 967
Re: au contraire

Quote:
Originally posted by tfjern
The UR help file stinks -- and this is the consensus, excluding the outliers. Too often the explanations there are as clear as mud.

For a simple example, why is my Stored Document Size: 387,992 smaller than my Item Rich Text Text: 27,203,470?

You wrote, "Everything is in the help file!!!" Really? Where?
Please calm down ... I don't take away from you your opinion on UR help file, mine is simply different. So I repeat mine, UR help file is very good and comprehensive. If Kinook is planning to make it even better, that's great!

So in help file, you can read:

Stored Document Size: The combined size of all stored documents.

Item Rich Text: The combined size of all rich text stored.

Something unclear?
Reply With Quote
  #14  
Old 07-07-2008, 07:22 AM
tfjern tfjern is online now
Registered User
 
Join Date: 10-09-2007
Posts: 132
Same planet, different world

I am calm. Annoyed, perhaps, but still calm.

You borrowed two definitions from the your crystal-clear "help" file ("Mine is simply different"), but you still haven't answered my simple question:

viz., why is my Stored Document Size (387,992) smaller than my Item Rich Text (also stored, but NOT zipped) (27,203,470)?

Logically shouldn't the size of the latter be smaller than the former, or am I missing something? Or perhaps we have entered the mysterious realm of quantum computing and Heisenberg's Uncertainty Principle has taking effect.

Kinook, are you there? Hello?
Reply With Quote
  #15  
Old 07-07-2008, 07:37 AM
Jon Polish Jon Polish is online now
Registered User
 
Join Date: 07-21-2006
Posts: 391
Re: Re: Frustrating, to Put It Mildly

Quote:
Originally posted by quant
Unfortunately, Jon Polish was not right cause it's not the way to delete keywords. To delete them, first set which kind of file extensions should not be keyworded, then select the items, and resynchronize ...
Hi Quant:

I don't quite understand why this is not correct. UR cannot keyword all files (WordPerfect for example), but those that it can (text based pdf, Word, etc.) do display for me. The keywords appear using the method I suggested whether the items are stored or linked.

Jon
Reply With Quote
Reply

Thread Tools
Display Modes Rate This Thread
Rate This Thread:

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



All times are GMT -5. The time now is 07:23 AM.


Copyright © 1999-2023 Kinook Software, Inc.