Discussion Azure Tables & Search Indexing

  • July 27, 2012 21:45
     
     

    Hello, 

    I'm trying to index the words of a document in Azure Tables. My indexer splits the text into words and sends each word to Azure Tables along with the word's score and the document ID. The structure is like this:

    Table:  "Words"


    (PartitionKey: "the" RowKey: "Document1")
    -Score: 0.0035


    (PartitionKey: "time" RowKey: "Document1")
    -Score: 0.0023


    (PartitionKey: "machine" RowKey: "Document1")
    -Score: 0.0062


    (PartitionKey: "machine" RowKey: "Document2")
    -Score: 0.0105


    I query the table for all rows with a given partition key, since the partition key is the word I'm looking for.

    For example, I get 2 entities for the word "machine".
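
    To make the layout concrete, here is a minimal in-memory sketch of the same structure in Python. It does not call Azure at all; the names words_table, insert_entity, query_partition and query_words are hypothetical and only illustrate how the partition key (the word) and row key (the document ID) relate to a lookup.

```python
from collections import defaultdict

# In-memory stand-in for the "Words" table (illustration only, no Azure calls):
# partition key (word) -> {row key (document ID) -> score}
words_table = defaultdict(dict)


def insert_entity(word, document_id, score):
    """Store one entity under PartitionKey=word, RowKey=document_id."""
    words_table[word][document_id] = score


def query_partition(word):
    """Return all (document_id, score) pairs in the partition for `word`."""
    return sorted(words_table.get(word, {}).items())


def query_words(words):
    """Run one partition lookup per word and return the results per word."""
    return {word: query_partition(word) for word in words}


# The four sample entities from the post.
insert_entity("the", "Document1", 0.0035)
insert_entity("time", "Document1", 0.0023)
insert_entity("machine", "Document1", 0.0062)
insert_entity("machine", "Document2", 0.0105)

print(query_partition("machine"))
print(query_words(["the", "time", "machine"]))
```

    In this sketch a multi-word query is simply one partition lookup per word, which mirrors how I query the real table.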

    But it's incredibly slow. I have 256 words and 1000 entities per word. It takes 2 seconds to get 100 entities with a one-word query and about 3 seconds to get 100 entities with a three-word query.

    I must add that indexing is also very slow: saving 10,000,000 entities took about two days.

    Is there any other way to index words? Is there any way to increase performance in this scenario? 

    Note: Sorry, my English is poor :(( 

All Replies