Hi,
I will be storing tens or hundreds of thousands of customers' webpages. I will store the images within them as blobs in storage, but I need to store the HTML in searchable form. The HTML text should be no more than 30-40MB.
I don't know whether the HTML text is best kept in a SQL table, or in blob storage and linked by ID to its metadata in the database.
Scalability is a big concern, but I will also want to be able to search on the text as well as the metadata (e.g. search for "some phrase" in all files linked to a user's ID and her friends' IDs).
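To make that kind of query concrete, here is a rough sketch of what I have in mind. Everything in it is an assumption for illustration only: the table and column names (pages, friends, owner_id, friend_id, html) are made up, SQLite stands in for whatever database I end up using, and the plain substring match is just a placeholder for a proper full-text index.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a hypothetical schema and sample rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE pages (page_id INTEGER PRIMARY KEY, owner_id INTEGER, html TEXT);
        CREATE TABLE friends (user_id INTEGER, friend_id INTEGER);
    """)
    conn.executemany("INSERT INTO pages VALUES (?, ?, ?)", [
        (1, 10, "<p>some phrase here</p>"),
        (2, 11, "<p>another some phrase</p>"),
        (3, 12, "<p>some phrase from a stranger</p>"),
        (4, 10, "<p>nothing relevant</p>"),
    ])
    # User 10 is friends with user 11 (hypothetical data).
    conn.execute("INSERT INTO friends VALUES (10, 11)")
    return conn


def search_pages(conn, user_id, phrase):
    """Return IDs of pages owned by user_id or her friends whose HTML contains phrase."""
    sql = """
        SELECT p.page_id
        FROM pages AS p
        WHERE (p.owner_id = :uid
               OR p.owner_id IN (SELECT friend_id FROM friends WHERE user_id = :uid))
          AND instr(p.html, :phrase) > 0   -- case-sensitive substring match, no index
        ORDER BY p.page_id
    """
    return [row[0] for row in conn.execute(sql, {"uid": user_id, "phrase": phrase})]


if __name__ == "__main__":
    conn = build_demo_db()
    print(search_pages(conn, 10, "some phrase"))
```

The substring scan touches every candidate row, which is exactly the part I doubt will scale, hence the question about where the HTML text should live.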
Advice on the correct approach would be much appreciated.
Best,
Jack.