If you add files to subversion they are associated with a mimetype. SvnQuery will only index text files, that means files without an svn:mime-type property or where the property is set to something like “text/*”. At work I wondered why I couldn’t find some words that I know must exist. It turned out that subversion marks files stored as UTF-8 files with BOM as binary, using svn:mime-type application/octet-stream. This forces the indexer to ignore the content of the file.
I used SvnQuery to find all files that are marked as binary, e.g. t:app* .js finds all javascript files and t:app* .cs finds all C# files. With the download button at the bottom of the results page I downloaded a text files with the results. Because svn propdel svn:mime-type [PATH] can work only on one file (it has no --targets option) I had to modify the text file to create a small batch script like this:
svn propdel svn:mime-type c:\workspaces\javascript\file1.js     
svn propdel svn:mime-type c:\workspaces\javascript\file1.js      
svn propdel svn:mime-type c:\workspaces\javascript\file1.js      
…
After this change indexing worked again. I now run a daily query that ensures that no sources files are marked as binary.
Hi great reaading your post
ReplyDelete