Xapian is an Open Source Search Engine Library, released under the GPL. It’s written in C++, with bindings to allow use from Perl, Python, PHP, Java, Tcl, C# and Ruby (so far!) info
Xenu's Link Sleuth (TM) checks Web sites for broken links. Link verification is done on “normal” links, images, frames, plug-ins, backgrounds, local image maps, style sheets, scripts and java applets.