|
Data Mining Tutorial complete
with Data Mining Tools (PHP
Functions) to parse data and
match based on regular
expressions. Basic Data
Mining Steps: Fetch the HMTL
page(s) of Interest using
the Snoopy PHP Class, Split
the page HTML into a more
managable portion, Remove
un-wanted HTML tag
attributes, Reformat
HTML, adjust spacing and
remove entities, Match
content with regular
expressions and Store
content into a MySQL
database for future use.
Data mining services
available for online
resources such as Google,
DMOZ, Yahoo, Yellow Pages
and several others.
Date: May, 24 2012 |