Surf3D AGENT: CIAFactbookAgent_1.xo

WHAT IT DOES:
Gathers fact book abstracts from www.odci.gov/cia/publications/factbook/index.html

WHAT IT'S GOOD FOR:
Researching country differences using the CIA factbook and gathering associated target text extracts and image documents
On a per web page node basis:
    Metric A counts the word 'telephone' and extracts sentences
    Metric B counts incidence of the word 'military'
    Metric C displays the %GDP Agriculture for the country
Getting quick-glance summaries of data across all countries
Getting extracts from factbook listings about the telephone system in each country

HOW IT WORKS:
Drills down into each CIA country listing
Performs metric counts and extracts sentences containing preferred extraction keywords

HOW TO SET IT UP:
Save-As the provided CIAFactbookAgent_1.xo to a new name, such as CIA-Education.xo
Type preferred text extract keywords into the Metric channel lines A and B
Look for those keywords in the web page that have associated numbers you want
(To access the Metric channel control popup doc, click the '?' at the top right of the Agent window, then click on the text space for Metric channel A; for more popup Metric channel doc, click B and C)
Look for the visual occurrence of the word on the page: is it the 1st or a later mention?
Look for the numbers past the word: is it the 1st instance of a number (including periods as separators) or a later instance?
e.g. in the supplied agent the text in the channel is #11001"agriculture:", which picks out the agriculture GDP (Gross Domestic Product) percentage number: it finds the 1st number past the 1st visual occurrence of "agriculture:"
To gather pictures, open the Agent Action Configuration window and under Acquire and Capture select the file type (or type it in) and the file size parameters you want to accept.
Note the agent is set to gather text and include False pages.
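
The metric channel lookup described above (the 1st number past the 1st occurrence of "agriculture:") can be illustrated with a short Python sketch. This is not Surf3D's own code and does not parse the #11001 channel syntax; the function name, its parameters, and the sample page text are all hypothetical, and case-insensitive matching is an assumption.

```python
import re

# A number is a run of digits, optionally with periods as separators (e.g. 4.5).
NUMBER = re.compile(r"\d+(?:\.\d+)*")


def number_after_keyword(text, keyword, occurrence=1, number_index=1):
    """Return the number_index-th number found after the occurrence-th
    match of keyword, as a string, or None if either is missing.

    Hypothetical helper: it only mirrors the "Nth number past the Mth
    occurrence" rule described in the setup notes.
    """
    # Case-insensitive matching is an assumption, not documented behaviour.
    matches = list(re.finditer(re.escape(keyword), text, re.IGNORECASE))
    if len(matches) < occurrence:
        return None
    start = matches[occurrence - 1].end()
    numbers = NUMBER.findall(text, start)
    if len(numbers) < number_index:
        return None
    return numbers[number_index - 1]


# Made-up sample text, shaped loosely like a factbook economy entry.
sample_page = (
    "GDP - composition by sector: agriculture: 4.5% "
    "industry: 30.1% services: 65.4%"
)
print(number_after_keyword(sample_page, "agriculture:"))
```

Raising occurrence or number_index in this sketch corresponds to choosing a later mention of the word or a later number past it.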
Set up text gathering for channel B:
    %TEXT Crime B: Military, Militia, Militant, Defense, Unrest, Disturbance
This feature retrieves sentences containing these keywords from the page (currently limited to one paragraph of sentence extracts per page).

HOW TO USE IT:
Point your browser to http://www.odci.gov/cia/publications/factbook/index.html
Here you can see and browse what the CIA factbook has to offer
Start the agent to see a demo of extraction and gathering.
No need to GET URL; the agent already has it.

VARIATIONS TO TRY:
Browse the different view buttons #1 through #8
Try different searches with different keywords for text extraction
Try different searches with different keywords for OR line evaluation
Make agents for different fact book topics by changing keywords
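
The channel B text gathering described under HOW TO SET IT UP (count keyword hits and collect the sentences that contain them) can be approximated with the Python sketch below. It is an illustration only, not the agent's implementation: the function name and sample text are hypothetical, and the sentence-splitting rule and case-insensitive substring matching are assumptions.

```python
import re

# Keywords taken from the channel B example in the setup notes.
KEYWORDS = ["Military", "Militia", "Militant", "Defense", "Unrest", "Disturbance"]


def extract_keyword_sentences(text, keywords):
    """Return (hit_count, paragraph) for one page of text.

    hit_count is the total number of keyword matches; paragraph joins
    every sentence that contains at least one keyword.
    """
    # Assumption: keywords match case-insensitively and as substrings,
    # so "Defense" would also match inside "defenseless".
    pattern = re.compile("|".join(re.escape(k) for k in keywords), re.IGNORECASE)
    # Assumption: a sentence ends at '.', '!' or '?' followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    hits = [s for s in sentences if pattern.search(s)]
    hit_count = len(pattern.findall(text))
    # One paragraph of extracts per page, mirroring the limit noted above.
    return hit_count, " ".join(hits)


# Made-up sample text for demonstration.
sample_text = (
    "The military budget grew. Farming is common. "
    "Local militia groups caused unrest."
)
count, extracts = extract_keyword_sentences(sample_text, KEYWORDS)
print(count)
print(extracts)
```

Swapping in a different keyword list is the sketch's equivalent of making agents for other fact book topics.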