Homeland Security is Using Text Profiling to Discriminate “Fact” from “Opinion”

Cornell University News Service reports that:

A new research program by a Cornell computer scientist, in collaboration with colleagues at the University of Pittsburgh and University of Utah, aims to teach computers to scan through text and sort opinion from fact. The research is funded by the U.S. Department of Homeland Security, which has designated the consortium of three universities as one of four University Affiliate Centers (UAC) to conduct research on advanced methods for information analysis and to develop computational technologies that contribute to national security. Cornell will receive $850,000 of $2.4 million in funding provided for the consortium over three years…

The new research will use machine-learning algorithms to give computers examples of text expressing both fact and opinion and teach them to tell the difference. A simplified example might be to look for phrases like “according to” or “it is believed.” Ironically, Cardie said, one of the phrases most likely to indicate opinion is “It is a fact that ...”

Recall that words associated with emphatic factual assertion are used to discriminate between “racist” and “anti-racist” text by the EU-funded anti-majority artificial intelligence watchdogs.  In their research the appearance of such words would tend to indicate that the work was “racist” where “racist” was defined by similarity to the writings of selected “racist” authors.  So does this mean Homeland Security’s “fact vs opinion” discriminators will classify “racist” writings as “opinion” and “anti-racist” writings as “fact”?  Well, perhaps, but what is more important is why this “ironic” correlation exists. 

We can test this assertion about the phrase “It is a fact that ...” by running a simple google search on that phrase.  What we find is that there are two very different ways in which “It is a fact that…” can be used:  1) To assert the primary point of the writing.  2) To support the primary point of the writing.  Using “It is a fact that…” as part of an opinion piece means the author isn’t necessarily asserting his primary proposition but is rather asserting supporting arguments which he believes are verifiable facts.  The latter makes sense and, particularly when expressing opinions violating sensibilities of the likely audience, requires the emphatic use of verifiable facts.  On the other hand, when one is reinforcing the sensibilities of the target audience, the primary proposition of the opinion piece is frequently asserted as fact which the audience is likely to accept without objection precisely because it is the common sensibility of the audience.

Posted by James Bowery on Monday, September 25, 2006 at 10:47 AM in
Comments (3) | Tell a friend

Comments:

1

Posted by Alex Zeka on September 26, 2006, 04:22 AM | #

I find all this highly suspicious. If I state that homosexuals have a lower time preference because of their inability to have children and hence lack of a stake in the future, is that a fact or an opinion? Is the Big Bang theory, which is doubted by many scientists, a fact or an opinion? For that matter, is this distinction between fact and opinion itself a fact or an opinion?

These are just a bunch of semi-autistic computer researchers who, not being able to themselves understand human discussion, are trying to reduce it what they can understand: computer algorithms. I’m not buying.

2

Posted by James Bowery on September 26, 2006, 01:11 PM | #

Don’t be confused by their failure to come up with an objective distinction (operational definition independent of human judgement) between “fact” vs “opinion”.  All they have done is ask some humans to use their judgement to classify some writings as “fact” and others as “opinion” and then used pretty standard data mining techniques to train a computer program to mimic that judgement against a much larger sample of texts.

The best the computer can do under these circumstances is no better than the selected human consensus can do.  Indeed, as I pointed out in an earlier comment on “word sense disambiguation” and its application to creation of coherent lexicons, the use of humans as the standard is precisely where these approaches are failing to realize the potential of computer algorithms.  There is a battle brewing within the philosophy of science over precisely this sort of standard and it is going to erupt throughout all of academia, the humanities as well as sciences.

The trigger of this eruption is the termination of the long hiatus—now nearly 50 years—of rational research into artificial intelligence.  I won’t go into all of the dimensions of the abominable history of artificial intelligence research, but suffice to say that with the resurgence of algorithmic information theory, things are being reformulated rapidly.

The bottom line is this:

Information and knowledge are inseparable.  If you can formulate information theory consilient with computer technology you have a rational basis for artificial intelligence.  Algorithmic information theory is that consilience and it has been in hibernation for decades.

The principle result of algorithmic information theory is that the shortest program that can output a text string represents the true information content of that text string.  It is Ockham’s Razor on steroids.

This doesn’t mean that a computer program can be written that will find that shortest program—indeed it has been proven that such a metaprogram cannot exist in the general sense.  But what it does mean is that we have an objective test of the relative truthfulness of two discriptive frameworks.  The one which results in the shortest description of the world—the one that is most coherent—most consilient—that “hangs together’ the best—is also the most truthful.  We can still have human judgement play a part of course—but that part is put to the emperical test of now rigorously defined epistemology.

The failure of “political correctness” as a conceptual framework is, like the failure of the canons of prior theocracies, due to their need to inject confusing political construct at the wrong level of discourse.  The correct level of discourse for political correctness, as with much theocratic nonsense, is as an instance of ethnic nepotism hijacking the moral machinery of competing ethnicities.  If placed at its proper place in the universe of discourse, the world becomes more comprehensible precisely because its description is simpler.

3

Posted by TNB Alerts on September 26, 2006, 11:24 PM | #

Here let me help them:

It is a fact that TNB is a major problem throughout the world today.  It is a fact that negroes are 13% of the US population but are responsible for over 50% of the crime.  Just the facts, man, just the facts.

Post a Comment:

Name: (required)

Email: (required but not displayed)

URL: (optional)

Smileys

You must prefix http://anonym.to/? to gnxp.com links...
e.g., http://anonym.to/?http://www.gnxp.com/...

Copy your comment to the clipboard or paste it somewhere before submitting
it just in case the software loses it because the session time has been exceeded.

Remember my personal information

Notify me of follow-up comments?

Submit the word you see below: (not needed for preview)


Next entry: Krauthammer: Everyone Influential is Jewish

Previous entry: The Little Lexicon

image of the day

Existential Issues

White Genocide Project

Of note

Majority Radio

Recent Comments

Also see trash folder.

Leon Haller commented in entry 'Golden Dawn - Greece' on 05/25/12, 12:41 AM. (go) (view)

pletcheroka commented in entry 'A repeatable comment for mass-pasting on American public message boards' on 05/25/12, 12:36 AM. (go) (view)

Graistetrisog commented in entry 'Top Wog embraces his Inner Englishman' on 05/25/12, 12:19 AM. (go) (view)

BomeDeddell commented in entry 'ATTRITION THROUGH ENFORCEMENT: Government's Own Data Point to a Cost-Effective Strategy' on 05/24/12, 09:48 PM. (go) (view)

indernMix commented in entry 'ATTRITION THROUGH ENFORCEMENT: Government's Own Data Point to a Cost-Effective Strategy' on 05/24/12, 09:42 PM. (go) (view)

Wandrin commented in entry 'Golden Dawn - Greece' on 05/24/12, 04:44 PM. (go) (view)

Lee John Barnes commented in entry 'Golden Dawn - Greece' on 05/24/12, 03:20 PM. (go) (view)

grecian commented in entry 'Golden Dawn - Greece' on 05/24/12, 03:10 PM. (go) (view)

Wandrin commented in entry 'Golden Dawn - Greece' on 05/24/12, 02:04 PM. (go) (view)

Salvatore Quinto commented in entry 'More on the Indian beauty question' on 05/24/12, 12:47 PM. (go) (view)

Classic Sparkle commented in entry 'Golden Dawn - Greece' on 05/24/12, 12:07 PM. (go) (view)

Cobus commented in entry 'A genocide in South Africa' on 05/24/12, 10:14 AM. (go) (view)

daniel commented in entry 'Golden Dawn - Greece' on 05/24/12, 09:49 AM. (go) (view)

uh commented in entry 'Golden Dawn - Greece' on 05/24/12, 08:54 AM. (go) (view)

daniel commented in entry 'Golden Dawn - Greece' on 05/24/12, 07:41 AM. (go) (view)

uh commented in entry 'Golden Dawn - Greece' on 05/24/12, 07:18 AM. (go) (view)

daniel commented in entry 'Golden Dawn - Greece' on 05/24/12, 06:53 AM. (go) (view)

Swan commented in entry 'The facial proportions of beautiful people' on 05/24/12, 06:48 AM. (go) (view)

Swan commented in entry 'The facial proportions of beautiful people' on 05/24/12, 06:47 AM. (go) (view)

daniel commented in entry 'Golden Dawn - Greece' on 05/24/12, 06:32 AM. (go) (view)

Guest commented in entry 'The Torment of the Mulattoes' on 05/24/12, 06:17 AM. (go) (view)

daniel commented in entry 'Beyond the 14 words' on 05/24/12, 03:05 AM. (go) (view)

Lee John Barnes commented in entry 'Golden Dawn - Greece' on 05/24/12, 02:31 AM. (go) (view)

daniel commented in entry 'Golden Dawn - Greece' on 05/24/12, 02:03 AM. (go) (view)

Captainchaos commented in entry 'Beyond the 14 words' on 05/23/12, 11:08 PM. (go) (view)

Captainchaos commented in entry 'Golden Dawn - Greece' on 05/23/12, 09:13 PM. (go) (view)

Leon Haller commented in entry 'Golden Dawn - Greece' on 05/23/12, 07:47 PM. (go) (view)

Swan commented in entry 'Indian beauty' on 05/23/12, 12:52 PM. (go) (view)

Lee John Barnes commented in entry 'Golden Dawn - Greece' on 05/23/12, 12:45 PM. (go) (view)

Swan commented in entry 'More on the Indian beauty question' on 05/23/12, 12:31 PM. (go) (view)

Leon Haller commented in entry 'Beyond the 14 words' on 05/23/12, 11:43 AM. (go) (view)

Leon Haller commented in entry 'Golden Dawn - Greece' on 05/23/12, 11:32 AM. (go) (view)

Mellaba Pechios commented in entry 'Golden Dawn - Greece' on 05/23/12, 07:55 AM. (go) (view)

daniel commented in entry 'Beyond the 14 words' on 05/23/12, 03:51 AM. (go) (view)

Leon Haller commented in entry 'Golden Dawn - Greece' on 05/22/12, 10:40 PM. (go) (view)

General News

Science News

The Writers

Each author's name links to a list of all articles posted by the writer; the hashes link to authors' homepages.

Links

Endorsement not implied.

Controlled Opposition

Crime

General

Immigration

Islam

Jews

Nationalist Political Parties

Science

Whites in Africa