SBIR-STTR Award

Applying Latent Group Models to Web Publishing
Award last edited on: 2/2/2009

Sponsored Program
SBIR
Awarding Agency
NSF
Total Award Amount
$600,000
Award Phase
2
Solicitation Topic Code
-----

Principal Investigator
Mark Bucciarelli

Company Information

Cross Cut Media LLC (AKA: Crosscut Media LLC~Cross Cut Media LLC)

30 Kestrel Lane
Amherst, MA 01002
   (413) 367-6737
   N/A
   www.crosscutmedia.com
Location: Single
Congr. District: 02
County: Hampshire

Phase I

Contract Number: ----------
Start Date: ----    Completed: ----
Phase I year
2007
Phase I Amount
$100,000
This Small Business Innovation Research (SBIR) Phase I project will apply recent advances in knowledge discovery to bridge the gap between what is known about an Internet viewer and what is done with this knowledge to improve user experiences and business outcomes. Recent machine learning research has shown that latent group models perform extremely well compared to other relational probabilistic models (such as the more traditional relational Bayesian networks) in most problem categories. This research will investigate if latent group models can help a publisher make better publishing decisions. Online publishers operate in an environment of massive quantities of input data from disparate sources, non-homogeneous attribute data, and a business requirement for computation agility. Solving this problem will require advances in modeling, algorithmic, and implementation technologies. Today, online content publishers aggregate enormous volumes of data about their viewers from their web logs, registration systems, third-party web analytics providers, and ad serving systems. Mostly, these systems operate independently with a primary focus on describing what has happened. For example, a web site analytics package can best describe how many visitors came to this page yesterday, while an ad management system accurately counts how many ads were served on this section last month. Through analysis these tools can provide information used primarily for medium to long-range planning. None of these tools assist a publisher answer the question, "what does this viewer want from my site on this page at this point in time?" Answering this question is the key to unlocking a new path to growth for the online content publisher. If the publisher can anticipate the needs of its users, it can better hone its content and navigation to the specific needs of its diverse audience. This in turn leads to improved viewer satisfaction and more time spent, consuming more content from the content publisher

Phase II

Contract Number: ----------
Start Date: ----    Completed: ----
Phase II year
2008
Phase II Amount
$500,000
This Small Business Innovation Research (SBIR) Phase II project will extend the work begun in Phase I to apply advances in knowledge discovery to bridge the gap between what is known about an Internet viewer and what is done with this knowledge to improve user experience and business outcomes. The effort will develop new algorithms to combine implicit and explicit taxonomies to build content networks. A live feedback loop that uses multivariate test results will be used to adjust and refine clusters of users in order to establish specific parameters which can subsequently be acted on. Online content publishers aggregate enormous volumes of data about their viewers from web logs, registration systems, third-party web analytics providers and ad-serving systems. Mostly these systems operate independently with a primary focus on describing what has happened. Through a deeper analysis, which will be enabled by the current effort, content providers will be able to use this data in more predictive ways. This in turn will allow content providers a more intelligent tool for serving higher-value content throughout the online experience. If successful, this will have implication for new rich media services and e-commerce