Previous versions

WebClass 0.1

WebClass 1.0


WebClass 0.1

An Intermediary for the Classification of HTML Pages  developed by

LACAM Dipartimento di Informatica 
Università degli Studi di Bari 
via Orabona, 4 
70126 Bari

Java Technology Center 
IBM SEMEA Sud 
via Tridente, 42/14 
70125 Bari

 

Description

 

WebClass is a prototypical workbench written in Java for experimenting the application of Statistical and Case-Based Reasoning methods to automatic Web page classification.

 

System Requirements

Platform : Any Java 1.1 (or higher) enabled platform 
Java development tool: JDK 1.1 
Processor: Intel 486 (or higher)

 

Installation Procedure & Testing

The distribution package webclass.zip contains the following files:

 

README.html      : this file 
webclass.jar          : WebClass jar file 
html.jar                  : jar file corresponding to a sub-directory containing sample Web pages. 
experiment1.jar     : jar file corresponding to a sub-directory containing Webclass configuration data files 
experiment2.jar     : jar file corresponding to a sub-directory containing Webclass configuration data files

 

To install, make the following steps:

  1.        Unzip the file webclass.zip into a directory you have chosen
  2.       Go to this directory
  3.       Extract webclass.jar with command jar -xfM webclass.jar
  4.       Extract html.jar with command jar -xfM  html.jar
  5.       Extract experiment1.jar with command jar -xfM  experiment1.jar
  6.       Extract experiment2.jar with command jar -xfM  experiment2.jar

The system User Manual and other documentation is not yet available. Anyway you can test the system by doing the experiments described hereunder. The system is not "stable" (it is written by our university students and its main aim is to "demonstrate" hopefully good ideas). Therefore, please be patient for slowly running or for eventually system errors. 
 

The First Experiment 
In the sub-directory experiment1 there are all WebClass configuration files for running a simple experiment with the Web pages stored in the sub-directory html.  In particular, by copying all the content of the sub-directory experiment1in the main directory and by typing the command java WebClass, the system will be ready for running the experiment.

 

Note: 
In this experiment the system will start considering  4 classes, i.e. Astronomy, Car, Moto, Jazz, as its Knowledge Base. In this configuration there are 5 training pages and 5 testing pages for each class . You can see all the training and testing pages by selecting the menu-item "Categories" from the pop-up menu "Browse" in the WebClass main window. Then, the Categories Management Window will appear showing you all categories currently active in the workbench (you can also add other ones for doing other experiments) and allowing you to browse the training and testing pages for each category (also in this case you can add other pages for doing other experiments).

 

You can explore the system features by using the experiment 1 knowledge base: for example, you can classify a Web page by selecting the menu-item "Page" from the pop-up menu "Classify". A micro-browser will appear allowing you to load a web page from the directoryhtml (buttons: openfile and reload) and classify it by pushing one of the classification buttons: Classify by Centroids, Classyfy by NN, Classify by k-NN. 
You can download the web page to be classified also from Internet/Intranet by writing the http address. Don't forget to configure in the proper way the proxy server or the socks server by selecting the menu-item "Network Configuration" from the pop-up menu "Preferences".

 

The Second Experiment 
In the sub-directory experiment2 there are all WebClass configuration files for running a simple experiment with the Web pages stored in the sub-directory html.  In particular, by copying all the content of the sub-directory experiment2 in the main directory and bytyping the command java WebClass, the system will be ready for running the experiment. 
In this second experiment the training set consists of 128 pages and the test set consists of 64 pages.

 

Future work

 We started to embed the best results produced by WebClass into WBI plug-ins with the aim of building "intelligent proxy servers". 

Publications

 

FAQs

None yet available. Send all requests/comments to: Pietro Leo,  IBM Java Technology Center, Bari (Italy). 
 

 

Last modified 10/01/2000



 

WebClass 1.0

An Intermediary for the Classification of HTML Pages developed by

LACAM Dipartimento di Informatica 
Università degli Studi di Bari 
via Orabona, 4 
70126 Bari

Description

WebClass is a prototypical workbench written in Java for experimenting the application of Statistical and Case-Based Reasoning methods to automatic Web page classification. 
 

System Requirements

Platform : Java2 (or higher) enabled platform. Since WebClassIII interfaces a MsAccess database, we recommend to install it on a Windows machine in order to run the system on one machine.
Java development tool: JSDK 1.2.* 
WebClass interfaces a MsAccess database by means of JDBC.

Installation Procedure & Testing

The distribution package WebClass1.0 contains the following files:

the leggimi.txt file (readme.txt).
the .bat file (to run the application)
the .class files (WebClass1.0)
the .java files (WebClass1.0 source code)

Installation procedure:

o      Unzip the file WebClass10 con codice.zip into the application folder.

o      Modify the CLASSPATH environment variable adding the “..../uniba” folder

o      Create an ODBC connection to the database.(WebClassDabaBase.mdb) 
The name of the ODBC connection must be: “Web”.
(Control Panel/administrative tools/ODBC)

o      Run WebClass1.0 double-clicking on the esegui.bat file.
        login: Luigi
        password: Admin

FAQs

None yet available. Send all requests/comments to: Donato Malerba
 

Last modified 01/10/2004

 


KDDE  Template

KDDE presentations have to be based on this template.

Group members and students who are taking a degree, are invited to use it.

Discovery Science 2016

The 19th International Conference on Discovery Science (DS 2016) will be held in Bari on October 2016, 19th-21st. KDDE Group is organizing it.

ALT 2016

Algorithmic Learning Theory 2016

Bari, Italy, 19-21 October, 2016.

Powered by CMSimple| Template: ge-webdesign.de| Login