Input data format
You can copy- paste this set into the program as:
The position of Y-values in both tables is:
The default way is (A), with dependent variable (Y) followingindependent variables (X1-X5). In the case (B) you should use REVERSED=1.The first line of data should indicate number of rows (data entries) thatare available in the data for the training data set.
Suppose, you want to use 2 last rows as a test set. This can be doneby :
The program will know that there are two data set. The first onewill be used for training (and in general, always the first)and the second one to test the algorithm performance. Up to 10 setscan be added in the same way and only the first set will be used to trainthe program.
If you do not know the target values of the test set, the first lineshould be changed to:
If data sets can contains names of data entries, this should be indicatedby NAMES=1. An example of the same dataset with names is:
You can also see that there is no requirement for alignment of datain columns. The data can be separated with any number of tabs and spaces.See FAQ if you have questions. How to cite this applet? Are you looking for a new job in chemoinformatics?
Copyright 2001 -- 2016 http://www.vcclab.org. All rights reserved.