Molegro Virtual Docker v4.0.2

Molegro Virtual Docker is an integrated platform for predicting protein - ligand interactions. Molegro Virtual Docker handles all aspects of the docking process from preparation of the molecules to determination of the potential binding sites of the target protein, and prediction of the binding modes of the ligands. Molegro Virtual Docker offers high-quality docking based on a novel optimization technique combined with a user interface experience focusing on usability and productivity. The Molegro Virtual Docker (MVD) has been shown to yield higher docking accuracy than other state-of-the-art docking products (MVD: 87%, Glide: 82%, Surflex: 75%, FlexX: 58%).
Molegro Data Modeller offers different types of data modelling:
Multiple Linear Regression models simple linear relations between data, and is fast and efficient.
Partial Least Squares reduces the dimensionality of the data set before creating a model. Suitable for data sets with many independent variables.
Neural Networks are able to model highly non-linear relations.
Support Vector Machines are also able to model complex relations and tend to be less prone to overfitting than Neural Networks.
K-Nearest-Neighbors for simple classification.

Different regression types.

Feature Selection and Cross-Validation
Feature selection is easy to set up in the regression wizard: different schemes can be chosen (Forward, Backward, and Hill Climber selection) and be combined with different model selection criteria (Bayes Information Criterion or cross validated R^2). Different descriptor rankings can be employed when searching the descriptors.

Cross-validation is just as easy. Cross-validate using a specified number of random folds, by using Leave-One-Out, or by manually creating folds.

The different visualization types are highly interactive. Selections in the spreadsheet are directly shown in the plots and vice versa. It is also possible to apply different user-defined coloring schemes and apply jitter (add artificial noise to the data plots).

It is possible to visualize high-dimensional data. Using the built-in Spring-mass Map model, high-dimensional data can be projected onto 2D or 3D

Molegro Data Modeller supports chemical data: MDM understands SMILES and SDF files and can create 2D depictions of molecules directly in the spreadsheet or in the 2D plotter.

Molegro Data Modeller offers different kinds of clustering: K-means clustering and threshold-based clustering (both very efficient), and a density-based clustering scheme (which is able to capture more complex cluster shapes).

Principal Component Analysis (PCA).
Principal Component Analysis is a method for reducing the dimensionality of a dataset. A new set of principal components is created using linear combinations of the original descriptors. The number of descriptors is then reduced by only keeping the descriptors contributing most to the variance.

Algebraic Data Transformations.

It is possible to work with algebraic transformations directly on columns: for instance, "New Activity = log(Act) + Beta^2" will create a new column based on the expression.

Outlier Detection
Molegro Data Modeller provides two methods for locating abnormal data:
A quartile based method which checks how far away a data point is from the 25th and 75th percentile. This method examines each descriptor individually.
A density-based method which calculates a local density for each data point. Data points with a low density are far away from other data points and could be outliers.

Advanced Subset Creation
Molegro Data Modeller offers a grid-based method for creating a diverse subset of a dataset. It is possible to create grids in an arbitrary number of dimensions, and if working with 2D and 3D grids they can be visualized directly in the data plotters.

Molegro Data Modeller works with:
Windows XP and Vista.
Mac OS X (10.4 and later, PowerPC and Intel supported).
Most major Linux distributions.

Other Features
Scrambling (shuffling) of columns and "replace with random values" for performing y-Randomization.
Data preparation: scaling, normalization, repair of missing values.
Statistical measures: Pearson and Spearman correlation, Confusion matrices, F-measures, and many others.
Correlation Matrix.
Cross-term generation.
Custom Data Views and Grid Molecule Depictions.
Similarity Browser (Euclidean, Manhattan, Cosine, and Tanimoto measures).
Gnuplot export (for creating and customizing publishing quality plots).
Online help and automatic check for updates.

Home Page -

Download Links:
HotFile | RapidShare



Back To Top