This task compares up to 10 lists of elements, computing the union and intersection
between identifiers from the lists and plotting a Venn diagram if possible.
A list can consist e.g. of sequence names, gene names, GeneIds or accession numbers.
Additionally, there can be associated values for each input
(e.g. expression values for GeneIds),
which will be carried into the result lists.
The result page lists the number of elements and the first few elements from the following result sets:
union of all elements from the input lists
elements in the intersection of lists
(if more than two input lists were analysed, (n1) subsets will be given,
e.g. for 4 input lists
elements common to exactly four lists
elements common to exactly three lists
elements common to exactly two lists
duplicate elements within each of the lists
elements that are unique to a list (i.e. not in any other input list)
The first few elements of each result set are listed, and the full results can
be exported to Excel or in tabseparated format (*.tsv).
The exported lists also contain the associated values for each element from each input list, if available.
For plotting Venn diagrams the R package
VennDiagram
is used. It is described in
Hanbo Chen and Paul C Boutros (2011)
VennDiagram: a package for the generation of highlycustomizable Venn and Euler diagrams in R BMC Bioinformatics 2011, 12:35
Select the number of lists you want to compare. Up to 10 lists can be analysed.
If two, three or four lists are analysed, the output will include a Venn diagram.
For each list you will have to paste the values or supply a file (see below).
List <number>
Each list can be supplied in two different ways:
The elements can be entered with copy & paste into the text field; optional associated
values can be given in an extra textfield.
To be associated correctly to the main listvalues they must be the same number and order as the list elements.
Elements in these text fields can be separated by blanks, newlines or commas.
A file containing the list (and optional additional values) can be uploaded as a tabseparated file.
Note that only the values in the first column are used for comparison with the other lists,
the other columns are used as associated values. The number of columns to be used can be set (e.g.
if "3 columns" are selected, the first column contains the elements used for comparison, the next 2 columns
are carried along to the output).
Options
Case Sensitivity
Check this option, if uppercase and lowercase letters should be distinguished in
the comparison (default is caseinsensitive comparison, i.e. element "act" will be the same as "ACT").
Header Lines
Check this option, if the first (only one!) line of each uploaded file shall be removed before the analysis starts.
This is to remove potential header lines containing e.g. descriptions of columns.
Compute Probability
This option is only available, if exactly two lists are compared.
Usually your input lists are subsets of a large list of entities
(the "population", in statistical terms), e.g. genes or promoters.
Based on the lengths m and n of the input lists (counting only unique elements!),
the cardinality i of the intersection of the
two lists and the cardinality N of the population (a positive interger, which you must enter into
the textfield), two probability values are computed:
the probability that two lists of m resp. n elements picked randomly from the population
have exactlyi elements in common, and
the probability that two lists of m resp. n elements picked randomly from the population
have at leasti elements in common
In the output, the probabilities are printed both as percent values and in scientific notation.
If you do not enter a value into the textfield, the probabilities are not computed.
a short analysis parameter section, listing the input lists
a Venn diagram, if exactly two, three, or four lists were analysed
a list of result sets containing union, intersection, duplicate and unique elements
export option for each result set
The exported lists contain the associated values from each input list for each element.
If no associated values were suplied, the union and intersection lists show from which input
list they originated ("1" for present, "" for not present).
Example
Example Input:
List1
Associated values for List1
List2
Associated values for List2
Abcg8 Abhd2 Ace2 Actl7a
0,1234 0,3 0,14 0,34
Ace2 Actn3 Adam15 Adam1a Actl7a
0,4 0,4 0,34 0,14 0,35
Example Output:
Venn diagram
(click to enlarge)
Union / Intersection
Union
7 elements
Abcg8, Abhd2, Ace2, Actl7a, Actn3, Adam15, Adam1a
Intersection
2 elements
Ace2, Actl7a
Probability values
probability that 2 random subsets (having 5 resp. 4 elements) picked from a set of 20 elements, have an intersection
of exactly 2 elements is 21.6718% ( 0.2167182663e0 )
of at least 2 elements is 24.8710% ( 0.2487100103e0)
Single Lists
List1 (Input List1)
4 elements
Abcg8, Abhd2, Ace2, Actl7a
4 elements
Abcg8, Abhd2, Ace2, Actl7a
0 elements

2 elements
Abcg8, Abhd2
List2 (Input List2)
5 elements
Ace2, Actl7a, Actn3, Adam15, Adam1a
5 elements
Ace2, Actl7a, Actn3, Adam15, Adam1a
0 elements

3 elements
Actn3, Adam15, Adam1a
Export selected list as
When exporting the intersection of both lists to Excel the output looks like this:
Element
value(s) from List1
value(s) from List2
Actl7a
0,34
0,35
Ace2
0,14
0,4
If no associated values were supplied, the union of both lists would look like this: