Gene Information |
Locus tag | VC0395_A1430 |
Symbol | tolA |
ID | 5135301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 1531590 |
End bp | 1532660 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640532888 |
Product | tolA protein |
Protein accession | YP_001217373 |
Protein GI | 147673834 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | [TIGR02794] TolA protein |
|
|
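The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of the record.

```python
from __future__ import annotations


def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS
    ends with exactly one stop codon that is not counted in the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(1531590, 1532660)
print(gene_len, protein_len)
```

Under these assumptions the result agrees with the Gene Length (1071 bp) and Protein Length (356 aa) fields.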
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000000000458161 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAGA ATAAGTCCAG AAAAAGTAAT GATGCTAAAT CGATAACTAT TTCGCTGGCG ATGCACGGTG CTTTGGTCGC GATATTGCTC TGGGGAGCCG ATTTCACCAT GTCTGATCCT GAACCGACAG GACAGATGAT TGAGGCGGTT GTCATTGATC CTCAGCTCGT TCGTCAGCAA GCTCAGCAAA TTCGTAGTCA GCGTGAAGAG GCGGCGAAAA AAGAGCAAGA GCGACTGGAT AAACTCCGTC GTGAAAGCGA ACAGCTAGAG AAAAATCGAC AAGCAGAAGA AGAGCGGATC CGCCAGTTGA AAGAGCAACA GGCTAAAGAA GCCAAAGCGG CTCGTGAAGC GGAAAAGTTG CGTGAGCAGA AAGAACAAGA GCGCTTAGCT GCTGAGCAGA AAGCTCGCGA AGAAAAAGAG CGTGCTGCAA AAGCAGAGGC TGAACGCAAA GTGAAAGAAG AAGCCGCAAA AAAAGCTGAG CAAGAACGCG TGGCGAAAGA AGCCGCAGCA GCAAAGGCAG AACAGCAACG TATTGAACGT GAAAAAGAAG CCAAGCTTGC GGAAGAAAAA GCCAAACGTG AAAAAGAAGT GGCAGCGAAA GCTGAGCAAG AACGTTTAGC AAAGGAAAAA GCGGCCAAAG AGGCGGCAGA TAAAGCCAAG AAAGAAAAAG AACGCGCGGC AAAAGCAGAA GCTGAACGTA AAGCTCAAGA AGCGGCCTTG AATGATATTT TTGGCAGCTT GAGTGAAGAA AGTCAGCAAA ATAATGCGGC ACGTCAGCAG TTTGTTACTT CGGAAGTAGG GCGCTATGGA GCCATCTACA CCCAGCTCAT CCGGCAGAAT CTGTTAGTGG AAGACAGTTT TCGAGGAAAG CAGTGTAGGG TAAACCTAAA GCTTATTCCT ACTGGGACTG GCGCATTACT GGGTAGCTTA ACCGTTTTGG ATGGTGACAG TCGTTTATGT GCAGCAACCA AACGCGCTGT TGCCCAAGTG AATAGTTTTC CATTGCCGAA AGATCAACCT GACGTTGTGG AAAAACTAAA GAATATTAAT TTAACCGTAG CACCTGAATA A
|
Protein sequence | MKENKSRKSN DAKSITISLA MHGALVAILL WGADFTMSDP EPTGQMIEAV VIDPQLVRQQ AQQIRSQREE AAKKEQERLD KLRRESEQLE KNRQAEEERI RQLKEQQAKE AKAAREAEKL REQKEQERLA AEQKAREEKE RAAKAEAERK VKEEAAKKAE QERVAKEAAA AKAEQQRIER EKEAKLAEEK AKREKEVAAK AEQERLAKEK AAKEAADKAK KEKERAAKAE AERKAQEAAL NDIFGSLSEE SQQNNAARQQ FVTSEVGRYG AIYTQLIRQN LLVEDSFRGK QCRVNLKLIP TGTGALLGSL TVLDGDSRLC AATKRAVAQV NSFPLPKDQP DVVEKLKNIN LTVAPE
|
| |
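The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be removed before computing anything on them. The sketch below shows one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two blocks of the gene sequence are used as sample input.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the display whitespace from a grouped sequence and uppercase it."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in ("G", "C"))
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown in this record.
prefix = clean_sequence("ATGAAAGAGA ATAAGTCCAG")
print(len(prefix), gc_fraction(prefix))
```

Passing the full Gene sequence field through the same two functions gives a value that can be compared with the GC content field, which the record reports rounded to a whole percent.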