Gene Cthe_2386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2386 
Symbol 
ID4811038 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2851543 
End bp2853252 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content40% 
IMG OID640107799 
ProductVanW 
Protein accessionYP_001038781 
Protein GI125974871 
COG category[V] Defense mechanisms 
COG ID[COG2720] Uncharacterized vancomycin resistance protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGGTC CAGCAGTTGA AAATAAAAGT TTGAACCCAA AAACAAAAAT AATACTAATC 
TGCCTACTGG TATTAATATC GGTTTGCTTA CTTGTATCCA TATTTTTCGT TTATTCCACT
CTGTCCTATG ACAGGGTCTA TAAAGGAGTG TTTATAAACG ATACAGATGT AAGCCGGATG
ACATTTGATG ATGTTTGCAA TCTTCTTAAA TCAAAGTATT CCGAGAAGGC AAAAGATTTA
AAGGTGGTTT TAAACCATGA AGGCATGGCC GCAGAGGTTG ACCTGTCTGA AATAGATTTG
GAATACAAGA TTGAAGAAGC CGCACAAAAA GCTTTTGATG TGGGCAGAAA AGGAAATATC
TTTGAAAGGT TGTCCGAGAT TTTTAACACC AGCAGGGAAG GTGTTCGGCT AAGTCTGGAG
TATTCTTATA ACTTAGGCAA AGTAAATGAG ATAATCGATG ATTTCTATTC CAAAACATTG
ATTTCGGTCA AAGAAGCAAA CCTGTCCATT CAAGACAACA AGGTAACCCT TACCACCGGT
CATCCCGGCA AATCAATAGA CAAAGAAAAA GCTTTGGAAA TTATTGATAA TTCTATTAAA
ACATGCGAGG GCGGAACTTT TGACGTGCCT GTTATAACAA CCATGCCAAA AGCTATCGGC
GTCGATGACA TTTACAATCA AATAGTCGTA CAACCGGTGG ATGCAAAAGC AGTCGTTGAA
AACAACAAGG TAAGAGTTGT TCCCCACGAG TTGGGCCGCG AAATTTCAAA GTCCGAACTT
GCCGACATAA TAAAGGAAAA TGAAAATACG ACGGACAAGG AAATATTGCT TCCGGTAAAA
TTTATACAGC CGAAAGTTAC AACCGATGAA GTAAACGCAA AGCTTTTCAG AGATGTTCTT
GCCTCATCAA GTACTTCATT TTCCACATCC GGTCAAAACA ATTACAACAG AGGCATAAAC
ATAGGCGTTG CGGCATCAAA AATAAACGGA AAAATTCTTG CTCCCGGCGA AGTTTTCTCA
TTCAATGACG TAGTCGGTCC AAGAACGGTG GCAAACGGTT TTAAGATTGC AAAAGAATAT
GTCAACGGTA AAATAGTGGA CGGAGTCGGC GGAGGAGTGT GCCAGGTTTC GTCAACTCTG
TACAGTGCGG TGCTTTTTGC CGACCTTGAG ACAGTGGAAA GACAAAATCA TATGTTTACG
GTATCATACA TCCCTCTGGG AAGAGATGCC GCCGTAGCAT ACAACGAACT GGACTTTAAG
TTTAAAAACA ATACAAACTG GCCGATTAAA ATTGTGTCCA GCGTAAAGAA CAACACCGTA
TCTTTTACAA TTTACGGCAC AAAGGAAGAG CCGGGAAAAA CCGTAGAACT TAAACATGTT
CAGGTAAGTT CCCAGCCATC ACCCGTCAAA TATATAGACG ACCCAAATCT TGAAGAAGGC
AAGACAGTGG TTGTTCAATC CGGATATACG GGATATACAA TAGACACATA TAAAATAGTA
AAGATTAACG GTAATGTTGT AAGTAACAAC AAGATTCACA GAAGCATCTA CAGACCCTAT
GAAACCATAA TTAAGAGAGG TACCAAAAAA GTTGAGAAAG TGGTACAAAC CAACTCTCCG
GCTCCCGAAA CAGTATTATC GCCTACATCT GATCCAACAC CCGATCCTGC AGCTTCGAAT
GAAGTTGTTG ATGAAACACT CGTAGAATAA
 
Protein sequence
MTGPAVENKS LNPKTKIILI CLLVLISVCL LVSIFFVYST LSYDRVYKGV FINDTDVSRM 
TFDDVCNLLK SKYSEKAKDL KVVLNHEGMA AEVDLSEIDL EYKIEEAAQK AFDVGRKGNI
FERLSEIFNT SREGVRLSLE YSYNLGKVNE IIDDFYSKTL ISVKEANLSI QDNKVTLTTG
HPGKSIDKEK ALEIIDNSIK TCEGGTFDVP VITTMPKAIG VDDIYNQIVV QPVDAKAVVE
NNKVRVVPHE LGREISKSEL ADIIKENENT TDKEILLPVK FIQPKVTTDE VNAKLFRDVL
ASSSTSFSTS GQNNYNRGIN IGVAASKING KILAPGEVFS FNDVVGPRTV ANGFKIAKEY
VNGKIVDGVG GGVCQVSSTL YSAVLFADLE TVERQNHMFT VSYIPLGRDA AVAYNELDFK
FKNNTNWPIK IVSSVKNNTV SFTIYGTKEE PGKTVELKHV QVSSQPSPVK YIDDPNLEEG
KTVVVQSGYT GYTIDTYKIV KINGNVVSNN KIHRSIYRPY ETIIKRGTKK VEKVVQTNSP
APETVLSPTS DPTPDPAASN EVVDETLVE