Gene Cthe_1062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1062 
Symbol 
ID4811360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1268083 
End bp1269528 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content38% 
IMG OID640106484 
ProductVanW 
Protein accessionYP_001037487 
Protein GI125973577 
COG category[V] Defense mechanisms 
COG ID[COG2720] Uncharacterized vancomycin resistance protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGACATAG AAGGTATGGA TATGTTGCTT GATTTTTTTT CGAAAAACAA AACTAAAATT 
TTAATATTTG CGGCAAGCCT TTTGTTGCTT GTAATAGCTT CAGTTTCGAC TTATGTGGTA
GTTGTTCTGA ACAGAAAAAC TTTTTACGAC GGTATTGCTG TTGAAGGGAT TGATGTATCC
GGATTGACTG TTGATGAGGC AAGAGAAAAA GTGGAAAAAA AGCTTGACAG GATTGTATAT
GAAAATAGTC TTTTGCTAAA CTATGAAGGC ATGACATGGA GGATCGGGCT TTCAGATATA
TCTTACGATT TCCTTGTGGA TGATGCAATA AAAAGAGCCT ATTCAATAGG AAGAGAAGGC
AGTGTTTTAA AAAGGCTTAA AACCATAAGG AATTTAAAGT CTGATAAAAA GAACGTTTTA
GCGAAAGTTG TCTTTTCAAG GCCTCTTCTT GAGGAATATA TTACCAGCAT CAAAAAGCAA
GTTGACGAAA ATCCGAAAGA TGCAACGGTA ACTTACCAAA ATGGCAATAT TATGTTTGAA
AAAGAGATTA TCGGGCGATT TGTGGATGTT GACAAAAATC TTGGTTTGTT AGAAAATAAA
TTAATAAAGA GGGATTTCTC GCCTTTCGAA CTTGAAGTCA CAAATGTTTA TCCGAAAATT
ATGTACAAGG ATATTTCTCA TATTGAAGAG GTAATTTCCT CTTTTTCCAC TGTTTTTAAT
TCAGCCAATG TCAACAGAAG CCATAATATC AAACTTGCGT GTGAAAGAAT CAATAACACA
GTGTTGCTTC CCGGTGAGAC ATTTTCCATG GATGCTTCTT TGGGGTCAAG AACAAAAGAA
AATGGTTACA AAGACGCACC TGTCATAGTT AAAGGCCGCT TGATTGAAGG AGTTGGAGGT
GGTGTTTGCC AGGTTACGTC AACATTGTAT GTTGCAGTAC TCAAGGCAAA GCTTGAGGTG
GTTGAAAGGG TAAAACATTC AATGCCTTTG GGATATGTGG AGCCAGGTCA GGACGCAACA
ATATCGGAAG GCTACATCGA CTTCAAATTC AGAAACAATA CGGATCGTGC CTGCCTTATA
AGTGCATCGG TTGTCGGAAA CAGGATAGAT ATAAAGCTGC TGGGAGCAAA AAGAAATTCA
AATTATGATG TAAGGCTGAA ATCTGTTGTT GTGGAACGTA TTTCTCCTCC GGAAGATGAG
ATAATTGTCG ACAAGTCGCT GCCCAAAGGA GCAGTGAAAA TTGAAAGGGA GCCGGTGCAA
GGTTTGAAGG TGATAGTATA CAGGGAAACT TATGAAAATA ATAGACTTAT TGAAAGGGAA
AAAATATCCG AAGATATTTA TAAACCTGTA CAAGGTTTAA AAAGAGTGGG GCCTTATGAC
AGCAATGATA CGGAAGGCGG AGAAGAAACT GTAAAGGAAG AAGCTATGGA AGAACAAACT
ATATAA
 
Protein sequence
MDIEGMDMLL DFFSKNKTKI LIFAASLLLL VIASVSTYVV VVLNRKTFYD GIAVEGIDVS 
GLTVDEAREK VEKKLDRIVY ENSLLLNYEG MTWRIGLSDI SYDFLVDDAI KRAYSIGREG
SVLKRLKTIR NLKSDKKNVL AKVVFSRPLL EEYITSIKKQ VDENPKDATV TYQNGNIMFE
KEIIGRFVDV DKNLGLLENK LIKRDFSPFE LEVTNVYPKI MYKDISHIEE VISSFSTVFN
SANVNRSHNI KLACERINNT VLLPGETFSM DASLGSRTKE NGYKDAPVIV KGRLIEGVGG
GVCQVTSTLY VAVLKAKLEV VERVKHSMPL GYVEPGQDAT ISEGYIDFKF RNNTDRACLI
SASVVGNRID IKLLGAKRNS NYDVRLKSVV VERISPPEDE IIVDKSLPKG AVKIEREPVQ
GLKVIVYRET YENNRLIERE KISEDIYKPV QGLKRVGPYD SNDTEGGEET VKEEAMEEQT
I