Gene Moth_1638 details

Gene Information

Locus tag: Moth_1638
Symbol:
ID: 3831267
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1674153
End bp: 1675463
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 62%
IMG OID: 637829563
Product: VanW
Protein accession: YP_430483
Protein GI: 83590474
COG category: [V] Defense mechanisms
COG ID: [COG2720] Uncharacterized vancomycin resistance protein
TIGRFAM ID:
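
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Neither convention is stated in this record, although both agree with the numbers given. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table for Moth_1638.
length_bp = gene_length(1674153, 1675463)
print(length_bp, protein_length(length_bp))  # prints: 1311 436
```

Under these assumptions the result matches the listed Gene Length (1311 bp) and Protein Length (436 aa).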


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.2815
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAGGT GGAAGCTGGG GGTTCTGTTA TTCTTCTTGT TGGGGTTGGC CCTCCTGGCC 
CTGGCAAGAT TTTATTTTTC CGGCCGGATT TTACCGGGGG TGGCCGTCGC CGGCCGCCCG
GTGGGCGGTA TGGACCTGGA AAGGGCCAGA AAAGTGATAA CAGAACTGGC GGCAGAGGTT
GAGAACCGCC AGGTAAGCCT GCGCCTGGGG GAACAGGTGC TGGCTTCCAC TCCCGGCGCC
CTGGGATTGG AGGTGGACGT AGAAGCTACC CTGGCCCGGG CTTACGCCCT GGGCCGCCAG
GGACCCCTGG TTAAACGTTT GGTCCTGCTG TCTGCCCGGA AGCGCCGGGT CGAGCCGGTT
ACCCACCTGG ACCAGCAACG CCTCCAGGCA GGGCTCGAAC GCCTGGGCGG AGCCTGGCGC
AGGGAACCGG CGGATGCCCG GATAGAGATC GTTGCCGGCG GCCAGCCACG CCTGGTACCG
GCCGTGACCG GCTGGCAGGT GGACGCCGCT ATTTTAAAGT CACGCCTGGA GGAGGCCACC
ACAGGGCAAA CAATCGACAT TCCCCTGAAC CGCCTCGAAC CCCGGCTCAC AACGGCGGAA
CTGGCGGCCC GCAGGATTAC CCGGCAGGTG GCCTCCTTTA CCACCCTTTT TGATCCAGCC
GAGGCCGACC GTACCCATAA TATCCGCCTG GCGGCCGGCA CCCTGGACGG GCTGTGGCTA
CCACCAGGAG GGGAGTTTTC CTTTAACCGG ACTGTCGGGC CACGGACGCC CGACCGGGGT
TACCGGGATG CCCTGGTCGT CGAAGAGGGC AACTTTGTCC CTGGCACCGG TGGCGGCGTC
TGCCAGGTAT CTTCGACCCT CTACAATGTG GCCCTGCTGG CGGGACTGAC CATTACGGAA
CGCCAGCCCC ACGGCCTGCC CATCACCTAT GTTCCCCCGG GCCGGGATGC CACCGTAGCC
TATGGCCTGA TAGACCTGAA GTTTCGTAAT GATACCCCTT ACTGGTATTT ATTAAAGACT
GGCGTTGAAG CCGGGAAGTT GACCATGGCC TTTTATGCCG CTGATGAGGC GCCCCGGGCA
GAGGTCACCA GCCAGGTACT AGAAACCATC CCGCCCCTGG AAGAAATCGA ATGGGAACCG
GATTGGCCCC CGGGGCGGGT GGAGTTAAAA AGGGAAGGAA AACCCGGTTT CCGCACGCAA
GTAATCCGGA TCATTTACCA GGACGACAAG AAAGGGCAAC CCGAAGTAGT TTCGAGGGAT
CTCTATCCAC CCCAACCCCG AATCATCCGC AAGGGAGGCC AGGGGGGGTA G
 
Protein sequence
MARWKLGVLL FFLLGLALLA LARFYFSGRI LPGVAVAGRP VGGMDLERAR KVITELAAEV 
ENRQVSLRLG EQVLASTPGA LGLEVDVEAT LARAYALGRQ GPLVKRLVLL SARKRRVEPV
THLDQQRLQA GLERLGGAWR REPADARIEI VAGGQPRLVP AVTGWQVDAA ILKSRLEEAT
TGQTIDIPLN RLEPRLTTAE LAARRITRQV ASFTTLFDPA EADRTHNIRL AAGTLDGLWL
PPGGEFSFNR TVGPRTPDRG YRDALVVEEG NFVPGTGGGV CQVSSTLYNV ALLAGLTITE
RQPHGLPITY VPPGRDATVA YGLIDLKFRN DTPYWYLLKT GVEAGKLTMA FYAADEAPRA
EVTSQVLETI PPLEEIEWEP DWPPGRVELK REGKPGFRTQ VIRIIYQDDK KGQPEVVSRD
LYPPQPRIIR KGGQGG
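
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the pipeline that produced this record. It assumes the sequence is pasted in the 10-base block layout shown above, and it uses the standard codon assignments, which translation table 11 shares with table 1 for internal codons. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks of the block layout, then upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, ignoring any trailing partial codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence listed above.
first_line = "ATGGCCAGGT GGAAGCTGGG GGTTCTGTTA TTCTTCTTGT TGGGGTTGGC CCTCCTGGCC"
last_line = "CTCTATCCAC CCCAACCCCG AATCATCCGC AAGGGAGGCC AGGGGGGGTA G"

print(translate(first_line))  # prints: MARWKLGVLLFFLLGLALLA
print(translate(last_line))   # prints: LYPPQPRIIRKGGQGG*
```

The first line translates to the first 20 residues of the protein sequence, and the last line translates to its final 16 residues followed by the stop codon TAG. Passing the full gene sequence to `gc_percent` would give a value to compare against the listed GC content.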