Gene Mchl_5047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_5047 
Symbol 
ID7113650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp5397820 
End bp5398950 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content73% 
IMG OID643527741 
Productglycosyl transferase group 1 
Protein accessionYP_002423740 
Protein GI218532924 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.318515 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.241933 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACCG GTGCGATGCG CCCGATCGTC GTCGTGACCG GGGCGCTCGC GCCCTACACG 
CATGTTCTCT ACGAGCATCT GGCGGAACGG CTCGCCGGCC GAGACGGCCG GGTCCTGCAC
GTCCTGTCCT GCACGCCGCG CGAGAGCGCG CGGCAATGGG TGATGCCGCC GCCGCGCCTC
TACCGGCATG CCGTGCTGCC GGGCCTGCGC TGGCACCGCT CCTCGATCCG CAACCTTTAC
GTCAACCCGG CGGTGGTGCC GCGGCTCGCC GCCCTCCGTC CGGCGGCGGT GGTGCTCAAC
GACTTCTCGC CGACCATGCT GTTCGCGGCC GGCGCTGCGC GCCTGCAGCG AATCCCCACC
CTGATCCGCA CCGACGGGGT GCCCGAGACC GATCCCGGCG AGCGCTCGGC CCCGCATCGC
TGGCTGCGCC GTGCCATCGT TGCGGGGGCC ACAGCCGGGA TCGGACCGAG CGAGGGCAGC
GGCGCCGTGC TGGCCCGCTA CGGCTTGCCG GCTCCAAACT TCGTCCTGAG CCCGCTCTTT
CCGGCCTGGA CGCCGCCCGC CCCGCCCCCG CCCGACTCCG AACGGCCCTA CGACCTCCTG
TTCTGCGGCA TGCTGAACGA GGAGGTGAAG GGCGCACGCT TCTTCACCGA CGTGGTGCTC
GGCTGCTGTG CCCGCGGCCG GCGCCTGTCG GTCCGGGTCG CGGGGGACGG CCCGTTGCGG
GGGGAGATGG AGGCGCGGTT CGCGCAGGCG GGAATCTCCG TCCGCTTCGA TGGCTTCCTG
GGCCAAGAGG CGTTGCCCGC GGTCTACGCC TCGGCCCAGC TCTTCCTGTT TCCGAGCCGC
GGCGACGTGT GGGGAATCGT CGTGCAGGAG GCGCTCCAGA GCGGGACGCC GGTGCTCGCC
TCACCCCATT CCGGCGCGGC CCGTGGTCTC CTCGAAACCT ATGGCTGTGG CGAGGTGCGG
CCGATGGCGG TGGCGGATTG GGTCGATGCG ACCCTGCGGC TCCTCGAGGA TGAGGGCCGC
CGCCGCGACC TGCGCCGTGC GGCCGAGCGC GCGCTCCTGC ATTTCACGGT GGAGGCGGCC
GTCGCGGGAT ATCTCGATGC CCTCGAGCCC CTTCTGGCGG AGCGCACCTG A
 
Protein sequence
MATGAMRPIV VVTGALAPYT HVLYEHLAER LAGRDGRVLH VLSCTPRESA RQWVMPPPRL 
YRHAVLPGLR WHRSSIRNLY VNPAVVPRLA ALRPAAVVLN DFSPTMLFAA GAARLQRIPT
LIRTDGVPET DPGERSAPHR WLRRAIVAGA TAGIGPSEGS GAVLARYGLP APNFVLSPLF
PAWTPPAPPP PDSERPYDLL FCGMLNEEVK GARFFTDVVL GCCARGRRLS VRVAGDGPLR
GEMEARFAQA GISVRFDGFL GQEALPAVYA SAQLFLFPSR GDVWGIVVQE ALQSGTPVLA
SPHSGAARGL LETYGCGEVR PMAVADWVDA TLRLLEDEGR RRDLRRAAER ALLHFTVEAA
VAGYLDALEP LLAERT