Gene Mchl_2191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_2191 
Symbol 
ID7116137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp2288727 
End bp2290283 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content66% 
IMG OID643524941 
Productexopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 
Protein accessionYP_002420966 
Protein GI218530150 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.894429 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.147587 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCA TCGACGTTCG GGATCTGCTG AAGGTCGCGA GGGAGAACGG CGCACTCACA 
TCCGTGCCGA CGCCGCTCGT GCTCCACAGC GACGAGAATT CTGATTCCGC GCCGTCCGCG
TCGAACCCTG CGCCAACCCC ACCGAAAGGG GCTTGGCTGT CGCCGGTCGT GCTCGCCGGT
TGCGTGCGGC TGGCTGAATT CTGCGGCTTA ATCCTGCTTG GCTTGGCCCT GCATCAGGCA
TTGCTGCGCG GTGTCGTGCC CCTCGCCCCG CGCTACCACG CCGCCATTCT GGCGGTGACC
CTGGCGGCGC TCGGACTTTT CCAGGCTTCG GGCAGTTACC GGATCAGCGC ATTCCGCGAT
CTGCCGAGAA CGGCGGTGAA GCTCGCCACC GGCTGGTCCA TCGCGTTCCT GATGGTGGCC
GCGGCCATGG TGCTGGCCAA GGTCGCCGAC CATTACTCCC GGATCTGGCT GCTCAGCTAC
TACATGGCCG GCCTCGGCAT CCTGCTCGGC GGACGCGCGG CGCTTTCAGC CTTCGTGCGT
ATGCAAATGG CCAAGGGGCG CTTCGACCGC CGTACCGCGA TCGTCGGCGG CGGGCCCGCG
GCCGTGGAAC TGATCCATGC CCTGGAAGCG AGCGGCGATA ACGGCATCCG CATCATCGGG
ATCTTCGATG ACCGGGGCGA CGACCGATCC AGCACGGACG TCGCCGGCTA CCCCAAGCTC
GGCAATGTCA GCGACCTCGT CACCTATGCC CGCCACGCGC CCGTCGATCT CGTGGTGTTC
ACCCTGCCGA TCTCGGCCGA GACGCGCATC CTGCAGATGC TCGCCAAGCT CTCGGTTCTG
CCGGTCGATA TCCGCCTCTC AGCCCATGCG ACCAAGCTGC GCCTGCGCCC GCGCGCCTAT
TCCTATCTCG GCGGCGTGCC GCTGCTCGAC GTGTTCGACA AGCCGCTGGC CGATTGGGAC
GTTATCCTGA AGGGCGCGTT CGACCGCGTC GTCGGCCTGC TGCTGCTACT GGGTCTCTCA
CCGGCGATGA TCGCCGTGGC GCTCGCGGTG AAGCTCACCT CGCCGGGGCC TGTGCTGTTC
CGGCAGAAGC GCTACGGCTT CAACAACGAG CTGATCGAGA TTTTCAAGTT CCGCTCGATG
TATGTCGATC TCTGCGACGC GGGCGCATCG CAGCTCGTCA CCAAGACCGA TGCCCGGGTG
ACGCCCGTGG GCCGCTTCAT CCGCAAGACA TCGCTGGACG AGCTACCGCA GCTGTTCAAC
GTGATCCGCG GCGATCTCTC GCTGGTCGGG CCGCGCCCGC ATGCGGTCCA GGCCAAAGCG
GCGAACACCC TCTATGATCA GGTGGTGGAC GGGTACTTCG CCCGCCACAA GGTCAAACCC
GGCATCACCG GCTGGGCGCA GATCAATGGC TGGCGCGGCG AGACCGACAC CAGCGAGAAG
CTCCAGCGCC GGGTCGAGCA CGACCTGCAC TACATCGAGA ATTGGTCGAT CCTGTTCGAC
CTCAAGATCC TGCTCACCAC GCCGCTCGCG CTCTTTAAGA CCGACAACGC GTATTGA
 
Protein sequence
MSAIDVRDLL KVARENGALT SVPTPLVLHS DENSDSAPSA SNPAPTPPKG AWLSPVVLAG 
CVRLAEFCGL ILLGLALHQA LLRGVVPLAP RYHAAILAVT LAALGLFQAS GSYRISAFRD
LPRTAVKLAT GWSIAFLMVA AAMVLAKVAD HYSRIWLLSY YMAGLGILLG GRAALSAFVR
MQMAKGRFDR RTAIVGGGPA AVELIHALEA SGDNGIRIIG IFDDRGDDRS STDVAGYPKL
GNVSDLVTYA RHAPVDLVVF TLPISAETRI LQMLAKLSVL PVDIRLSAHA TKLRLRPRAY
SYLGGVPLLD VFDKPLADWD VILKGAFDRV VGLLLLLGLS PAMIAVALAV KLTSPGPVLF
RQKRYGFNNE LIEIFKFRSM YVDLCDAGAS QLVTKTDARV TPVGRFIRKT SLDELPQLFN
VIRGDLSLVG PRPHAVQAKA ANTLYDQVVD GYFARHKVKP GITGWAQING WRGETDTSEK
LQRRVEHDLH YIENWSILFD LKILLTTPLA LFKTDNAY