Gene Mext_1856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1856 
Symbol 
ID5831624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2075146 
End bp2076756 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content66% 
IMG OID641367655 
Productexopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 
Protein accessionYP_001639326 
Protein GI163851283 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.508023 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.726353 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCACA CTCGACGACC GGGGAGGCTC GATGTTTTCA CGTGGAACAT TCCGATGAGC 
GCCATCGACG TTCGGGATCT GCTGAAGGTC GCGAGGGAAA GCGGCGCACT CACATCCGTG
CCGACGCCGC TCGTGCTCCA CAGCGACGAG AATTCTGATT CCGCGCCGTC CGCGTCGAAC
CCCGCGCCGA CCCCACCGAA AGGGGCTTGG CTGTCGCCGG TCGTGCTCGC CGGTTGCGTG
CGGCTCGCCG AATTCTGCGG CTTGATCCTG CTCGGTTTGG CTCTGCATCA GGCCTTGCTG
CGCGGCGTCG TGCCCCTCGC CCCGCGCTAC CACGCCGCCA TCTTGGCGGT GACCCTTGCG
GCGCTCGGAC TGTTTCAGGC TTCGGGCAGT TACCGGATCA GCGCATTCCG CGATCTGCCG
AGAACGGCGG TGAAGCTCGC CACCGGCTGG TCCATCGCGT TCCTGATGGT GGCCGCGGCC
ATGGTGCTCG CCAAGGTCGC CGATCATTAC TCCCGGATCT GGCTGCTCAG CTACTACATG
GCCGGCCTTG GCATCCTGCT CGGCGGCCGC GCGGCGCTCT CGGCCTTCGT CCGCCTGCAG
ATGGCCAAGG GGCGCTTCGA CCGTCGCACC GCGATCGTCG GCGGCGGACC GGCGGCCGTG
GAACTGATCC ATGCCCTGGA AGCGAGCGGC GACAACGGCA TCCGCATCAT CGGGATCTTC
GATGACCGGG GCGACGACCG ATCCAGCACG GACGTCGCCG GCTACCCCAA GCTCGGCAAT
GTCAGCGACC TCGTCACCTA TGCCCGCCAC GCGCCCGTCG ATCTCGTGGT GTTCACCCTG
CCGATCTCGG CCGAGACGCG CATCCTGCAG ATGCTCGCCA AGCTCTCGGT TCTGCCGGTC
GATATCCGCC TCTCAGCCCA TGCGACCAAG CTGCGCCTGC GCCCGCGCGC CTATTCCTAT
CTCGGCGGCG TGCCGCTGCT CGACGTCTTC GACAAGCCGC TAGCCGATTG GGACGTCATC
CTGAAGGGCG CGTTCGACCG CGTCGTCGGC CTGCTGCTGC TGCTGGCCCT CTCACCGGCG
ATGATCGCTG TGGCGCTCGC GGTGAAGCTC ACTTCGCCGG GGCCGGTGCT GTTCCGGCAG
AAGCGCTACG GCTTCAACAA CGAGCTCATC GAGATCTTCA AGTTCCGCTC GATGTACGTC
GATCTCTGCG ACGCGGGCGC ATCGCAGCTC GTCACCAAGA CCGATGCCCG GGTGACGCCC
GTGGGCCGCT TCATCCGCAA GACATCGCTG GACGAGCTAC CTCAGCTATT CAACGTGATC
CGCGGCGATC TCTCGCTGGT CGGGCCGCGC CCGCATGCGG TCCAGGCCAA GGCGGCGAAC
ACCCTCTACG ATCAGGTGGT GGACGGGTAC TTCGCCCGCC ACAAGGTCAA ACCCGGCATC
ACCGGCTGGG CGCAGATCAA TGGCTGGCGC GGCGAGACCG ACACCAGCGA GAAGCTCCAG
CGCCGGGTGG AGCACGACCT GCACTACATC GAGAATTGGT CGATCCTGTT CGACCTCAAG
ATCCTGCTCA CCACGCCGCT CGCGCTCTTC AAGACCGACA ACGCGTATTG A
 
Protein sequence
MRHTRRPGRL DVFTWNIPMS AIDVRDLLKV ARESGALTSV PTPLVLHSDE NSDSAPSASN 
PAPTPPKGAW LSPVVLAGCV RLAEFCGLIL LGLALHQALL RGVVPLAPRY HAAILAVTLA
ALGLFQASGS YRISAFRDLP RTAVKLATGW SIAFLMVAAA MVLAKVADHY SRIWLLSYYM
AGLGILLGGR AALSAFVRLQ MAKGRFDRRT AIVGGGPAAV ELIHALEASG DNGIRIIGIF
DDRGDDRSST DVAGYPKLGN VSDLVTYARH APVDLVVFTL PISAETRILQ MLAKLSVLPV
DIRLSAHATK LRLRPRAYSY LGGVPLLDVF DKPLADWDVI LKGAFDRVVG LLLLLALSPA
MIAVALAVKL TSPGPVLFRQ KRYGFNNELI EIFKFRSMYV DLCDAGASQL VTKTDARVTP
VGRFIRKTSL DELPQLFNVI RGDLSLVGPR PHAVQAKAAN TLYDQVVDGY FARHKVKPGI
TGWAQINGWR GETDTSEKLQ RRVEHDLHYI ENWSILFDLK ILLTTPLALF KTDNAY