Gene Mext_1044 details

Gene Information

Locus tag: Mext_1044
Symbol:
ID: 5833665
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 1137797
End bp: 1139302
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 69%
IMG OID: 641366839
Product: exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
Protein accession: YP_001638520
Protein GI: 163850477
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
            [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
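
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1137797
end_bp = 1139302
gene_length_bp = 1506
protein_length_aa = 501

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and contributes no residue.
residues = codons - 1

print(span_bp, codons, residues)  # 1506 502 501
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.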


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.996068
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.0474242
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTCACG AGCGCCCCCC TCAGCAGTAC CAGCGGCTGC CCGGCGACGG GGTAAGCCGC 
TCTGTCTGGG CCTCGCTTCT GCCCAGGCGG CGGCGCATGG CCCTGCGCGT GGGCATCTCG
GCCTCCCTGC TGGCGGCTGA TCTCGTGGCG ATCTTCGCGG TCGGCTTCGC GGCGGATGTG
GCCTACCACG CCTATATCGG CGACGGGGAG TTGATCCCGC TCACGAACAG CATGAACCTT
CAGACGGCCG GCTTCCTGTC TCTGATCTTC GTGCTTACCA ACCTCGCCCG CGGCGAATAC
AGCATCGAGC GCTGCCTGTC GCAGACGCCG CATCTGCAGC GCCGGGCGAC GCTCTGGCTC
ATGGCCTGGG CGGTCGCCCT GCTGGTCGGC TTCGCGACGA AGACGACTCA GGACTTCTCC
CGCGTCGCCT CGGTCGCCTT CTTCCTGGCC GGCCTGCCCG TCACGATCCT CGTCCGCGCG
GGGACGGTGG CCATGGTACG CCGCAGCAGC ACCTCCGGCT CGCCCTCCGC CAGCCGGGTC
CACCTCGTCG GCTACGAGGA GGACGTGACG AACTTCTACG CCAACAACGA TGTCGAGGCG
CTGGGATTGC GCATCGTTGG GACGAGCTAC CTGCGCCGCC CCGAGCCCGC CTCCGGCACG
GGGAACGCGG AGAGTTTGCT CGCCGAGGAT CTCGACCTCG CGGTCTCGGT GGTGCGGTTC
CTGCGGCCCG ACGACGTGTT CGTGCTGGTG CCGTGGTCGG AGCCCGCCGA CATCGAGCGC
TGCATCGACG CCTTCCTGCG GGTGCCGGCC GCCCTGCACC TGCGGCCCGG CACGATGATG
GATCGCTTCC CCGACCTGCA GGTCGCCCGG GTCGGCCGGC TCTCGGGCAT CAATATCGGC
CGCCGCCCGC TCTCGGTCGG CGAGATCCTG CTCAAGCGCG CCTTCGACGT GACGCTGGCC
GGGATCGGGC TACTGCTGCT CGCGCCGCTC TTCGTCGCAC TCGCGGTGCT GATCAAGCTC
GACAGCCCCG GCCCGGTCTT CTTCCGGCAG CGGCGCTACG GCTTCAACCA GGAGGCCTTC
GGCGTCTTCA AGTTCCGTAG CATGAAGGCC GCCCCCGACG CCCCCTTCCG GCAGGCCTCG
CGCAACGACG AGCGCATCAC CCGGATCGGC GCCCTGCTGC GCCGGACCAA CCTCGACGAG
TTGCCGCAGC TCCTGAACGT GATCCGGGGC GACATGTCGC TCGTCGGCCC CCGCCCGCAC
GCGCTGGCGC ATGACCGCAG CTTCGAGCGC CGCATCGCCC TCTACGCCCG CCGCCACAAC
GTGAAGCCGG GCATCACCGG CTGGGCGCAG GTGAACGGCT TTCGGGGCGA GACCCTGACC
GACGCGGCGA TGGAGAGCCG CGTCCAGGCC GATCTGCACT ACATCGACAA CTGGTCGCTC
TGGCTCGACA TCACGATCCT GTTCCGGACG ATCGCCTCGC CGCGCGCCTA CCGTAACGCG
TGCTGA
 
Protein sequence
MFHERPPQQY QRLPGDGVSR SVWASLLPRR RRMALRVGIS ASLLAADLVA IFAVGFAADV 
AYHAYIGDGE LIPLTNSMNL QTAGFLSLIF VLTNLARGEY SIERCLSQTP HLQRRATLWL
MAWAVALLVG FATKTTQDFS RVASVAFFLA GLPVTILVRA GTVAMVRRSS TSGSPSASRV
HLVGYEEDVT NFYANNDVEA LGLRIVGTSY LRRPEPASGT GNAESLLAED LDLAVSVVRF
LRPDDVFVLV PWSEPADIER CIDAFLRVPA ALHLRPGTMM DRFPDLQVAR VGRLSGINIG
RRPLSVGEIL LKRAFDVTLA GIGLLLLAPL FVALAVLIKL DSPGPVFFRQ RRYGFNQEAF
GVFKFRSMKA APDAPFRQAS RNDERITRIG ALLRRTNLDE LPQLLNVIRG DMSLVGPRPH
ALAHDRSFER RIALYARRHN VKPGITGWAQ VNGFRGETLT DAAMESRVQA DLHYIDNWSL
WLDITILFRT IASPRAYRNA C
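
The gene and protein sequences above are related by translation table 11. The sketch below shows one way to check that relationship and to compute a GC fraction. It assumes that table 11 assigns codons to amino acids exactly as the standard code does (it differs in the start codons it permits) and that the gene starts with ATG, as it does here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, not part of any database tooling.

```python
from itertools import product

# Codon-to-amino-acid assignments of the standard code. Assumption: these are
# shared by translation table 11, which differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTTTCACG AGCGCCCCCC TCAGCAGTAC CAGCGGCTGC CCGGCGACGG GGTAAGCCGC"
last_line = "TGCTGA"

print(translate(first_line))  # MFHERPPQQYQRLPGDGVSR
print(translate(last_line))   # C
```

The first 20 residues and the final residue agree with the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.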