Gene Mext_4130 details


Gene Information

Locus tag: Mext_4130
Symbol:
ID: 5833621
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 4597733
End bp: 4599256
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 73%
IMG OID: 641369920
Product: exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
Protein accession: YP_001641570
Protein GI: 163853527
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP
            [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
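
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(4597733, 4599256)
print(length, protein_length_aa(length))  # 1524 507
```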


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.800255
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGACG TCGATGTCGC AAATACCAGT CTCCTGCCGG CCGGCGGAAC CGGCTGGCAG 
GGACGACTTC GAGAGCGTCT CAAGGAGCGT CTCGGACCCG TCCGGTCCGG GCGCGCCGAG
ACGATCGCCG AGGCACGGCG CCGGATCGTC ACCGGTGCCC TGCTCGCGGG CGACACCGTC
GCCGTGCTGG TCGCCTGCGG GGGAAGCCTG CTCGTGATGG CAGCGGCCGG AGCAGCCGCC
GGCCTCGCGC CGGCCATGCT GACCGGCTGG TGCGCCCTGC AGATCTGCGC ACTGGCCCTG
TGCGGCCTCT ACGAACCGAT CGAGGCGGAG CCGATCGAGC GGCTCCGCCG CCGCGGCCTG
GCCGCCGCCC TCGCGTTCGC CGCCGCCATC CTGGTCGGCG GCGGGCCCTG GTCCTGGCCG
TGGTCCTGGG CCGCCACCGG GATCGCCGCC CTTCTCTCCA TCCCGCTTGG GCATTACGCG
GAGGGGTTCG TGCGTGCGCG TCTCGTCCGG CGGGGCGTGT GGGGGGCGGC GACCATCGTC
TACGGCGAGG GTGCCACCGA GCTGGCCCGC AGCCTCGCCG CGCGGCCGGA ACTCGGGCTC
CGACCGATCG GCATCGTCCG CGCGGCCGAT CAAGTGGTCG AGCCCTTCCG CACCGTCGTG
TCGCCGGGGC CAGGAGAGGA CACCGAGCGA GCGGCCGAGC GCATGGCGAG CCTGCTGGAG
GCGGCCGAGG TCGCCATCTG CACGCCCGGC GAACGCGAGC CCACCCGCTT CGCCTGGCTC
ACCCGGCACC CGTTCCGGCA GGTGCTCGTG GCGCACCATG CGCCGGAGGT GGAGACCGTG
CGCCTCAAGA CCCGCTGCCT CGGTCCCGTG GTCGGGCTCG TGGTGCGCCG GGCGATCTTC
CTGCCGCACA ACCTGCGGCT CAAGCGGGCG CTCGACCTTG CCGTGACGGT GCCGGGCCTT
CTCGTCTGCG GACCGCTGAT CGGTGTGCTG GCGCTCGCGG TGAAGATCGC TGATCCAGGC
CCGGCCTTCT ACGTCCAGCC CCGCGTCGGG CGGGATGGCC GCACCATCCG CGTGTACAAG
CTGCGCAGCA TGTTCCGCGA CGCGGAGGCG CGGCTCGCCG AGCATCTGGC CGCCGACGAG
GCCGCCCGGC GCGAATGGGA CCGGTTCTGC AAGCTGCGCA ACGATCCGCG CGTCCTGCCC
GGCATCGGCG GCTTCATCCG GCGCACCAGC CTCGACGAGC TGCCCCAGCT CCTCAACGTG
CTGCGCGGCG ACATGAGCGT GGTCGGGCCC CGCCCCTTCC CCGCCTACCA CACCGAGCGG
TTCGGCCCGG CCTTCCAGGC GCTGCGGGCG AGCGTGCCGC CGGGCCTGAC CGGACTGTGG
CAGATCTCGG CCCGCAGCGA CGGCGACCTC GCGGTGCAGG AGCAGCAGGA CAGCTTCTAC
ATCCGCAACT GGTCGATCTG GACCGACCTG TACATCCTTC TCGAGACGGT GCCGGCGGTG
CTCAGCGCCA AGGGCGCCCG CTGA
 
Protein sequence
MPDVDVANTS LLPAGGTGWQ GRLRERLKER LGPVRSGRAE TIAEARRRIV TGALLAGDTV 
AVLVACGGSL LVMAAAGAAA GLAPAMLTGW CALQICALAL CGLYEPIEAE PIERLRRRGL
AAALAFAAAI LVGGGPWSWP WSWAATGIAA LLSIPLGHYA EGFVRARLVR RGVWGAATIV
YGEGATELAR SLAARPELGL RPIGIVRAAD QVVEPFRTVV SPGPGEDTER AAERMASLLE
AAEVAICTPG EREPTRFAWL TRHPFRQVLV AHHAPEVETV RLKTRCLGPV VGLVVRRAIF
LPHNLRLKRA LDLAVTVPGL LVCGPLIGVL ALAVKIADPG PAFYVQPRVG RDGRTIRVYK
LRSMFRDAEA RLAEHLAADE AARREWDRFC KLRNDPRVLP GIGGFIRRTS LDELPQLLNV
LRGDMSVVGP RPFPAYHTER FGPAFQALRA SVPPGLTGLW QISARSDGDL AVQEQQDSFY
IRNWSIWTDL YILLETVPAV LSAKGAR
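
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any computation. A minimal sketch of how one might do that and compute a GC fraction is shown below; the helper names are hypothetical, and the example uses only the first line of the gene sequence, so its result is not expected to match the whole-gene GC content listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCCGGACG TCGATGTCGC AAATACCAGT CTCCTGCCGG CCGGCGGAAC CGGCTGGCAG"
seq = clean_sequence(first_line)
print(len(seq), seq[:3], gc_fraction(seq))
```

Pasting the full gene sequence into `clean_sequence` in place of `first_line` would allow the same check against the listed gene length and GC content.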