Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4130 |
Symbol | |
ID | 5833621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4597733 |
End bp | 4599256 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641369920 |
Product | exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
Protein accession | YP_001641570 |
Protein GI | 163853527 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.800255 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGACG TCGATGTCGC AAATACCAGT CTCCTGCCGG CCGGCGGAAC CGGCTGGCAG GGACGACTTC GAGAGCGTCT CAAGGAGCGT CTCGGACCCG TCCGGTCCGG GCGCGCCGAG ACGATCGCCG AGGCACGGCG CCGGATCGTC ACCGGTGCCC TGCTCGCGGG CGACACCGTC GCCGTGCTGG TCGCCTGCGG GGGAAGCCTG CTCGTGATGG CAGCGGCCGG AGCAGCCGCC GGCCTCGCGC CGGCCATGCT GACCGGCTGG TGCGCCCTGC AGATCTGCGC ACTGGCCCTG TGCGGCCTCT ACGAACCGAT CGAGGCGGAG CCGATCGAGC GGCTCCGCCG CCGCGGCCTG GCCGCCGCCC TCGCGTTCGC CGCCGCCATC CTGGTCGGCG GCGGGCCCTG GTCCTGGCCG TGGTCCTGGG CCGCCACCGG GATCGCCGCC CTTCTCTCCA TCCCGCTTGG GCATTACGCG GAGGGGTTCG TGCGTGCGCG TCTCGTCCGG CGGGGCGTGT GGGGGGCGGC GACCATCGTC TACGGCGAGG GTGCCACCGA GCTGGCCCGC AGCCTCGCCG CGCGGCCGGA ACTCGGGCTC CGACCGATCG GCATCGTCCG CGCGGCCGAT CAAGTGGTCG AGCCCTTCCG CACCGTCGTG TCGCCGGGGC CAGGAGAGGA CACCGAGCGA GCGGCCGAGC GCATGGCGAG CCTGCTGGAG GCGGCCGAGG TCGCCATCTG CACGCCCGGC GAACGCGAGC CCACCCGCTT CGCCTGGCTC ACCCGGCACC CGTTCCGGCA GGTGCTCGTG GCGCACCATG CGCCGGAGGT GGAGACCGTG CGCCTCAAGA CCCGCTGCCT CGGTCCCGTG GTCGGGCTCG TGGTGCGCCG GGCGATCTTC CTGCCGCACA ACCTGCGGCT CAAGCGGGCG CTCGACCTTG CCGTGACGGT GCCGGGCCTT CTCGTCTGCG GACCGCTGAT CGGTGTGCTG GCGCTCGCGG TGAAGATCGC TGATCCAGGC CCGGCCTTCT ACGTCCAGCC CCGCGTCGGG CGGGATGGCC GCACCATCCG CGTGTACAAG CTGCGCAGCA TGTTCCGCGA CGCGGAGGCG CGGCTCGCCG AGCATCTGGC CGCCGACGAG GCCGCCCGGC GCGAATGGGA CCGGTTCTGC AAGCTGCGCA ACGATCCGCG CGTCCTGCCC GGCATCGGCG GCTTCATCCG GCGCACCAGC CTCGACGAGC TGCCCCAGCT CCTCAACGTG CTGCGCGGCG ACATGAGCGT GGTCGGGCCC CGCCCCTTCC CCGCCTACCA CACCGAGCGG TTCGGCCCGG CCTTCCAGGC GCTGCGGGCG AGCGTGCCGC CGGGCCTGAC CGGACTGTGG CAGATCTCGG CCCGCAGCGA CGGCGACCTC GCGGTGCAGG AGCAGCAGGA CAGCTTCTAC ATCCGCAACT GGTCGATCTG GACCGACCTG TACATCCTTC TCGAGACGGT GCCGGCGGTG CTCAGCGCCA AGGGCGCCCG CTGA
|
Protein sequence | MPDVDVANTS LLPAGGTGWQ GRLRERLKER LGPVRSGRAE TIAEARRRIV TGALLAGDTV AVLVACGGSL LVMAAAGAAA GLAPAMLTGW CALQICALAL CGLYEPIEAE PIERLRRRGL AAALAFAAAI LVGGGPWSWP WSWAATGIAA LLSIPLGHYA EGFVRARLVR RGVWGAATIV YGEGATELAR SLAARPELGL RPIGIVRAAD QVVEPFRTVV SPGPGEDTER AAERMASLLE AAEVAICTPG EREPTRFAWL TRHPFRQVLV AHHAPEVETV RLKTRCLGPV VGLVVRRAIF LPHNLRLKRA LDLAVTVPGL LVCGPLIGVL ALAVKIADPG PAFYVQPRVG RDGRTIRVYK LRSMFRDAEA RLAEHLAADE AARREWDRFC KLRNDPRVLP GIGGFIRRTS LDELPQLLNV LRGDMSVVGP RPFPAYHTER FGPAFQALRA SVPPGLTGLW QISARSDGDL AVQEQQDSFY IRNWSIWTDL YILLETVPAV LSAKGAR
|
| |