Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1044 |
Symbol | |
ID | 5833665 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 1137797 |
End bp | 1139302 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641366839 |
Product | exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
Protein accession | YP_001638520 |
Protein GI | 163850477 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.996068 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0474242 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTCACG AGCGCCCCCC TCAGCAGTAC CAGCGGCTGC CCGGCGACGG GGTAAGCCGC TCTGTCTGGG CCTCGCTTCT GCCCAGGCGG CGGCGCATGG CCCTGCGCGT GGGCATCTCG GCCTCCCTGC TGGCGGCTGA TCTCGTGGCG ATCTTCGCGG TCGGCTTCGC GGCGGATGTG GCCTACCACG CCTATATCGG CGACGGGGAG TTGATCCCGC TCACGAACAG CATGAACCTT CAGACGGCCG GCTTCCTGTC TCTGATCTTC GTGCTTACCA ACCTCGCCCG CGGCGAATAC AGCATCGAGC GCTGCCTGTC GCAGACGCCG CATCTGCAGC GCCGGGCGAC GCTCTGGCTC ATGGCCTGGG CGGTCGCCCT GCTGGTCGGC TTCGCGACGA AGACGACTCA GGACTTCTCC CGCGTCGCCT CGGTCGCCTT CTTCCTGGCC GGCCTGCCCG TCACGATCCT CGTCCGCGCG GGGACGGTGG CCATGGTACG CCGCAGCAGC ACCTCCGGCT CGCCCTCCGC CAGCCGGGTC CACCTCGTCG GCTACGAGGA GGACGTGACG AACTTCTACG CCAACAACGA TGTCGAGGCG CTGGGATTGC GCATCGTTGG GACGAGCTAC CTGCGCCGCC CCGAGCCCGC CTCCGGCACG GGGAACGCGG AGAGTTTGCT CGCCGAGGAT CTCGACCTCG CGGTCTCGGT GGTGCGGTTC CTGCGGCCCG ACGACGTGTT CGTGCTGGTG CCGTGGTCGG AGCCCGCCGA CATCGAGCGC TGCATCGACG CCTTCCTGCG GGTGCCGGCC GCCCTGCACC TGCGGCCCGG CACGATGATG GATCGCTTCC CCGACCTGCA GGTCGCCCGG GTCGGCCGGC TCTCGGGCAT CAATATCGGC CGCCGCCCGC TCTCGGTCGG CGAGATCCTG CTCAAGCGCG CCTTCGACGT GACGCTGGCC GGGATCGGGC TACTGCTGCT CGCGCCGCTC TTCGTCGCAC TCGCGGTGCT GATCAAGCTC GACAGCCCCG GCCCGGTCTT CTTCCGGCAG CGGCGCTACG GCTTCAACCA GGAGGCCTTC GGCGTCTTCA AGTTCCGTAG CATGAAGGCC GCCCCCGACG CCCCCTTCCG GCAGGCCTCG CGCAACGACG AGCGCATCAC CCGGATCGGC GCCCTGCTGC GCCGGACCAA CCTCGACGAG TTGCCGCAGC TCCTGAACGT GATCCGGGGC GACATGTCGC TCGTCGGCCC CCGCCCGCAC GCGCTGGCGC ATGACCGCAG CTTCGAGCGC CGCATCGCCC TCTACGCCCG CCGCCACAAC GTGAAGCCGG GCATCACCGG CTGGGCGCAG GTGAACGGCT TTCGGGGCGA GACCCTGACC GACGCGGCGA TGGAGAGCCG CGTCCAGGCC GATCTGCACT ACATCGACAA CTGGTCGCTC TGGCTCGACA TCACGATCCT GTTCCGGACG ATCGCCTCGC CGCGCGCCTA CCGTAACGCG TGCTGA
|
Protein sequence | MFHERPPQQY QRLPGDGVSR SVWASLLPRR RRMALRVGIS ASLLAADLVA IFAVGFAADV AYHAYIGDGE LIPLTNSMNL QTAGFLSLIF VLTNLARGEY SIERCLSQTP HLQRRATLWL MAWAVALLVG FATKTTQDFS RVASVAFFLA GLPVTILVRA GTVAMVRRSS TSGSPSASRV HLVGYEEDVT NFYANNDVEA LGLRIVGTSY LRRPEPASGT GNAESLLAED LDLAVSVVRF LRPDDVFVLV PWSEPADIER CIDAFLRVPA ALHLRPGTMM DRFPDLQVAR VGRLSGINIG RRPLSVGEIL LKRAFDVTLA GIGLLLLAPL FVALAVLIKL DSPGPVFFRQ RRYGFNQEAF GVFKFRSMKA APDAPFRQAS RNDERITRIG ALLRRTNLDE LPQLLNVIRG DMSLVGPRPH ALAHDRSFER RIALYARRHN VKPGITGWAQ VNGFRGETLT DAAMESRVQA DLHYIDNWSL WLDITILFRT IASPRAYRNA C
|
| |