Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4156 |
Symbol | |
ID | 5833121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4623617 |
End bp | 4625002 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641369946 |
Product | UDP-N-acetylglucosamine pyrophosphorylase |
Protein accession | YP_001641596 |
Protein GI | 163853553 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) |
TIGRFAM ID | [TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.952309 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.525681 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCAG GCGCGAACCA CGGGGCGGAG AGCGGCTTCA CCGCGATCGT GCTGGCGGCC GGCAAGGGCA CGCGAATGCG CTCCGACCGG CCCAAGGTGC TTCACGCGCT GGCCAACCGC TCGATGCTCG GCCACGTGCT TGCCGCGGTG CAGGAGGCGG GCGCGGCCCG CCTTGCCGTG GTGGTCGAGC CGGGCCGCGA GGACGTCGTC CGCGAGATCG AACGGCTGGC GCCCGGCGCT GGCATCCATC CGCAGGCCGA GCGCCTCGGC ACCGCCCACG CGGTGCTCGC TGCCCGCGCA TCCCTGGAGG ACGGGCAGGA CGTGCTCGTG GCCTTCGGCG ACACGCCCCT CGTCACGGCC GAAACCTATG CCCGCCTGCG CGCGCCGTTG CGCGAGGGCG CAGCGGTGGC GGTGCTGGCC TTCGAGGCCG CCGACCCCAC CGGTTACGGG CGCGTGCTGA CGGAAGGGGG CCGTGTCCTG GCGATCCGCG AGGAGAAGGA CGCCTCGCAG GAGGAGCGGG TGGTGCGCCT GTCCAATGCC GGGCTGATGG CGCTGTCGGG CGCGCACGCC CTGTCGCTGC TGGAGCGGAT CGGCAACGAC AACGCCAACC GCGAATACTA CCTAACCGAC GCGGTGGCGC TCGCCGCGGG CGACGGCCTC TCCGTCGCCG TGGTGCCCGT GGACGAGGCG GAGGCGCAGG GCGTCAACGA CCGTGTGCAG CTCAGCCAGG CCGAGGCCAC GATCCAGGCG CGCCTGCGCC GGGCGGCCCA GCTCGGCGGG GCGACGCTGA TCGCGCCCGA GACGGTGTTC TTCAGCGTCG ACACGATCCT TGGACGCGAC GTCGTCGTCG AGCCGCACTG CGTGTTCGGC CCCGGCGTGG TCATCGGCGA CGGCTGCACC ATCCGCGCCT TCTCGCACCT GCACGACGCC CGACTGATGG AGGGCGCCGA TATCGGCCCG CATGTGCGCT TGCGCGGCGG TGCGGTACTG GAGGCGGGCG TCCATCTCGG CAACTTCGTC GAGATCAAGA ACGCGACCCT GCATGCGGGC GCCAAGGCCT CGCACCTGAC CTATCTCGGT GACGCCGAGA TCGGAGCGGG CGCCAATATC GGCGCGGGTA CCATCACCTG CAATTACGAC GGCGTGTCGA AGCACCGCAC GCTCATCGGC GAGGGCGCCT TCATCGGCTC GAATTCGGCG CTGGTGGCGC CGGTCAGCGT CGGCGCGGGC GCGCTGGTCG GGGCCGGCTC GGTCATCACC CGCGACGTGC CGGCGGACGC GCTCGCCGTC GCGCGGGGGC GGCAGATCAC CCGCGAGGGA GCGGCCAAGA CCCTGCGTCA GACGCTGAAG GCTGCCAAGG CGGCCCGCGA GGCGAAGAAG AGCTGA
|
Protein sequence | MTAGANHGAE SGFTAIVLAA GKGTRMRSDR PKVLHALANR SMLGHVLAAV QEAGAARLAV VVEPGREDVV REIERLAPGA GIHPQAERLG TAHAVLAARA SLEDGQDVLV AFGDTPLVTA ETYARLRAPL REGAAVAVLA FEAADPTGYG RVLTEGGRVL AIREEKDASQ EERVVRLSNA GLMALSGAHA LSLLERIGND NANREYYLTD AVALAAGDGL SVAVVPVDEA EAQGVNDRVQ LSQAEATIQA RLRRAAQLGG ATLIAPETVF FSVDTILGRD VVVEPHCVFG PGVVIGDGCT IRAFSHLHDA RLMEGADIGP HVRLRGGAVL EAGVHLGNFV EIKNATLHAG AKASHLTYLG DAEIGAGANI GAGTITCNYD GVSKHRTLIG EGAFIGSNSA LVAPVSVGAG ALVGAGSVIT RDVPADALAV ARGRQITREG AAKTLRQTLK AAKAAREAKK S
|
| |