Gene Mext_4156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4156 
Symbol 
ID5833121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4623617 
End bp4625002 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content73% 
IMG OID641369946 
ProductUDP-N-acetylglucosamine pyrophosphorylase 
Protein accessionYP_001641596 
Protein GI163853553 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.952309 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.525681 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCAG GCGCGAACCA CGGGGCGGAG AGCGGCTTCA CCGCGATCGT GCTGGCGGCC 
GGCAAGGGCA CGCGAATGCG CTCCGACCGG CCCAAGGTGC TTCACGCGCT GGCCAACCGC
TCGATGCTCG GCCACGTGCT TGCCGCGGTG CAGGAGGCGG GCGCGGCCCG CCTTGCCGTG
GTGGTCGAGC CGGGCCGCGA GGACGTCGTC CGCGAGATCG AACGGCTGGC GCCCGGCGCT
GGCATCCATC CGCAGGCCGA GCGCCTCGGC ACCGCCCACG CGGTGCTCGC TGCCCGCGCA
TCCCTGGAGG ACGGGCAGGA CGTGCTCGTG GCCTTCGGCG ACACGCCCCT CGTCACGGCC
GAAACCTATG CCCGCCTGCG CGCGCCGTTG CGCGAGGGCG CAGCGGTGGC GGTGCTGGCC
TTCGAGGCCG CCGACCCCAC CGGTTACGGG CGCGTGCTGA CGGAAGGGGG CCGTGTCCTG
GCGATCCGCG AGGAGAAGGA CGCCTCGCAG GAGGAGCGGG TGGTGCGCCT GTCCAATGCC
GGGCTGATGG CGCTGTCGGG CGCGCACGCC CTGTCGCTGC TGGAGCGGAT CGGCAACGAC
AACGCCAACC GCGAATACTA CCTAACCGAC GCGGTGGCGC TCGCCGCGGG CGACGGCCTC
TCCGTCGCCG TGGTGCCCGT GGACGAGGCG GAGGCGCAGG GCGTCAACGA CCGTGTGCAG
CTCAGCCAGG CCGAGGCCAC GATCCAGGCG CGCCTGCGCC GGGCGGCCCA GCTCGGCGGG
GCGACGCTGA TCGCGCCCGA GACGGTGTTC TTCAGCGTCG ACACGATCCT TGGACGCGAC
GTCGTCGTCG AGCCGCACTG CGTGTTCGGC CCCGGCGTGG TCATCGGCGA CGGCTGCACC
ATCCGCGCCT TCTCGCACCT GCACGACGCC CGACTGATGG AGGGCGCCGA TATCGGCCCG
CATGTGCGCT TGCGCGGCGG TGCGGTACTG GAGGCGGGCG TCCATCTCGG CAACTTCGTC
GAGATCAAGA ACGCGACCCT GCATGCGGGC GCCAAGGCCT CGCACCTGAC CTATCTCGGT
GACGCCGAGA TCGGAGCGGG CGCCAATATC GGCGCGGGTA CCATCACCTG CAATTACGAC
GGCGTGTCGA AGCACCGCAC GCTCATCGGC GAGGGCGCCT TCATCGGCTC GAATTCGGCG
CTGGTGGCGC CGGTCAGCGT CGGCGCGGGC GCGCTGGTCG GGGCCGGCTC GGTCATCACC
CGCGACGTGC CGGCGGACGC GCTCGCCGTC GCGCGGGGGC GGCAGATCAC CCGCGAGGGA
GCGGCCAAGA CCCTGCGTCA GACGCTGAAG GCTGCCAAGG CGGCCCGCGA GGCGAAGAAG
AGCTGA
 
Protein sequence
MTAGANHGAE SGFTAIVLAA GKGTRMRSDR PKVLHALANR SMLGHVLAAV QEAGAARLAV 
VVEPGREDVV REIERLAPGA GIHPQAERLG TAHAVLAARA SLEDGQDVLV AFGDTPLVTA
ETYARLRAPL REGAAVAVLA FEAADPTGYG RVLTEGGRVL AIREEKDASQ EERVVRLSNA
GLMALSGAHA LSLLERIGND NANREYYLTD AVALAAGDGL SVAVVPVDEA EAQGVNDRVQ
LSQAEATIQA RLRRAAQLGG ATLIAPETVF FSVDTILGRD VVVEPHCVFG PGVVIGDGCT
IRAFSHLHDA RLMEGADIGP HVRLRGGAVL EAGVHLGNFV EIKNATLHAG AKASHLTYLG
DAEIGAGANI GAGTITCNYD GVSKHRTLIG EGAFIGSNSA LVAPVSVGAG ALVGAGSVIT
RDVPADALAV ARGRQITREG AAKTLRQTLK AAKAAREAKK S