Gene Rsph17025_2016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2016 
SymbolglmU 
ID5082630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2058107 
End bp2059471 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content69% 
IMG OID640483578 
Productbifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase 
Protein accessionYP_001168212 
Protein GI146278053 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.6824 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGTC CGACCGTTTC GCTGATCGTT CTCGCTGCCG GACAGGGCAC CCGGATGAAC 
TCGGACCTGC CCAAGGTGCT GCACCGCGTC GGCGCCGCGC CCATGCTGCA CCATGCGCTG
CGGGCCGGCC AGTCGCTCGA GCCCGAGCGC GTGGTGGTTG TCGCAGGACA CGGCGCCGAA
CAGGTCGCCC GTGCCGCGCG GGCCTTCGAC GAGACGGTCG AGGTCGTGGT GCAGGCCGAA
CAGCTCGGAA CGGCCCATGC CGTCGCCCAG GCGGCCCCGC TTCTGGCCGA TGCGCCGGGA
GAGGTGGTGG TGCTCTATGG CGACACGCCC TTCATCCGGC CCGAGACGCT CGAGCGGATG
ATCGAGATGC GGGCGCGTCA TGCGGTGGTC GTGCTGGGCT TCGAGGCCGC CGACCCCGGC
CGCTACGGCC GCCTCGTCAC CCGCGGCGAA GACCTCGACC GGATCGTGGA ATGGAAGGAC
GCCACGGATG AAGAGCGCGC GATAACCCTC TGCAATTCCG GCGTGATCTG CGCGGAAACG
CATCTTCTCT TTTCGCTCGT CTCGGAAGTG GGCAACGCCA ATGCCGCGGG CGAATATTAC
CTGACGGACG TGGTGGCCCT CGCGCGGGCC AAGGGCCTGT CGGCCGGCGT GGCCATCTGC
GACGAGGCCG AGACGCTCGG CGTGAACACC CGCGCGCAAC TGGCTGCCGC CGAAGCGGAA
TTCCAGAGGC GGGCCCGCGC CGCGGCGCTC GAGGACGGGG TGACGCTCAC CGCACCCGAC
ACGGTCTTCT TCGCGCTCGA CACCTTTCTG GGCCGCGATG CGATCGTGGG ACCGAACGTG
GTCTTCGGCC CCGGAGTGAC GGTCGAGTCC GGCGCCGAGA TCCGGGCCTT CTGCCATCTG
GAAGGCTGCC ACATCTCGCG CGGGGCGACG GTCGGGCCCT TCGCCCGCCT GCGTCCGGGT
GCGGAACTGG CCGAGGATGT GCATGTCGGC AACTTCGTCG AGATCAAGAA TGCGGTGCTC
GACGAGGGGG TGAAGGTGGG CCACCTCACC TACCTTGGCG ACGCGCATGT GGGCGAGCAC
ACCAACATCG GCGCGGGCAC GGTGACCTGC AACTACGACG GGGTGAACAA GCACCGGACC
GAGATCGGCG CCCACGCCTT CATCGGCTCG GACACGATGC TGGTGGCGCC GGTCAGCGTC
GGGGCGCGCG CGATGACGGC TTCGGGCTCG GTCATCACCG AGGATGTTCC CGCCGAGGCG
CTGGCCGTGG GCCGGGCGCG GCAGGTGACG AAACCCGGTC TGGCCACGCG GCTGATGGAG
ATGTTCCGCG CCGCAAGGGA CGCAAGCAAG AAGGGAACGA ACTGA
 
Protein sequence
MSRPTVSLIV LAAGQGTRMN SDLPKVLHRV GAAPMLHHAL RAGQSLEPER VVVVAGHGAE 
QVARAARAFD ETVEVVVQAE QLGTAHAVAQ AAPLLADAPG EVVVLYGDTP FIRPETLERM
IEMRARHAVV VLGFEAADPG RYGRLVTRGE DLDRIVEWKD ATDEERAITL CNSGVICAET
HLLFSLVSEV GNANAAGEYY LTDVVALARA KGLSAGVAIC DEAETLGVNT RAQLAAAEAE
FQRRARAAAL EDGVTLTAPD TVFFALDTFL GRDAIVGPNV VFGPGVTVES GAEIRAFCHL
EGCHISRGAT VGPFARLRPG AELAEDVHVG NFVEIKNAVL DEGVKVGHLT YLGDAHVGEH
TNIGAGTVTC NYDGVNKHRT EIGAHAFIGS DTMLVAPVSV GARAMTASGS VITEDVPAEA
LAVGRARQVT KPGLATRLME MFRAARDASK KGTN