Gene TM1040_0728 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0728 
SymbolglmU 
ID4076098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp785636 
End bp786985 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content63% 
IMG OID638006025 
Productbifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase 
Protein accessionYP_612723 
Protein GI99080569 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACTG CCCTCGTCAT CCTCGCCGCA GGCAAAGGCA CCAGGATGAA CTCCGATTTG 
CCCAAAGTCC TGCATCAGAT CGCACATGCG CCGATGCTGG AACATGCGAT GCGCGCCGGG
GGGGCGCTTG ACCCCGAGCG CACGGTGGTT GTGGCAGGCC ACGAGGCCGA GATGGTGCGC
GCGGCCACCG CAGAGATCGC CCCTGAAGCG ACAGTGGTGC TGCAGGAAGA GCAGCTCGGC
ACCGGCCACG CGGTACTTCA AGCGCGCGCG GCACTCGAGG GATTTCGCGG CGATGTCGTG
GTGCTCTATG GCGATACGCC TTTTGTGTCG GCCGAGACGC TGGAACGCAT GATCGAAGCG
CGCAGCCGCG CCGATCTAGT GATCCTCGGC TTTGAAGCCG CCGATCCTGC GCGCTACGGC
CGGTTGATCA TGCAGGGCGA AAGCCTTGAG AAAATCGTCG AGTTCAAGGA CGCAAGCGAC
GCCGAGCGCG CGATTACATT CTGCAACTCG GGCCTCATGG CGTGCAACGC CGAGGTGATG
TTTGGCCTGC TTGATCAAGT GGGCAACGAC AATGCCTCTG GCGAATACTA CCTGACCGAT
CTTGTCGAAC TCGCGCGGGC CGAGGGGCTG AGCGTCACGG CCGTGTCCTG CCCCGAAGCG
GAAACGCTCG GCATCAATTC CCGCGCGGAC CTCGCGGCCG CGGAGGCGGT GTTTCAGGCG
CATGCGCGGG CTGAGTTGTT GGACATCGGC GTCACGCTGA CGGCTCCTGA GACCGTCCAT
CTGGCCTTTG ACACCATCAT TGGTCGTGAC ACGGTGATTG AACCCAATGT GGTCTTTGGT
CCCGGTGTCA CCGTTGAGAG CGGCGCTTTG ATCCGGGCGT TTTCGCACCT TGAGGGCTGC
CATGTGTCGC GTGGCGCCAA GGTCGGCCCC TACGCCCGCC TGCGCCCCGG CGCGGAGCTG
GCCGAGGACA CCCATGTGGG CAACTTCGTT GAAATCAAGA ACGCTGAGAT CGCCGCAGGC
GCCAAGGTGA ACCACCTGAC CTATATTGGC GATGCCTCTG TGGGTGAGGC GACGAATATC
GGAGCGGGCA CAATCACCTG CAACTACGAT GGCGTCATGA AGCATCGCAC CGAAATCGGC
GCGCGCGCCT TTATCGGATC AAACACGTGT TTGGTCGCCC CGGTGACCGT TGGCGATGAG
GCGATGACGG CAACAGGTGC TGTCATCACC AAGGATGTCG CTGATGGAGA TCTGGCGATT
GCGCGCGTCC AGCAGACGAA CAAACCAGGC CGCGCACGCA AGCTGATGGA TATGCTGCGC
GCCAAGAAAG CCGCAAAGGC CAAAGGGTAA
 
Protein sequence
MSTALVILAA GKGTRMNSDL PKVLHQIAHA PMLEHAMRAG GALDPERTVV VAGHEAEMVR 
AATAEIAPEA TVVLQEEQLG TGHAVLQARA ALEGFRGDVV VLYGDTPFVS AETLERMIEA
RSRADLVILG FEAADPARYG RLIMQGESLE KIVEFKDASD AERAITFCNS GLMACNAEVM
FGLLDQVGND NASGEYYLTD LVELARAEGL SVTAVSCPEA ETLGINSRAD LAAAEAVFQA
HARAELLDIG VTLTAPETVH LAFDTIIGRD TVIEPNVVFG PGVTVESGAL IRAFSHLEGC
HVSRGAKVGP YARLRPGAEL AEDTHVGNFV EIKNAEIAAG AKVNHLTYIG DASVGEATNI
GAGTITCNYD GVMKHRTEIG ARAFIGSNTC LVAPVTVGDE AMTATGAVIT KDVADGDLAI
ARVQQTNKPG RARKLMDMLR AKKAAKAKG