Gene Mkms_4334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4334 
SymbolglmU 
ID4612276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4553308 
End bp4554801 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content72% 
IMG OID639794019 
Productbifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase 
Protein accessionYP_940315 
Protein GI119870363 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.699537 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.383191 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTCTT CGACGACCTC TTCGACCGAT ACTGCAGTTC TCGTGCTGGC CGCGGGCGCG 
GGCACCCGGA TGCGCTCGGA CATCCCGAAG GTGCTGCACA CCCTCGGCGG CCGCAGCATG
CTCGCGCACG CCCTGCACAC CGTGGCGAAG GTGGCCCCGC AGCACCTGGT GGTGGTGCTC
GGACACGACC GCGAACGCAT CGCCCCCGCC GTCGAGGCGC TGGCCACCGA CCTCGGCCGC
CCGATCGACG TCGCGATCCA GGATCAGCAG CTCGGCACCG GCCACGCCGC CGAGTGCGGG
CTCGCGGCGC TGCCCGAGGA CTTCACGGGG GTCGTCGTGG TGACCGCGGG CGACGTCCCG
CTGCTCGACG CCGACACCAT GGCCGACCTG CTGGCCACCC ACGGTTCGGC CGCGGCCACC
GTGCTGACCA CGACCGTCGA CGACCCGACC GGGTACGGGC GCATCCTGCG GACCCAGGAC
AACGAGGTCA CCAGCATCGT CGAACAGGCC GACGCCAGCC CGTCGCAGCG GGCCATCCGC
GAGGTCAACG CCGGCGTCTA CGCCTTCGAC ATCACCGCGC TGCGTTCGGC GCTGCGCCGC
CTGCGGTCCG ACAACGCCCA GCACGAGCTG TACCTCACCG ACGTCATCGC GATCTTCCGG
CAGGACGGCC TCAGCGTGCG GGCCCGGCAC GTCGACGACA GCGCCCTGGT GGCCGGCGTC
AACGACCGCG TGCAGTTGGC GGCGCTGGGC GCCGAGCTCA ACCGCCGCAT CGTCACCGCC
CACCAGCGCG CCGGTGTCAC CGTGATCGAC CCGGGCTCCA CCTGGATCGA CGTCGACGTG
ACCATCGGCC GCGACACCGT CATCCGGCCC GGCACCCAGT TGCTCGGCCG CACCCGCGTC
GGCGGGCGTT GTGACGTCGG ACCGGACACC ACGCTGAGCG ACGTCACCGT CGGCGACGGC
GCCTCGGTGG TCCGCACCCA CGGCTCGGAG TCCCTCATCG GCGCCGGCGC CACCGTCGGC
CCGTTCACCT ATCTGCGGCC GGGCACCGCG CTGGGCGCCG AGGGCAAACT CGGTGCATTC
GTGGAGACGA AGAACGCGAC GATCGGTGCA GGCACCAAGG TGCCGCACCT GACCTACGTC
GGCGACGCCG ACATCGGCGA GCACAGCAAC ATCGGCGCGT CGAGCGTCTT CGTCAACTAC
GACGGCGAGA CCAAGAACCG CACGACCATC GGGTCGCACG TGCGGACCGG CTCGGACACC
ATGTTCGTCG CGCCCGTGAC CGTCGGGGAC GGCGCCTACA CCGGTGCGGG CACGGTGATC
CGGCGCAACG TGCCGCCGGG CGCGCTGGCG GTCTCGGCCG GGTCGCAGCG CAACATCGAG
GGCTGGGTGG TCCGCAAACG CCCGGGTTCG GCCGCGGCAC GCGCGGCGGA GCGCGCATCG
GGTGAAGCAG CGGAGCAGGC GCTCGGCCAC CACGACGACT CCCAGGGGTC GTGA
 
Protein sequence
MTSSTTSSTD TAVLVLAAGA GTRMRSDIPK VLHTLGGRSM LAHALHTVAK VAPQHLVVVL 
GHDRERIAPA VEALATDLGR PIDVAIQDQQ LGTGHAAECG LAALPEDFTG VVVVTAGDVP
LLDADTMADL LATHGSAAAT VLTTTVDDPT GYGRILRTQD NEVTSIVEQA DASPSQRAIR
EVNAGVYAFD ITALRSALRR LRSDNAQHEL YLTDVIAIFR QDGLSVRARH VDDSALVAGV
NDRVQLAALG AELNRRIVTA HQRAGVTVID PGSTWIDVDV TIGRDTVIRP GTQLLGRTRV
GGRCDVGPDT TLSDVTVGDG ASVVRTHGSE SLIGAGATVG PFTYLRPGTA LGAEGKLGAF
VETKNATIGA GTKVPHLTYV GDADIGEHSN IGASSVFVNY DGETKNRTTI GSHVRTGSDT
MFVAPVTVGD GAYTGAGTVI RRNVPPGALA VSAGSQRNIE GWVVRKRPGS AAARAAERAS
GEAAEQALGH HDDSQGS