Gene YpAngola_A4200 details


Gene Information

Locus tag: YpAngola_A4200
Symbol: glmU
ID: 5802680
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 4491752
End bp: 4493128
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 47%
IMG OID: 641341967
Product: bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase
Protein accession: YP_001608470
Protein GI: 162418198
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains)
TIGRFAM ID: [TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase
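
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The short Python sketch below reproduces the arithmetic. It assumes the coordinates are 1-based and inclusive, which the record does not state explicitly but which matches the listed gene length. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
start_bp = 4491752
end_bp = 4493128

# Inclusive span, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 nt; the final stop codon is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1377 bp
print(protein_length, "aa")  # 458 aa
```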


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.236226
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.243995
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCTTATGT CTAACAGCTC AATGAGTGTC GTCATCCTTG CCGCGGGTAA GGGGACACGT 
ATGTATTCCG ACCTTCCTAA GGTATTGCAC CCATTGGCAG GTAAGCCAAT GGTTCAGCAT
GTTATTGATG CTGCCATGAA GCTAGGTGCA CAACATGTTC ATCTGGTTTA TGGGCACGGG
GGTGAGTTAC TGAAGAAGAC CTTGGCCGAT CCATCCTTGA ATTGGGTATT ACAGGCTGAG
CAACTGGGTA CTGGCCATGC GATGCAGCAA GCTGCCCCAC ATTTTGCTGA TGATGAAGAT
ATTTTGATGT TGTATGGCGA TGTGCCGCTG ATCTCTGTTG ATACGCTACA ACGTCTGCTG
GCCGCTAAGC CCGAGGGCGG GATTGGCTTG TTGACAGTAA AACTGGATAA CCCAAGCGGT
TATGGCCGTA TCGTTCGTGA GAATGGCGAT GTCGTCGGGA TCGTTGAACA TAAAGATGCC
AGCGATGCAC AACGGGAAAT CAACGAGATC AATACGGGGA TTCTGGTCGC AAATGGGCGC
GATTTAAAAC GCTGGCTATC ATTGTTGGAT AATAATAATG CTCAAGGTGA GTTTTATATC
ACTGACATTA TTGCGTTAGC GCATGCCGAT GGTAAGAAAA TTGCCACTGT CCATCCTACC
CGCTTGAGTG AAGTGGAAGG GGTGAATAAT CGCCTGCAAT TGTCTGCTCT TGAGCGCGTA
TTTCAGACAG AACAAGCGGA AAAATTATTG TTAGCGGGCG TGATGCTATT GGATCCCTCC
CGTTTTGATT TGCGCGGAGA ATTAACTCAT GGTCGCGATA TTACTATCGA TACCAATGTC
ATTATTGAAG GTCATGTGAT TTTAGGTGAC CGGGTGCGAA TTGGTACGGG GTGTGTACTC
AAAAACTGTG TGATTGGTGA TGACTCAGAA ATAAGCCCAT ATACCGTTTT AGAAGATGCC
CGCCTGGATG CCAACTGTAC TGTTGGGCCA TTTGCTCGTC TGCGTCCTGG CGCTGAATTG
GCTGAAGGTG CACATGTTGG TAATTTTGTT GAAATCAAAA AAGCCCGTTT GGGTAAAGGC
TCTAAAGCCG GTCATCTCTC TTATCTAGGT GACGCTGAGA TTGGGGCTGG CGTCAATATT
GGCGCGGGAA CTATAACGTG TAACTATGAC GGGGCTAATA AATTTAAAAC TATTATTGGC
GATGATGTTT TTGTCGGATC GGATACTCAA TTAGTTGCCC CGGTCACCGT TGCAAACGGT
GCAACCATTG GCGCGGGTAC GACTGTCACC CGTGATGTTG CCGAAAATGA ATTAGTCATC
AGCCGGGTTA AGCAGGTTCA TATTCAGGGA TGGAAGCGCC CGGTAAAGAA AAAGTAA
 
Protein sequence
MLMSNSSMSV VILAAGKGTR MYSDLPKVLH PLAGKPMVQH VIDAAMKLGA QHVHLVYGHG 
GELLKKTLAD PSLNWVLQAE QLGTGHAMQQ AAPHFADDED ILMLYGDVPL ISVDTLQRLL
AAKPEGGIGL LTVKLDNPSG YGRIVRENGD VVGIVEHKDA SDAQREINEI NTGILVANGR
DLKRWLSLLD NNNAQGEFYI TDIIALAHAD GKKIATVHPT RLSEVEGVNN RLQLSALERV
FQTEQAEKLL LAGVMLLDPS RFDLRGELTH GRDITIDTNV IIEGHVILGD RVRIGTGCVL
KNCVIGDDSE ISPYTVLEDA RLDANCTVGP FARLRPGAEL AEGAHVGNFV EIKKARLGKG
SKAGHLSYLG DAEIGAGVNI GAGTITCNYD GANKFKTIIG DDVFVGSDTQ LVAPVTVANG
ATIGAGTTVT RDVAENELVI SRVKQVHIQG WKRPVKKK
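
The protein sequence above can be checked against the gene sequence by translating it with translation table 11, the bacterial code named in the record. The sketch below is a minimal, dependency-free illustration, not the method used to produce this record. The function name `translate_cds` and its `initiator` parameter are hypothetical. Table 11 assigns the same amino acids as the standard code, and the sketch reads an alternative start codon such as the TTG that opens this gene as methionine.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Translation table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by table 11; as initiators they are read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna, initiator=True):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-nt grouping used above) is ignored.
    If `initiator` is True, a valid start codon at position 0 becomes 'M'.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and initiator and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 27 nt of the gene sequence listed above.
print(translate_cds("TTGCTTATGT CTAACAGCTC AATGAGTGTC"))                 # MLMSNSSMSV
print(translate_cds("TGGAAGCGCC CGGTAAAGAA AAAGTAA", initiator=False))   # WKRPVKKK
```

The two fragments reproduce the first and last residues of the listed protein sequence. Passing the full gene sequence to the same function should give the complete 458-residue protein.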