Gene Dd1591_4197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDd1591_4197 
Symbol 
ID8117647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDickeya zeae Ech1591 
KingdomBacteria 
Replicon accessionNC_012912 
Strand
Start bp4740268 
End bp4741638 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content56% 
IMG OID644854577 
ProductUDP-N-acetylglucosamine pyrophosphorylase 
Protein accessionYP_003006472 
Protein GI251791751 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAACA GTGCAATGAG TGTGGTTATC CTTGCCGCCG GTAAGGGAAC CCGCATGTAT 
TCCGATCTTC CCAAAGTTCT TCATCCTCTG GCGGGTAAAC CGATAGTTAA GCATGTGATC
GATGCGGCGA TGGCGGTTGG CGCTCGTCGT ATTCATCTGG TTTACGGACA CGGTGCAAAC
TTGATTCGGG AAACGCTGAC GGAAACGTCG TTGAATTGGG TATTGCAGGC CGAGCAGTTG
GGAACCGGCC ATGCGGTGCA GCAAGCGGCC GATGGTTTTG ACGATAACGA AGACATTCTG
ATTCTGTACG GCGATGTGCC GCTGATTTCC CCTGCAACGT TGCAGCGCCT GGTGGCAGCC
AAACCGCAAG GCGGGATTGG CCTGCTGACC GTCAATCTGG CTGACCCTAC CGGTTACGGC
CGTATTGTGC GGGACAACGG CGAAGTGGTG GGGATTGTGG AGCATAAAGA CGCCACCGAG
CAGCAGCGTG CGATCACCGA AATCAACACC GGCATTCTGG TGGCGGGCGG GCGCGATTTG
AAGCGTTGGT TAAGCCAGCT CAATAACCAC AATGTGCAGG GCGAATATTA TCTTACCGAT
ATCATCGCCA TGGCATCGCA GGAAGGCCAG CGCGTGGTGG CGGTGCAGCC GTCGCGTCTG
AGTGAGGTGG AAGGCATCAA TAACCGTCTG CAACTGGCGA CGCTGGAGCG TACTTACCAG
CGCGAACAGG CGGAGCAGTT ATTGCTGGCG GGTGTTATGC TGCTGGACCC TGCGCGTTTT
GACCTGCGTG GCGAACTGGT GCATGGCCGT GATGTGACGA TCGATGCCAA CGTCATTCTG
GAAGGCCGGG TGACGTTGGG CAATCGGGTG AAAATCGGTG CGGGCTGCGT GATCAAAAAC
AGTGAGATCG GCGACGATTG CGAGCTTAGC CCCTACACCG TGGCGGAAAA CGCAGTACTG
GAAGCGCGTT GTACCGTTGG CCCGTTTGCT CGCCTGCGCC CCGGCGCGGT GCTGGAAGAA
GAGGCGCATG TCGGCAATTT TGTTGAATTG AAAAAGGCGC GTCTCGGCAA AGGATCGAAA
GCCGGTCATC TGACTTACCT CGGTGATGCG GAAATCGGCT CTGATGTGAA CATCGGCGCA
GGTGTGATTA CCTGTAACTA CGACGGCGCC AATAAACACC AGACGATAAT TGGCGACGAT
GTGTTTGTCG GTTCGGATAG CCAGTTGATT GCACCGGTTA AGGTGGCCAA CGGCGCCACT
ATTGGGGCAG GCACTACCGT CACCCATGAT GTGGGTGAAA ACGAACTGGT TATCAGCCGC
GTTAAGCAGA CTCATATCAG CGGCTGGAAA CGCCCGGTGA AGAAAAAATA G
 
Protein sequence
MSNSAMSVVI LAAGKGTRMY SDLPKVLHPL AGKPIVKHVI DAAMAVGARR IHLVYGHGAN 
LIRETLTETS LNWVLQAEQL GTGHAVQQAA DGFDDNEDIL ILYGDVPLIS PATLQRLVAA
KPQGGIGLLT VNLADPTGYG RIVRDNGEVV GIVEHKDATE QQRAITEINT GILVAGGRDL
KRWLSQLNNH NVQGEYYLTD IIAMASQEGQ RVVAVQPSRL SEVEGINNRL QLATLERTYQ
REQAEQLLLA GVMLLDPARF DLRGELVHGR DVTIDANVIL EGRVTLGNRV KIGAGCVIKN
SEIGDDCELS PYTVAENAVL EARCTVGPFA RLRPGAVLEE EAHVGNFVEL KKARLGKGSK
AGHLTYLGDA EIGSDVNIGA GVITCNYDGA NKHQTIIGDD VFVGSDSQLI APVKVANGAT
IGAGTTVTHD VGENELVISR VKQTHISGWK RPVKKK