Gene EcHS_A3944 details


Gene Information

Locus tag: EcHS_A3944
Symbol: glmS
ID: 5591203
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3939018
End bp: 3940847
Gene Length: 1830 bp
Protein Length: 609 aa
Translation table: 11
GC content: 53%
IMG OID: 640923051
Product: glucosamine--fructose-6-phosphate aminotransferase
Protein accession: YP_001460528
Protein GI: 157163210
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0449] Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains
TIGRFAM ID: [TIGR01135] glucosamine--fructose-6-phosphate aminotransferase (isomerizing)
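
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 3939018
END_BP = 3940847


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1830 609
```

Both results agree with the Gene Length and Protein Length fields of this record.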


Plasmid Coverage information

Num covering plasmid clones: 57
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGTGGAA TTGTTGGCGC GATCGCGCAA CGTGATGTAG CAGAAATCCT TCTTGAAGGT 
TTACGTCGTC TGGAATACCG CGGATATGAC TCTGCCGGTC TGGCCGTTGT TGATGCAGAA
GGTCATATGA CCCGCCTGCG TCGCCTCGGT AAAGTCCAGA TGCTGGCACA GGCAGCGGAA
GAACATCCTC TGCATGGCGG CACTGGTATT GCTCACACTC GCTGGGCGAC CCACGGTGAA
CCTTCAGAAG TGAATGCGCA TCCGCATGTT TCTGAACACA TTGTGGTGGT GCATAACGGC
ATCATCGAAA ACCATGAACC GCTGCGTGAA GAGCTAAAAG CGCGTGGCTA TACCTTCGTT
TCTGAAACCG ACACCGAAGT GATTGCCCAT CTGGTGAACT GGGAGCTGAA ACAAGGCGGG
ACTCTGCGTG AGGCCGTTCT GCGTGCTATC CCGCAGCTGC GTGGTGCGTA CGGTACAGTG
ATCATGGACT CCCGTCACCC GGATACCCTG CTGGCGGCAC GTTCTGGTAG TCCGCTGGTG
ATTGGCCTGG GGATGGGCGA AAACTTTATC GCTTCTGACC AGCTGGCGCT GTTGCCGGTG
ACCCGTCGCT TTATCTTCCT TGAAGAGGGC GATATTGCGG AAATCACTCG CCGTTCGGTA
AACATCTTCG ATAAAACTGG CGCGGAAGTA AAACGTCAGG ATATCGAATC CAATCTGCAA
TATGACGCGG GCGATAAAGG CATTTACCGT CACTACATGC AGAAAGAGAT CTACGAACAG
CCGAACGCGA TCAAAAACAC CCTTACCGGA CGCATCAGCC ACGGTCAGGT TGATTTAAGC
GAGCTGGGAC CGAACGCCGA CGAACTGCTG TCGAAGGTTG AGCATATTCA GATCCTCGCC
TGTGGTACTT CTTATAACTC CGGTATGGTT TCCCGCTACT GGTTTGAATC GCTAGCAGGT
ATTCCGTGCG ACGTCGAAAT CGCCTCTGAA TTCCGCTATC GCAAATCTGC CGTGCGTCGT
AACAGCCTGA TGATCACCTT GTCACAGTCT GGCGAAACCG CGGATACACT GGCTGGCCTG
CGTCTGTCGA AAGAGCTGGG TTACCTTGGT TCACTGGCAA TCTGTAACGT TCCGGGTTCT
TCTCTGGTGC GCGAATCCGA TCTGGCGCTA ATGACCAACG CGGGTACAGA AATCGGCGTG
GCATCCACTA AAGCATTCAC CACTCAGTTA ACTGTGCTGT TGATGCTGGT GGCGAAGCTG
TCTCGCCTGA AAGGTCTGGA TGCCTCCATT GAACATGACA TTGTGCATGG TCTGCAGGCG
TTGCCGAGCC GTATTGAGCA GATGCTGTCT CAGGACAAAC GCATTGAAGC TCTGGCAGAA
GATTTCTCTG ACAAACATCA CGCGCTGTTC CTGGGCCGTG GCGATCAGTA CCCAATCGCG
CTGGAAGGCG CATTGAAGCT GAAAGAGATC TCTTACATTC ACGCTGAAGC CTACGCTGCA
GGTGAACTGA AACACGGTCC GCTGGCGCTG ATTGATGCCG ATATGCCGGT TATCGTCGTT
GCACCGAACA ACGAATTGCT GGAAAAACTA AAATCCAACA TTGAAGAAGT TCGCGCGCGT
GGCGGTCAGT TGTATGTCTT CGCCGATCAG GATGCGGGTT TTGTAAGTAG CGATAACATG
CACATCATCG AGATGCCGCA TGTGGAAGAG GTGATTGCAC CAATCTTCTA CACCGTTCCG
CTGCAGCTAC TGGCTTATCA CGTCGCGCTG ATCAAAGGTA CCGACGTTGA CCAGCCGCGT
AACCTGGCAA AATCGGTTAC GGTTGAGTAA
 
Protein sequence
MCGIVGAIAQ RDVAEILLEG LRRLEYRGYD SAGLAVVDAE GHMTRLRRLG KVQMLAQAAE 
EHPLHGGTGI AHTRWATHGE PSEVNAHPHV SEHIVVVHNG IIENHEPLRE ELKARGYTFV
SETDTEVIAH LVNWELKQGG TLREAVLRAI PQLRGAYGTV IMDSRHPDTL LAARSGSPLV
IGLGMGENFI ASDQLALLPV TRRFIFLEEG DIAEITRRSV NIFDKTGAEV KRQDIESNLQ
YDAGDKGIYR HYMQKEIYEQ PNAIKNTLTG RISHGQVDLS ELGPNADELL SKVEHIQILA
CGTSYNSGMV SRYWFESLAG IPCDVEIASE FRYRKSAVRR NSLMITLSQS GETADTLAGL
RLSKELGYLG SLAICNVPGS SLVRESDLAL MTNAGTEIGV ASTKAFTTQL TVLLMLVAKL
SRLKGLDASI EHDIVHGLQA LPSRIEQMLS QDKRIEALAE DFSDKHHALF LGRGDQYPIA
LEGALKLKEI SYIHAEAYAA GELKHGPLAL IDADMPVIVV APNNELLEKL KSNIEEVRAR
GGQLYVFADQ DAGFVSSDNM HIIEMPHVEE VIAPIFYTVP LQLLAYHVAL IKGTDVDQPR
NLAKSVTVE
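
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline: it translates only the first 60 and last 30 nucleotides shown above, using the standard codon assignments (translation table 11 uses the same codon-to-amino-acid mapping as the standard table and differs in the permitted start codons). The `translate` helper and constant names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # remove the 10-base grouping spaces
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record above.
FIRST_60 = "ATGTGTGGAA TTGTTGGCGC GATCGCGCAA CGTGATGTAG CAGAAATCCT TCTTGAAGGT"
LAST_30 = "AACCTGGCAA AATCGGTTAC GGTTGAGTAA"

print(translate(FIRST_60))  # MCGIVGAIAQRDVAEILLEG
print(translate(LAST_30))   # NLAKSVTVE
```

The two outputs match the first 20 and last 9 residues of the protein sequence listed above; the final TAA codon is the stop codon and contributes no residue.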