Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2932 |
Symbol | |
ID | 5593761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2937615 |
End bp | 2938955 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640922049 |
Product | glucarate dehydratase |
Protein accession | YP_001459560 |
Protein GI | 157162242 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR03247] glucarate dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACAC AATCCAGTCC TGTTATTACT GATATGAAAG TCATTCCGGT CGCCGGGCAC GATAGCATGT TGCTTAATAT TGGTGGGGCA CATAACGCAT ATTTCACCCG CAATATTGTG GTACTCACCG ATAACGCCGG GCATACCGGC ATTGGTGAAG CGCCGGGCGG AGAGGTGATT TATCAAACAC TGGTCGATGC TATTCCGATG GTTCTGGGCC AGGAAGTTGC GCGCCTGAAT AAAGTGGTTC AGCAGGTGCA TAAAGGTAAT CAGGCAGCCG ATTTTGATAC CTTCGGCAAA GGTGCCTGGA CTTTTGAATT GCGCGTTAAT GCCGTGGCGG CGCTGGAAGC CGCCTTGCTT GACCTGCTAG GTAAGGCGCT GAATGTTCCG GTCTGCGAAC TGTTGGGGCC AGGCAAGCAA CGCGAGACTA TTACCGTCCT CGGTTATCTG TTTTATATCG GTGATCGGAC CAAAACCGAT CTTCCTTATC TGGAAAATAC GCCGGGCAAC CATGAGTGGT ATCAGTTGCG CCATCAGAAA GCGATGAACA GCGAAGCCGT TGTGCGTCTG GCGGAAGCCT CACAGGATCG CTACGGCTTT AAAGATTTCA AACTTAAGGG CGGCGTGTTA CCTGGCGAGC AAGAAATCGA CACTGTTCGT GCATTGAAGA AACGCTTCCC GGATGCGCGG ATTACCGTTG ATCCCAACGG TGCATGGCTG CTTGATGAAG CCATTTCTTT ATGCAAAGGG CTGAATGATG TTCTTACCTA TGCCGAAGAT CCATGCGGCG CAGAACAGGG CTTCTCCGGA CGTGAAGTGA TGGCGGAATT TCGACGGGCG ACCGGCTTGC CCGTCGCGAC TAACATGATC GCCACCAACT GGCGCGAAAT GGGTCATGCG GTGATGCTCA ATGCGGTAGA TATTCCACTT GCCGATCCGC ACTTCTGGAC GCTTTCCGGT GCAGTCCGTG TGGCGCAGCT TTGCGACGAC TGGGGGCTGA CCTGGGGCTG CCATTCTAAT AACCATTTCG ATATCTCTCT GGCGATGTTT ACCCATGTGG GCGCGGCGGC ACCGGGTAAT CCTACCGCTA TCGATACCCA CTGGATTTGG CAGGAGGGCG ATTGTCGCCT GACCCAAAAT CCGCTGGAGA TTAAAAACGG AAAAATTGCC GTTCCTGATG CGCCCGGTCT GGGCGTGGAA CTGGACTGGG AACAGGTACA AAAGGCACAT GAGGCCTATA AACGTCTGCC TGGCGGTGCG CGTAACGACG CAGGTCCGAT GCAGTACCTG ATCCCCGGCT GGACCTTTGA CCGTAAACGT CCCGTTTTCG GCCGTCATTG A
|
Protein sequence | MTTQSSPVIT DMKVIPVAGH DSMLLNIGGA HNAYFTRNIV VLTDNAGHTG IGEAPGGEVI YQTLVDAIPM VLGQEVARLN KVVQQVHKGN QAADFDTFGK GAWTFELRVN AVAALEAALL DLLGKALNVP VCELLGPGKQ RETITVLGYL FYIGDRTKTD LPYLENTPGN HEWYQLRHQK AMNSEAVVRL AEASQDRYGF KDFKLKGGVL PGEQEIDTVR ALKKRFPDAR ITVDPNGAWL LDEAISLCKG LNDVLTYAED PCGAEQGFSG REVMAEFRRA TGLPVATNMI ATNWREMGHA VMLNAVDIPL ADPHFWTLSG AVRVAQLCDD WGLTWGCHSN NHFDISLAMF THVGAAAPGN PTAIDTHWIW QEGDCRLTQN PLEIKNGKIA VPDAPGLGVE LDWEQVQKAH EAYKRLPGGA RNDAGPMQYL IPGWTFDRKR PVFGRH
|
| |