Gene Information |
Locus tag | EcHS_A2931 |
Symbol | |
ID | 5593760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2936254 |
End bp | 2937594 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640922048 |
Product | glucarate dehydratase |
Protein accession | YP_001459559 |
Protein GI | 157162241 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR03247] glucarate dehydratase |
|
|
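The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal consistency check, assuming that the coordinates are 1-based and inclusive and that the stop codon is counted in the gene length but not in the protein length; neither convention is stated in the record, and the variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 2936254
end_bp = 2937594
gene_length_bp = 1341
protein_length_aa = 446

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one stop codon that is not part of the protein.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(span_bp, codon_count, expected_protein_length)
```

Under these assumptions the span equals the reported gene length, and the codon count minus one equals the reported protein length.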
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 0.923462 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTCTC AATTTACGAC GCCTGTTGTT ACTGAAATGC AGGTTATCCC GGTGGCGGGT CATGACAGTA TGCTGATGAA TCTGAGTGGT GCACACGCAC CGTTCTTTAC GCGTAATATT GTGATTATCA AAGATAATTC TGGTCACACT GGCGTAGGGG AAATTCCCGG CGGCGAGAAA ATCCGTAAAA CGCTGGAAGA TGCGATTCCG CTGGTAGTGG GTAAAACGCT GGGTGAATAC AAAAACGTTC TGACGCTGGT GCGTAATACT TTTGCCGATC GTGATGCTGG TGGGCGCGGT TTGCAGACAT TTGATTTGCG TACCACTATT CATGTAGTTA CCGGGATAGA AGCGGCAATG CTGGATCTGC TGGGGCAGCA TCTGGGGGTA AACGTGGCAT CGCTGCTGGG CGATGGTCAA CAGCGTAGCG AAGTCGAAAT GCTCGGTTAT CTGTTCTTCG TCGGTAATCG CAAAGCCACA CCGCTGCCGT ATCAAAGCCA GCCGGATGAC TCATGCGACT GGTATCGCCT GCGTCATGAA GACGCGATGA CGCCGGATGC GGTGGTGCGC CTGGCGGAAG CGGCATATGA AAAATATGGC TTCAACGATT TCAAACTGAA GGGCGGTGTA CTGGCCGGGG AAGAAGAAGC CGAGTCTATT GTGGCACTGG CGAAACGTTT CCCGCAGGCG CGTATTACGC TCGATCCTAA CGGTGCCTGG TCGCTGAACG AAGCGATTAA AATTGGTAAA TACCTGAAAG GGTCACTGGC TTATGCAGAA GATCCGTGTG GTGCGGAGCA AGGTTTCTCC GGGCGTGAAG TGATGGCAGA GTTCCGTCGC GCGACAGGTC TGCCGACTGC AACCAATATG ATCGCCACCG ACTGGCGGCA AATGGGCCAT ACGCTCTCCC TGCAATCCGT TGATATCCCG CTGGCGGATC CGCATTTCTG GACAATGCAA GGTTCGGTAC GTGTGGCGCA AATGTGCCAT GAATTTGGCC TGACCTGGGG TTCACACTCT AACAACCACT TCGATATTTC CCTGGCGATG TTTACCCATG TTGCCGCCGC TGCACCGGGT AAAATTACTG CTATTGATAC GCACTGGATT TGGCAGGAAG GCAATCAGCG CCTGACCAAA GAACCGTTTG AGATCAAAGG CGGGCTGGTA CAGGTGCCAG AAAAACCGGG GCTGGGTGTA GAAATCGATA TGGATCAAGT GATGAAAGCC CATGAGCTGT ATCAGAAACA CGGGCTTGGC GCGCGTGACG ATGCGATGGG AATGCAGTAT CTGATTCCTG GCTGGACGTT CGATAACAAG CGTCCGTGCA TGGTGCGTTA A
|
Protein sequence | MSSQFTTPVV TEMQVIPVAG HDSMLMNLSG AHAPFFTRNI VIIKDNSGHT GVGEIPGGEK IRKTLEDAIP LVVGKTLGEY KNVLTLVRNT FADRDAGGRG LQTFDLRTTI HVVTGIEAAM LDLLGQHLGV NVASLLGDGQ QRSEVEMLGY LFFVGNRKAT PLPYQSQPDD SCDWYRLRHE DAMTPDAVVR LAEAAYEKYG FNDFKLKGGV LAGEEEAESI VALAKRFPQA RITLDPNGAW SLNEAIKIGK YLKGSLAYAE DPCGAEQGFS GREVMAEFRR ATGLPTATNM IATDWRQMGH TLSLQSVDIP LADPHFWTMQ GSVRVAQMCH EFGLTWGSHS NNHFDISLAM FTHVAAAAPG KITAIDTHWI WQEGNQRLTK EPFEIKGGLV QVPEKPGLGV EIDMDQVMKA HELYQKHGLG ARDDAMGMQY LIPGWTFDNK RPCMVR
|
| |
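The gene and protein sequences above are printed in blocks of ten characters separated by spaces. The sketch below shows one way to strip that spacing and translate the nucleotide sequence for comparison with the protein field. It is an illustration only: the helper names `clean` and `translate` are hypothetical, the codon table is the standard one (translation table 11 uses the same codon-to-amino-acid assignments and differs in its permitted start codons), and alternative start codons are not handled, which does not matter here because the sequence begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence field, copied as printed.
gene_prefix = "ATGAGTTCTC AATTTACGAC GCCTGTTGTT"
print(translate(gene_prefix))  # MSSQFTTPVV
```

The translated prefix matches the first block of the protein sequence field. Passing the full gene sequence field to `translate` and comparing the result with `clean` applied to the protein field extends the same check to the whole record.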