Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_3092 |
Symbol | |
ID | 5589456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 3099483 |
End bp | 3100823 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640926734 |
Product | glucarate dehydratase |
Protein accession | YP_001464110 |
Protein GI | 157157733 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR03247] glucarate dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.026036 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACAC AATCCAGTCC TGTTATTACT GATATGAAAG TCATTCCGGT CGCCGGGCAC GATAGCATGT TGCTTAATAT TGGTGGGGCA CATAACGCAT ATTTCACCCG CAATATTGTG GTACTCACCG ATAACGCCGG GCATACCGGC ATTGGTGAAG CGCCGGGCGG AGAGGTGATT TATCAAACAC TGGTCGATGC TATTCCGATG GTTCTGGGCC AGGAAGTTGC GCGCCTGAAT AAAGTGGTTC AGCAGGTGCA TAAAGGTAAT CAGGCAGCCG ATTTTGATAC CTTCGGCAAA GGTGCCTGGA CTTTTGAATT GCGCGTTAAT GCCGTGGCGG CGCTGGAAGC CGCCTTGCTT GACCTGCTAG GTAAGGCGCT GAATGTTCCG GTCTGCGAAC TGTTGGGGCC AGGCAAGCAA CGCGAGACTA TTACCGTCCT CGGTTATCTG TTTTATATCG GTGATCGGAC CAAAACCGAT CTTCCTTATC TGGAAAATAC GCCGGGCAAC CATGAGTGGT ATCAGTTGCG CCATCAGAAA GCGATGAACA GCGAAGCCGT TGTGCGTCTG GCGGAAGCCT CACAGGATCG CTATGGCTTT AAAGATTTCA AACTTAAGGG CGGGGTGTTA CCTGGCGAGC AAGAAATCGA CACTGTTCGT GCATTGAAGA AACGCTTCCC TGATGCGCGG ATTACCGTTG ATCCCAACGG TGCCTGGCTA CTTGATGAAG CCATTTCTCT ATGCAAAGGG CTGAATGATG TTCTTACCTA TGCCGAAGAT CCATGCGGCG CAGAACAGGG TTTCTCCGGT CGTGAAGTTA TGGCGGAGTT TCGGCGGGCG ACCGGCTTGC CCGTCGCGAC TAACATGATC GCCACCAACT GGCGCGAAAT GGGTCATGCG GTGATGCTCA ATGCGGTAGA TATTCCACTT GCCGATCCGC ACTTCTGGAC TCTTTCCGGT GCAGTCCGTG TGGCGCAGCT TTGCGACGAC TGGGGGCTGA CCTGGGGCTG CCATTCTAAT AACCATTTCG ATATCTCTCT GGCGATGTTT ACCCATGTGG GCGCGGCGGC ACCGGGTAAT CCTACCGCTA TCGATACCCA CTGGATTTGG CAGGAGGGCG ATTGTCGCCT GACCAAAAAT CCGCTGGAGA TTAAAAACGG AAAAATTGCC GTTCCTGATG CGCCCGGTCT GGGCGTGGAA CTGGACTGGG AACAGGTACA AAAGGCACAT GAGGCCTATA AACGTCTGCC TGTCGGTGCG CGTAACGACG CAGGTCCGAT GCAGTACCTG ATCCCCGGCT GGACCTTTGA CCGTAAACGT CCCGTTTTCG GCCGTCATTG A
|
Protein sequence | MTTQSSPVIT DMKVIPVAGH DSMLLNIGGA HNAYFTRNIV VLTDNAGHTG IGEAPGGEVI YQTLVDAIPM VLGQEVARLN KVVQQVHKGN QAADFDTFGK GAWTFELRVN AVAALEAALL DLLGKALNVP VCELLGPGKQ RETITVLGYL FYIGDRTKTD LPYLENTPGN HEWYQLRHQK AMNSEAVVRL AEASQDRYGF KDFKLKGGVL PGEQEIDTVR ALKKRFPDAR ITVDPNGAWL LDEAISLCKG LNDVLTYAED PCGAEQGFSG REVMAEFRRA TGLPVATNMI ATNWREMGHA VMLNAVDIPL ADPHFWTLSG AVRVAQLCDD WGLTWGCHSN NHFDISLAMF THVGAAAPGN PTAIDTHWIW QEGDCRLTKN PLEIKNGKIA VPDAPGLGVE LDWEQVQKAH EAYKRLPVGA RNDAGPMQYL IPGWTFDRKR PVFGRH
|
| |