Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2926 |
Symbol | |
ID | 6144502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3002657 |
End bp | 3003997 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641617795 |
Product | glucarate dehydratase |
Protein accession | YP_001744950 |
Protein GI | 170679833 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR03247] glucarate dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACAC AATCCAGTCC TGTTATTACT GATATGAAAG TCATTCCGGT GGCCGGGCAT GACAGCATGT TGCTTAATAT TGGTGGTGCA CATAACGCAT ATTTCACCCG CAATATTGTG GTACTCACCG ATAACGCCGG GCATACCGGC ATTGGTGAAG CGCCGGGCGG AGAGGTGATT TATCAGACAC TGGTCGAGGC TATTCCGATG GTGCTGGGCC AGGAAGTTGC GCGCCTGAAT AAAGTGGTTC AGCAGGTGCA TAAAGGTAAT CAGGCAGCCG ATTTTGATAC CTTCGGCAAA GGTGCCTGGA CTTTTGAATT GCGCGTTAAT GCCGTGGCGG CGCTGGAAGC CGCCTTGCTT GACCTGCTAG GTAAGGCGCT GAATGTTCCG GTCTGCGAAC TGTTAGGGCC AGGCAAGCAA CGTGATGCTA TTACCGTTCT CGGTTATCTG TTTTATATCG GTGATCGGAC CAAAACCGAT CTTCCTTATC TGGAAAATAC TCCGGGCAAC CATGAGTGGT ATCAATTGCG GCATCAGAAA GCGATGAACA GCGAAGCCGT TGTGCGTCTG GCGGACGCCT CACAGGATCG CTATGGCTTT AAAGATTTCA AACTTAAGGG CGGAGTGTTA CCTGGCGAGC AAGAAATCGA CACTGTTCGT GCATTGAAGA AACGCTTCCC GGATGCGCGG ATTACCGTTG ATCCCAACGG AGCATGGCTG CTTGATGAAG CTATTTCTCT GTGCAAAGGG CTGAATGATG TTCTCACCTA TGCCGAAGAT CCGTGTGGCG CAGAACAGGG TTTCTCCGGA CGTGAAGTGA TGGCGGAATT TCGGCGGGCG ACCGGCTTGC CCGTCGCGAC TAATATGATC GCCACCAACT GGCGCGAAAT GGGCCATGCG GTGATGCTCA ATGCGGTAGA TATTCCACTT GCCGACCCGC ACTTCTGGAC GCTTTCCGGT GCAGTCCGCG TAGCGCAGCT TTGCGACGAC TGGGGGCTGA CCTGGGGCTG CCATTCCAAT AACCACTTTG ATATCTCTCT GGCGATGTTT ACCCATGTGG GCGCGGCGGC ACCGGGTAAT CCTACCGCTA TCGATACCCA CTGGATTTGG CAGGAGGGCG ATTGTCGCCT GACCAAAAAT CCGCTGGAGA TTAAAAATGG AAAAATTGCC GTTCCTGATG CGCCCGGTCT GGGCGTGGAA CTGAACTGGG AGCAGGTACA AAAGGCGCAT GAGGCCTATA AACGTCTGCC TGGCGGTGCG CGTAACGACG CGGGCCCGAT GCAGTACCTG ATCCCCGGCT GGACCTTTGA CCGTAAACGT CCCGTTTTCG GCCGTCATTG A
|
Protein sequence | MATQSSPVIT DMKVIPVAGH DSMLLNIGGA HNAYFTRNIV VLTDNAGHTG IGEAPGGEVI YQTLVEAIPM VLGQEVARLN KVVQQVHKGN QAADFDTFGK GAWTFELRVN AVAALEAALL DLLGKALNVP VCELLGPGKQ RDAITVLGYL FYIGDRTKTD LPYLENTPGN HEWYQLRHQK AMNSEAVVRL ADASQDRYGF KDFKLKGGVL PGEQEIDTVR ALKKRFPDAR ITVDPNGAWL LDEAISLCKG LNDVLTYAED PCGAEQGFSG REVMAEFRRA TGLPVATNMI ATNWREMGHA VMLNAVDIPL ADPHFWTLSG AVRVAQLCDD WGLTWGCHSN NHFDISLAMF THVGAAAPGN PTAIDTHWIW QEGDCRLTKN PLEIKNGKIA VPDAPGLGVE LNWEQVQKAH EAYKRLPGGA RNDAGPMQYL IPGWTFDRKR PVFGRH
|
| |