Gene Information |
Locus tag | EcSMS35_2925 |
Symbol | |
ID | 6145081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3001296 |
End bp | 3002636 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641617794 |
Product | glucarate dehydratase |
Protein accession | YP_001744949 |
Protein GI | 170680334 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR03247] glucarate dehydratase |
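
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that Gene Length counts the stop codon; both assumptions agree with the reported values. The function names are illustrative only and do not belong to any database API.

```python
# Values copied from the record above.
START_BP = 3001296
END_BP = 3002636


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon is part of the gene but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1341, matching "Gene Length"
print(protein_length(length_bp))  # 446, matching "Protein Length"
```

Because the gene is unspliced (Is gene spliced: No), the whole span is coding, so no intron lengths need to be subtracted.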
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCTC AATTTTCGAC GCCTGTTGTT ACTGAAATGC AGGTCATCCC GGTGGCGGGC CATGACAGTA TGCTGATGAA TCTGAGTGGT GCACACGCAC CGTTCTTCAC GCGTAATATT GTGATTATCA AAGATAATTC TGGTCACACT GGGGTAGGGG AAATTCCCGG CGGCGAGAAA ATCCGTAAAA CGCTGGAAGA TGCGATTCCG CTGGTGGTGG GTAAAACGCT GGGTGAATAC AAAAACGTTC TGACGCTGGT GCGTAATACA TTTGCCGATC GTGATGCCGG CGGGCGCGGT TTGCAGACAT TTGACCTGCG GACCACTATT CATGTGGTTA CCGGGATAGA AGCGGCAATG CTGGATCTGC TGGGGCAGCA TCTGGGGGTT AACGTGGCAT CGCTGCTGGG CGATGGTCAA CAGCGTAGCG AAGTCGAAAT GCTCGGTTAT CTGTTCTTCG TCGGTAATCG CAAAGCCACA CCGCTGCCGT ATCAAAGCCA GCCGGATGAT CAATGTGACT GGTATCGCCT GCGTCACGAA GAAGCGATGA CGCCGGATGC GGTGGTGCGC CTGGCGGAAG CGGCATATGA AAAATATGGC TTCAACGATT TCAAACTGAA GGGCGGTGTA CTGGCCGGGG AAGAAGAAGC AGAGTCTATT GTGGCACTGG CGCAACGCTT CCCACAGGCG CGTATTACCC TCGATCCTAA CGGTGCCTGG TCGCTGAACG AAGCGATTAA AATTGGTACA TACCTGAAAG GTTCGCTGGC TTATGCAGAA GATCCGTGTG GTGCGGAGCA AGGTTTCTCC GGGCGTGAAG TGATGGCAGA GTTCCGTCGC GCGACTGGCC TGCCGACCGC AACCAATATG ATCGCCACCG ACTGGCGGCA GATGGGCCAT ACGCTCTCCC TGCAATCCGT TGATATCCCG CTGGCGGATC CGCATTTCTG GACGATGCAA GGTTCGGTAC GTGTGGCGCA AATGTGCCAT GAATTTGGCC TGACCTGGGG TTCACACTCT AACAACCACT TCGATATTTC TCTGGCGATG TTTACCCATG TAGCCGCTGC TGCACCGGGT AAAATTACTG CTATTGATAC GCACTGGATC TGGCAGGAAG GCAATCAGCG TCTGACCAAA GAACCGTTTG AGATCAAAGG CGGGCTGGTA CAGGTGCCGC AAAAACCAGG ATTGGGCGTA GAAATCGATA TGGATCAGGT GATGAAAGCC CACGAGCTGT ATCAGAAACA TGGGCTTGGT GCGCGTGACG ATGCGATGGG AATGCAGTAC CTGATTCCTG GCTGGACGTT TGATAACAAG CGCCCGTGCA TGGTGCGTTA A
|
Protein sequence | MSSQFSTPVV TEMQVIPVAG HDSMLMNLSG AHAPFFTRNI VIIKDNSGHT GVGEIPGGEK IRKTLEDAIP LVVGKTLGEY KNVLTLVRNT FADRDAGGRG LQTFDLRTTI HVVTGIEAAM LDLLGQHLGV NVASLLGDGQ QRSEVEMLGY LFFVGNRKAT PLPYQSQPDD QCDWYRLRHE EAMTPDAVVR LAEAAYEKYG FNDFKLKGGV LAGEEEAESI VALAQRFPQA RITLDPNGAW SLNEAIKIGT YLKGSLAYAE DPCGAEQGFS GREVMAEFRR ATGLPTATNM IATDWRQMGH TLSLQSVDIP LADPHFWTMQ GSVRVAQMCH EFGLTWGSHS NNHFDISLAM FTHVAAAAPG KITAIDTHWI WQEGNQRLTK EPFEIKGGLV QVPQKPGLGV EIDMDQVMKA HELYQKHGLG ARDDAMGMQY LIPGWTFDNK RPCMVR
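
The protein sequence can be reproduced from the gene sequence with a minimal translation sketch. It assumes the gene sequence is already written in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns codons to amino acids exactly as the standard code does, differing only in the permitted start codons. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and only the first and last blocks of the sequence are used here for brevity.

```python
# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the 10-character block spacing used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and last 21 bases of the gene sequence above.
print(translate("ATGAGTTCTC AATTTTCGAC GCCTGTTGTT"))  # MSSQFSTPVV
print(translate("CGCCCGTGCA TGGTGCGTTA A"))           # RPCMVR
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the Protein sequence and GC content fields.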
|
| |