Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4057 |
Symbol | |
ID | 6146973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4148615 |
End bp | 4149763 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618882 |
Product | galactonate dehydratase |
Protein accession | YP_001746020 |
Protein GI | 170682934 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCA CCAAAATTAC CACGTATCGT TTACCTCCCC GCTGGATGTT CCTGAAAATT GAAACCGATG AAGGCGTGGT CGGCTGGGGC GAGCCCGTGA TTGAAGGCCG CGCCCGTACG GTGGAAGCGG CAGTTCACGA GCTGGGTGAC TATTTGATTG GTCAGGATCC ATCGCGCATC AATGACTTAT GGCAAGTGAT GTATCGCGCC GGATTTTATC GTGGCGGTCC AATCCTGATG AGCGCCATCG CCGGTATCGA CCAGGCGTTA TGGGATATCA AAGGCAAAGT GCTGAATGCG CCGGTCTGGC AACTGATGGG CGGCCTGGTT CGCGACAAAA TTAAAGCCTA CAGTTGGGTC GGCGGCGATC GTCCGGCGGA TGTTATCGAC GGCATTAAAA CGCTACGCGA AATCGGCTTC GATACCTTCA AACTGAACGG TTGTGAAGAA CTGGGGCTAA TTGATAACTC CCGCGCGGTA GATGCGGCAG TCAACACCGT GGCACAAATT CGTGAAGCCT TTGGCAATCA GATTGAGTTT GGTCTTGATT TCCATGGTCG CGTCAGCGCG CCGATGGCGA AAGTGCTGAT TAAAGAGCTG GAACCGTATC GTCCGCTGTT TATTGAAGAG CCGGTGCTGG CGGAACAGGC CGAATACTAC CCGAAACTGG CGGCACAAAC GCATATTCCA CTGGCGGCGG GTGAGCGCAT GTTCTCACGC TTCGATTTTA AACGTGTGCT GGAGGCAGGC GGTATTTCGA TTCTGCAACC GGATCTCTCC CACGCAGGCG GTATTACCGA ATGCTACAAA ATTGCCGGAA TGGCAGAAGC CTATGACGTG ACCCTTGCGC CGCACTGTCC GCTCGGACCG ATTGCACTGG CGGCTTGCCT GCATATCGAC TTTGTTTCCT ATAACGCCGT CCTTCAGGAA CAAAGTATGG GCATTCATTA CAACAAAGGC GCGGAGTTAC TCGACTTTGT GAAAAACAAA GAGGACTTCA GCATGGTCGG CGGTTTCTTT AAACCATTAA CGAAACCGGG CTTAGGCGTG GAAATCGACG AAGCTAAAGT TATTGAGTTC AGTAAAAATG CCCCGGACTG GCGTAATCCG CTCTGGCGTC ATGAAGATAA CAGCGTAGCA GAGTGGTAA
|
Protein sequence | MKITKITTYR LPPRWMFLKI ETDEGVVGWG EPVIEGRART VEAAVHELGD YLIGQDPSRI NDLWQVMYRA GFYRGGPILM SAIAGIDQAL WDIKGKVLNA PVWQLMGGLV RDKIKAYSWV GGDRPADVID GIKTLREIGF DTFKLNGCEE LGLIDNSRAV DAAVNTVAQI REAFGNQIEF GLDFHGRVSA PMAKVLIKEL EPYRPLFIEE PVLAEQAEYY PKLAAQTHIP LAAGERMFSR FDFKRVLEAG GISILQPDLS HAGGITECYK IAGMAEAYDV TLAPHCPLGP IALAACLHID FVSYNAVLQE QSMGIHYNKG AELLDFVKNK EDFSMVGGFF KPLTKPGLGV EIDEAKVIEF SKNAPDWRNP LWRHEDNSVA EW
|
| |