Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5492 |
Symbol | |
ID | 6970406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5142039 |
End bp | 5143268 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643389138 |
Product | L-sorbose 1-phosphate reductase |
Protein accession | YP_002273535 |
Protein GI | 209399864 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.647203 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAA CAGCTCTGCG TCTTTATGGT AAACGTGATT TACGCCTGGA AACCTTTGAC CTTCCTGAAA TGCAGGAGGA TGAAATCCTC GCGACGGTGG TCACTGACAG CCTGTGCCTC TCTTCCTGGA AAGAGGCCAA TCTGGGTGAA AACCATAAAA AAGTACCCGA CGATGTGGCG ACCAACCCAA TCATCATCGG CCACGAGTTT TGCGGCGATA TTCTGGCCGT GGGTAAAAAG TGGCAGCACA AATTCCAGCC GGGTCAGCGT TATGTGATTC AGGCCAACCT GCAACTCCCC GACCGCCCGG ACTGCCCCGG CTACTCCTTC CCGTGGGTAG GCGGCGAGGC CACGCATGTG GTTATTCCCA ACGAGGTCAT GGAACAAGAT TGCCTGCTGG CATACGACGG CGAAACCTAT TTTGAAGGCT CGCTGGTTGA ACCGCTTTCC TGCGTGATTG GCGCGTTCAA CGCCAACTAT CATCTTCAGG AAGGTAGTTA TAACCACACG ATGGGGATTC GCCCGCAAGG GCGCATGCTG ATCCTCGGCG GCACCGGACC AATGGGACTG TTGGCGATTG ATTATGCGCT ACATGGACCC GTTAACCCGT CGCTGCTCGT CATTACCGAT ACCGACAACG ATAAATTGAG TTATGCGCGC AAGCACTATC CATCAGAACC GCAAACACTG ATTCATTATC TCAATGCCGC CGATGCAGCA TTTGATACGC TAATGGCGCT GAGTGGCGGT CACGGCTTCG ATGATATTTT CGTCTTTGTG CCTAATGAAG GACTGGTGAC TCTCGCCTCT TCCTTGCTGG CGACAGATGG TTGCCTGAAT TTCTTCGCCG GACCGCAGGA TAAACATTTC AGCGCGCCAA TTAATTTCTA CGATGTGCAT TATGCATTTA CCCACTACGT GGGCACGTCA GGCGGCAATA CCGACGACAT GCGCGCAGCG GTCAAATTGA TTGAAGAGAA AAAAGTGCAG GCCGCAAAAG TGGTAACACA TATTCTTGGG CTGAATGCCG CGGGCGAAAC CACGCTTGAA TTGCCTGCCG TCGGCGGCGG CAAAAAGCTG GTGTATACCG GGAAATACCT GCCGCTGACG TCACTCACGC AGATTCAGGA TCAAGCACTG GCGGCGATTC TGGCGCGTCA TCAGGGGATC TGGTCCGGTG AGGCGGAGCA ATATCTGCTC ACTCATGCAG AGGCAATTTC CCATGATTAA
|
Protein sequence | MKTTALRLYG KRDLRLETFD LPEMQEDEIL ATVVTDSLCL SSWKEANLGE NHKKVPDDVA TNPIIIGHEF CGDILAVGKK WQHKFQPGQR YVIQANLQLP DRPDCPGYSF PWVGGEATHV VIPNEVMEQD CLLAYDGETY FEGSLVEPLS CVIGAFNANY HLQEGSYNHT MGIRPQGRML ILGGTGPMGL LAIDYALHGP VNPSLLVITD TDNDKLSYAR KHYPSEPQTL IHYLNAADAA FDTLMALSGG HGFDDIFVFV PNEGLVTLAS SLLATDGCLN FFAGPQDKHF SAPINFYDVH YAFTHYVGTS GGNTDDMRAA VKLIEEKKVQ AAKVVTHILG LNAAGETTLE LPAVGGGKKL VYTGKYLPLT SLTQIQDQAL AAILARHQGI WSGEAEQYLL THAEAISHD
|
| |