Gene Information |
Locus tag | EcE24377A_3266 |
Symbol | sorE |
ID | 5589213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 3283002 |
End bp | 3284279 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640926903 |
Product | L-sorbose 1-phosphate reductase |
Protein accession | YP_001464275 |
Protein GI | 157158379 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
|
|
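The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions, that the gene is unspliced (as the record states), and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(3283002, 3284279)
print(length)                  # 1278
print(protein_length(length))  # 425
```

Both results agree with the Gene Length (1278 bp) and Protein Length (425 aa) rows.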
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACCA AAGTTGCTGC TATTTATGGC AAGCGGGATG TCCGTCTGCG CGAATTTGAA CTGCCAGAAA TTACCGATAA TGAACTGTTA GTGAGTGTAA TTTCTGACAG CGTCTGTTTA TCGACCTGGA AAGCGGCGTT ACTCGGTAGT GAACATAAAC GCGTACCCGA CGATTTAGAA AATCATCCGG TCATTACCGG GCATGAATGT GCCGGGGTTA TTGTCGAAGT GGGTAAAAAT CTCACTGGCA AATATAAAAA AGGCCAGCGT TTTGTATTGC AACCGGCGAT GGGGTTACCA AGCGGATATT CAGCAGGCTA CAGCTACGAA TATTTTGGCG GTAACGCCAC CTATATGATT ATTCCCGAAA TCGCCATTAA TTTGGGCTGC GTATTACCGT ATCACGGCTC TTATTTTGCT GCGGCGTCGC TGGCAGAGCC TATGTGCTGC ATTATTGGTG CTTATCATGC CAATTATCAC ACCACGCAAT ATGTTTATGA GCATCGCATG GGCGTCAAAC CTGGCGGCAA TATTGCACTG CTGGCGTGTG CGGGTCCGAT GGGCATTGGC GCTATCGATT ACGCCATTAA CGGTGGCATA CAACCGTCAC GGGTGGTGGT GGTCGATATC GACGACAAAC GCCTGGCGCA GGTGCAGAAG CTGCTGCCGG TGGATCTGGC GGCCAGTAAA GGCATTGAGC TGGTGTATGT GAATACCAAA GGGATGAGCG ATCCGGTCCA GACGCTGCGG GCGCTGACAG GAGATGTCGG GTTCGATGAC ATTTTTGTTT ATGCGGCGGT GCCTGCTGTC GTTGAGATGG CTGACGAATT ACTGGCGGAA GATGGCTGTC TGAACTTCTT TGCCGGGCCG ACGGATAAAA ACTTCAAAGT GCCGTTTAAT TTCTACAACG TCCATTACAA CAGCACGCAC GTAGTCGGTA CATCCGGCGG TTCAACGGAC GACATGAAAG AGGCGATTGC GTTAAGCGCC ACGGGGCAGT TGCAGCCGTC CTTTATGGTC ACGCATATCG GTGGGCTGGA TGCGGTGCCA GATACCGTGC TCAATCTGCC GGATATCCCT GGCGGTAAAA AACTCATTTA TAACGGCGTG ACCATGCCGC TCACTGCCAT TGCCGATTTT GCCGAAAAAG GCAAAACCGA TCCGCTGTTT AAAGAGTTGG CGCGGCTGGT TGAGGAAACG CACGGCATCT GGAATGAACA GGCCGAGAAA TATCTGCTGG CACAATTTGG CGTTGATATC GGGGAGGCCG CGCAATGA
|
Protein sequence | MKTKVAAIYG KRDVRLREFE LPEITDNELL VSVISDSVCL STWKAALLGS EHKRVPDDLE NHPVITGHEC AGVIVEVGKN LTGKYKKGQR FVLQPAMGLP SGYSAGYSYE YFGGNATYMI IPEIAINLGC VLPYHGSYFA AASLAEPMCC IIGAYHANYH TTQYVYEHRM GVKPGGNIAL LACAGPMGIG AIDYAINGGI QPSRVVVVDI DDKRLAQVQK LLPVDLAASK GIELVYVNTK GMSDPVQTLR ALTGDVGFDD IFVYAAVPAV VEMADELLAE DGCLNFFAGP TDKNFKVPFN FYNVHYNSTH VVGTSGGSTD DMKEAIALSA TGQLQPSFMV THIGGLDAVP DTVLNLPDIP GGKKLIYNGV TMPLTAIADF AEKGKTDPLF KELARLVEET HGIWNEQAEK YLLAQFGVDI GEAAQ
|
| |
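The sequence fields can be checked against the GC content and Translation table rows in a similar way. The following is a minimal sketch, not the database's own method: it strips the 10-character block spacing, computes the G+C fraction, and translates with the codon assignments of NCBI translation table 11, which are the same as those of the standard code (table 11 differs only in its permitted start codons). All helper names are hypothetical.

```python
from itertools import product

# Codon assignments for translation table 11 (identical to the standard code),
# listed in the conventional T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons are not special-cased here; this gene
    starts with ATG, so that does not matter for this record.
    """
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGAAAACCA AAGTTGCTGC TATTTATGGC"
print(translate(prefix))  # MKTKVAAIYG
```

Passing the full Gene sequence value to `translate` and `gc_fraction` gives values to compare with the Protein sequence and GC content rows.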