Gene EcE24377A_3266 details


Gene Information

Locus tag: EcE24377A_3266
Symbol: sorE
ID: 5589213
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 3283002
End bp: 3284279
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 51%
IMG OID: 640926903
Product: L-sorbose 1-phosphate reductase
Protein accession: YP_001464275
Protein GI: 157158379
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
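
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
def coordinate_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_residues(gene_length_bp: int) -> int:
    """Number of amino acids, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 3283002, 3284279

gene_length = coordinate_span(start_bp, end_bp)
print(gene_length)                   # 1278
print(coding_residues(gene_length))  # 425
```

Both results agree with the Gene Length (1278 bp) and Protein Length (425 aa) fields of the record.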


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAACCA AAGTTGCTGC TATTTATGGC AAGCGGGATG TCCGTCTGCG CGAATTTGAA 
CTGCCAGAAA TTACCGATAA TGAACTGTTA GTGAGTGTAA TTTCTGACAG CGTCTGTTTA
TCGACCTGGA AAGCGGCGTT ACTCGGTAGT GAACATAAAC GCGTACCCGA CGATTTAGAA
AATCATCCGG TCATTACCGG GCATGAATGT GCCGGGGTTA TTGTCGAAGT GGGTAAAAAT
CTCACTGGCA AATATAAAAA AGGCCAGCGT TTTGTATTGC AACCGGCGAT GGGGTTACCA
AGCGGATATT CAGCAGGCTA CAGCTACGAA TATTTTGGCG GTAACGCCAC CTATATGATT
ATTCCCGAAA TCGCCATTAA TTTGGGCTGC GTATTACCGT ATCACGGCTC TTATTTTGCT
GCGGCGTCGC TGGCAGAGCC TATGTGCTGC ATTATTGGTG CTTATCATGC CAATTATCAC
ACCACGCAAT ATGTTTATGA GCATCGCATG GGCGTCAAAC CTGGCGGCAA TATTGCACTG
CTGGCGTGTG CGGGTCCGAT GGGCATTGGC GCTATCGATT ACGCCATTAA CGGTGGCATA
CAACCGTCAC GGGTGGTGGT GGTCGATATC GACGACAAAC GCCTGGCGCA GGTGCAGAAG
CTGCTGCCGG TGGATCTGGC GGCCAGTAAA GGCATTGAGC TGGTGTATGT GAATACCAAA
GGGATGAGCG ATCCGGTCCA GACGCTGCGG GCGCTGACAG GAGATGTCGG GTTCGATGAC
ATTTTTGTTT ATGCGGCGGT GCCTGCTGTC GTTGAGATGG CTGACGAATT ACTGGCGGAA
GATGGCTGTC TGAACTTCTT TGCCGGGCCG ACGGATAAAA ACTTCAAAGT GCCGTTTAAT
TTCTACAACG TCCATTACAA CAGCACGCAC GTAGTCGGTA CATCCGGCGG TTCAACGGAC
GACATGAAAG AGGCGATTGC GTTAAGCGCC ACGGGGCAGT TGCAGCCGTC CTTTATGGTC
ACGCATATCG GTGGGCTGGA TGCGGTGCCA GATACCGTGC TCAATCTGCC GGATATCCCT
GGCGGTAAAA AACTCATTTA TAACGGCGTG ACCATGCCGC TCACTGCCAT TGCCGATTTT
GCCGAAAAAG GCAAAACCGA TCCGCTGTTT AAAGAGTTGG CGCGGCTGGT TGAGGAAACG
CACGGCATCT GGAATGAACA GGCCGAGAAA TATCTGCTGG CACAATTTGG CGTTGATATC
GGGGAGGCCG CGCAATGA
 
Protein sequence
MKTKVAAIYG KRDVRLREFE LPEITDNELL VSVISDSVCL STWKAALLGS EHKRVPDDLE 
NHPVITGHEC AGVIVEVGKN LTGKYKKGQR FVLQPAMGLP SGYSAGYSYE YFGGNATYMI
IPEIAINLGC VLPYHGSYFA AASLAEPMCC IIGAYHANYH TTQYVYEHRM GVKPGGNIAL
LACAGPMGIG AIDYAINGGI QPSRVVVVDI DDKRLAQVQK LLPVDLAASK GIELVYVNTK
GMSDPVQTLR ALTGDVGFDD IFVYAAVPAV VEMADELLAE DGCLNFFAGP TDKNFKVPFN
FYNVHYNSTH VVGTSGGSTD DMKEAIALSA TGQLQPSFMV THIGGLDAVP DTVLNLPDIP
GGKKLIYNGV TMPLTAIADF AEKGKTDPLF KELARLVEET HGIWNEQAEK YLLAQFGVDI
GEAAQ
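
The relationship between the gene sequence and the protein sequence can be checked by translating short fragments of the gene. The sketch below is a minimal illustration, not the database's own pipeline: the helper names are hypothetical, the codon table is built from the standard amino-acid assignment (which translation table 11 shares for elongation codons), and alternative start codons are not handled. It strips the 10-base grouping used in the listing before translating.

```python
# Standard codon assignment, bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and final 18 bases of the gene sequence above.
first_block = "ATGAAAACCA AAGTTGCTGC TATTTATGGC"
last_block = "GGGGAGGCCG CGCAATGA"

print(translate(first_block))  # MKTKVAAIYG
print(translate(last_block))   # GEAAQ
```

The two outputs match the first ten and last five residues of the protein sequence listed above. The final block is in frame because the preceding 21 lines of the listing hold 60 bases each.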