Gene ECH74115_5492 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5492 
Symbol 
ID6970406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5142039 
End bp5143268 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content53% 
IMG OID643389138 
ProductL-sorbose 1-phosphate reductase 
Protein accessionYP_002273535 
Protein GI209399864 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.647203 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAA CAGCTCTGCG TCTTTATGGT AAACGTGATT TACGCCTGGA AACCTTTGAC 
CTTCCTGAAA TGCAGGAGGA TGAAATCCTC GCGACGGTGG TCACTGACAG CCTGTGCCTC
TCTTCCTGGA AAGAGGCCAA TCTGGGTGAA AACCATAAAA AAGTACCCGA CGATGTGGCG
ACCAACCCAA TCATCATCGG CCACGAGTTT TGCGGCGATA TTCTGGCCGT GGGTAAAAAG
TGGCAGCACA AATTCCAGCC GGGTCAGCGT TATGTGATTC AGGCCAACCT GCAACTCCCC
GACCGCCCGG ACTGCCCCGG CTACTCCTTC CCGTGGGTAG GCGGCGAGGC CACGCATGTG
GTTATTCCCA ACGAGGTCAT GGAACAAGAT TGCCTGCTGG CATACGACGG CGAAACCTAT
TTTGAAGGCT CGCTGGTTGA ACCGCTTTCC TGCGTGATTG GCGCGTTCAA CGCCAACTAT
CATCTTCAGG AAGGTAGTTA TAACCACACG ATGGGGATTC GCCCGCAAGG GCGCATGCTG
ATCCTCGGCG GCACCGGACC AATGGGACTG TTGGCGATTG ATTATGCGCT ACATGGACCC
GTTAACCCGT CGCTGCTCGT CATTACCGAT ACCGACAACG ATAAATTGAG TTATGCGCGC
AAGCACTATC CATCAGAACC GCAAACACTG ATTCATTATC TCAATGCCGC CGATGCAGCA
TTTGATACGC TAATGGCGCT GAGTGGCGGT CACGGCTTCG ATGATATTTT CGTCTTTGTG
CCTAATGAAG GACTGGTGAC TCTCGCCTCT TCCTTGCTGG CGACAGATGG TTGCCTGAAT
TTCTTCGCCG GACCGCAGGA TAAACATTTC AGCGCGCCAA TTAATTTCTA CGATGTGCAT
TATGCATTTA CCCACTACGT GGGCACGTCA GGCGGCAATA CCGACGACAT GCGCGCAGCG
GTCAAATTGA TTGAAGAGAA AAAAGTGCAG GCCGCAAAAG TGGTAACACA TATTCTTGGG
CTGAATGCCG CGGGCGAAAC CACGCTTGAA TTGCCTGCCG TCGGCGGCGG CAAAAAGCTG
GTGTATACCG GGAAATACCT GCCGCTGACG TCACTCACGC AGATTCAGGA TCAAGCACTG
GCGGCGATTC TGGCGCGTCA TCAGGGGATC TGGTCCGGTG AGGCGGAGCA ATATCTGCTC
ACTCATGCAG AGGCAATTTC CCATGATTAA
 
Protein sequence
MKTTALRLYG KRDLRLETFD LPEMQEDEIL ATVVTDSLCL SSWKEANLGE NHKKVPDDVA 
TNPIIIGHEF CGDILAVGKK WQHKFQPGQR YVIQANLQLP DRPDCPGYSF PWVGGEATHV
VIPNEVMEQD CLLAYDGETY FEGSLVEPLS CVIGAFNANY HLQEGSYNHT MGIRPQGRML
ILGGTGPMGL LAIDYALHGP VNPSLLVITD TDNDKLSYAR KHYPSEPQTL IHYLNAADAA
FDTLMALSGG HGFDDIFVFV PNEGLVTLAS SLLATDGCLN FFAGPQDKHF SAPINFYDVH
YAFTHYVGTS GGNTDDMRAA VKLIEEKKVQ AAKVVTHILG LNAAGETTLE LPAVGGGKKL
VYTGKYLPLT SLTQIQDQAL AAILARHQGI WSGEAEQYLL THAEAISHD