Gene ECH74115_1958 details

Gene Information

Locus tag: ECH74115_1958
Symbol:
ID: 6967578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1849793
End bp: 1850845
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 55%
IMG OID: 643385884
Product: oxidoreductase, zinc-binding dehydrogenase family
Protein accession: YP_002270373
Protein GI: 209398887
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
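
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming Start bp and End bp are 1-based inclusive coordinates and that the last codon is a stop codon that is not counted in the protein length (neither convention is stated in the record). The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1849793
end_bp = 1850845

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1053
print(protein_length_aa)  # 350
```

Under these assumptions the computed values agree with the reported Gene Length (1053 bp) and Protein Length (350 aa).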


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.126859
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGT TAGTAGCCAC AGCACCGCGC GTTGCTGCGC TGGTTGAGTA TGAAGATCGG 
GCGATTTTAG CCAATGAAGT GAAGATCCGC GTGCGTTTCG GTGCACCGAA ACACGGAACG
GAAGTGGTCG ACTTCCGCGC CGCCAGCCCG TTTATCGATG AAGACTTTAA CGGCGAATGG
CAGATGTTCA CCCCGCGTCC GGCAGATGCG CCGCGCGGCA TTGAGTTTGG CAAATTCCAG
CTTGGCAACA TGGTGGTTGG CGACATTATC GAGTGCGGCA GCGACGTTAC CGACTACGCG
GTGGGCGACA GCGTATGCGG CTACGGCCCG CTCTCCGAGA CGATCATCAT TAACGCAGTG
AATAACTACA AGCTGCGCAA AATGCCGGAA GGCAGCTCCT GGAAAAACGC TGTCTGCTAC
GACCCGGCGC AGTTTGCCAT GAGTGGCGTT CGCGATGCCA ACGTACGCGT AGGGGATTTT
GTAGTGGTGG TAGGGCTTGG CGCGATCGGT CAAATTGCCA TCCAACTGGC TAAACGCGCT
GGCGCGTCGG TGGTAATTGG CGTCGATCCT ATTGCCCATC GTTGTGATAT TGCTCGTCGT
CACGGTGCGG ATTTCTGCCT TAATCCCATT GGCACTGACG TAGGCAAAGA GATCAAAACG
CTGACCGGCA AGCAGGGTGC CGATGTGATT ATCGAAACCA GCGGTTACGC CGACGCGCTG
CAGTCGGCGC TGCGCGGCCT GGCTTACGGT GGCACCATCT CCTATGTCGC GTTTGCTAAA
CCGTTTGCTG AAGGTTTTAA CCTCGGACGC GAAGCGCATT TCAATAACGC CAAAATTGTC
TTCTCTCGCG CGTGCAGTGA ACCGAACCCG GATTATCCGC GCTGGAGCCG TAAGCGTATT
GAAGAAACCT GCTGGAAACT GCTGATGAAC GGTTATCTCA ATTGCGAAGA TTTAATCGAC
CCGGTAGTGA CCTTTGCCAA CAGCCCGGAA AGCTACATGC AGTATGTCGA TCAGCATCCG
GAACAGAGCA TCAAAATGGG CGTCACGTTT TAA
 
Protein sequence
MKKLVATAPR VAALVEYEDR AILANEVKIR VRFGAPKHGT EVVDFRAASP FIDEDFNGEW 
QMFTPRPADA PRGIEFGKFQ LGNMVVGDII ECGSDVTDYA VGDSVCGYGP LSETIIINAV
NNYKLRKMPE GSSWKNAVCY DPAQFAMSGV RDANVRVGDF VVVVGLGAIG QIAIQLAKRA
GASVVIGVDP IAHRCDIARR HGADFCLNPI GTDVGKEIKT LTGKQGADVI IETSGYADAL
QSALRGLAYG GTISYVAFAK PFAEGFNLGR EAHFNNAKIV FSRACSEPNP DYPRWSRKRI
EETCWKLLMN GYLNCEDLID PVVTFANSPE SYMQYVDQHP EQSIKMGVTF