Gene ECH74115_0695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0695 
Symbol 
ID6970855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp726717 
End bp727955 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content50% 
IMG OID643384730 
Productoxidoreductase, zinc-binding dehydrogenase family 
Protein accessionYP_002269243 
Protein GI209396266 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCAT TGACTTATCA CGGCCCACAT CACGTTCAGG TAGAAAATGT TCCCGATCCG 
GGCATTGAAC AGGCAGATGA TATTATTCTG CGTATAACGG CAACGGCGAT CTGTGGCTCT
GACCTCCATC TTTATCGAGG CAAAATACCC CAGGTTAAAC ATGGCGATAT TTTTGGTCAT
GAATTTATGG GGGAAGTAGT TGAAACCGGA AAGGATGTAA AAAATTTGCA AAAAGGCGAC
CGGGTGGTAA TTCCGTTCGT CATTGCTTGT GGCGACTGTT TTTTCTGTCG ACTGCAGCAA
TATGCTGCCT GCGAAAATAC CAATGCGGGT AAAGGCGCTG CGCTCAATAA AAAACAGATA
CCAGCTCCCG CGGCATTGTT TGGTTATAGT CACCTGTATG GTGGCGTTCC TGGTGGACAG
GCGGAGTATG TCCGTGTCCC TAAAGGGAAT GTGGGGCCGT TTAAAGTACC GCCTTTGCTT
TCAGATGATA AAGCCCTTTT CCTTTCTGAT ATTCTGCCAA CGGCATGGCA GGCAGCAAAA
AATGCGCAGA TCCAACAAGG TTCAAGCGTT GCTGTCTATG GTGCCGGTCC TGTGGGATTG
TTGACAATCG CCTGTGCACG GTTGCTCGGT GCGGAACAGA TTTTTGTTGT TGATCATCAT
CCCTACCGCT TGCGTTTCGC CGCTGACCGC TACGGCGCGA TCCCGATTAA TTTCGATGAA
GATAGCGATC CGGCACAGTC AATTATTGAA CAAACGGCAG GTCACCGGGG CGTGGATGCA
GTAATAGACG CCGTCGGTTT TGAAGCGAAA GGCAGCACCA CGGAAACGGT GCTGACTAAC
CTGAAACTGG AGGGCAGCAG CGGTAAAGCG TTGCGTCAGT GTATTGCGGC GGTCAGGCGT
GGCGGCATTG TTAGCGTACC GGGCGTCTAC GCTGGATTTA TTCACGGTTT CCTGTTTGGC
GACGCCTTTG ATAAAGGGTT GACGTTTAAA ATGGGACAGA CCCACGTTCA CGCATGGCTG
GGAGAATTAT TACCGTTAAT TGAGAAAGGA TTACTGAAAC CAGAAGAAAT TGTTACCCAC
TATATGCCGT TTGAAGAGGC CGCCCGGGGA TATGAGATCT TTGAAAAGCG TGAAGAGGAG
TGCCGTAAGG TGATTCTGGT ACCCGGTGCA CAAAGCGCAG AGGCGGCGCA GAAGGCGGTT
TCAGGTCTGG TGAATGCGAT GCCGGGGGGA ACAATATGA
 
Protein sequence
MKALTYHGPH HVQVENVPDP GIEQADDIIL RITATAICGS DLHLYRGKIP QVKHGDIFGH 
EFMGEVVETG KDVKNLQKGD RVVIPFVIAC GDCFFCRLQQ YAACENTNAG KGAALNKKQI
PAPAALFGYS HLYGGVPGGQ AEYVRVPKGN VGPFKVPPLL SDDKALFLSD ILPTAWQAAK
NAQIQQGSSV AVYGAGPVGL LTIACARLLG AEQIFVVDHH PYRLRFAADR YGAIPINFDE
DSDPAQSIIE QTAGHRGVDA VIDAVGFEAK GSTTETVLTN LKLEGSSGKA LRQCIAAVRR
GGIVSVPGVY AGFIHGFLFG DAFDKGLTFK MGQTHVHAWL GELLPLIEKG LLKPEEIVTH
YMPFEEAARG YEIFEKREEE CRKVILVPGA QSAEAAQKAV SGLVNAMPGG TI