Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1958 |
Symbol | |
ID | 6967578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1849793 |
End bp | 1850845 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643385884 |
Product | oxidoreductase, zinc-binding dehydrogenase family |
Protein accession | YP_002270373 |
Protein GI | 209398887 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.126859 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGT TAGTAGCCAC AGCACCGCGC GTTGCTGCGC TGGTTGAGTA TGAAGATCGG GCGATTTTAG CCAATGAAGT GAAGATCCGC GTGCGTTTCG GTGCACCGAA ACACGGAACG GAAGTGGTCG ACTTCCGCGC CGCCAGCCCG TTTATCGATG AAGACTTTAA CGGCGAATGG CAGATGTTCA CCCCGCGTCC GGCAGATGCG CCGCGCGGCA TTGAGTTTGG CAAATTCCAG CTTGGCAACA TGGTGGTTGG CGACATTATC GAGTGCGGCA GCGACGTTAC CGACTACGCG GTGGGCGACA GCGTATGCGG CTACGGCCCG CTCTCCGAGA CGATCATCAT TAACGCAGTG AATAACTACA AGCTGCGCAA AATGCCGGAA GGCAGCTCCT GGAAAAACGC TGTCTGCTAC GACCCGGCGC AGTTTGCCAT GAGTGGCGTT CGCGATGCCA ACGTACGCGT AGGGGATTTT GTAGTGGTGG TAGGGCTTGG CGCGATCGGT CAAATTGCCA TCCAACTGGC TAAACGCGCT GGCGCGTCGG TGGTAATTGG CGTCGATCCT ATTGCCCATC GTTGTGATAT TGCTCGTCGT CACGGTGCGG ATTTCTGCCT TAATCCCATT GGCACTGACG TAGGCAAAGA GATCAAAACG CTGACCGGCA AGCAGGGTGC CGATGTGATT ATCGAAACCA GCGGTTACGC CGACGCGCTG CAGTCGGCGC TGCGCGGCCT GGCTTACGGT GGCACCATCT CCTATGTCGC GTTTGCTAAA CCGTTTGCTG AAGGTTTTAA CCTCGGACGC GAAGCGCATT TCAATAACGC CAAAATTGTC TTCTCTCGCG CGTGCAGTGA ACCGAACCCG GATTATCCGC GCTGGAGCCG TAAGCGTATT GAAGAAACCT GCTGGAAACT GCTGATGAAC GGTTATCTCA ATTGCGAAGA TTTAATCGAC CCGGTAGTGA CCTTTGCCAA CAGCCCGGAA AGCTACATGC AGTATGTCGA TCAGCATCCG GAACAGAGCA TCAAAATGGG CGTCACGTTT TAA
|
Protein sequence | MKKLVATAPR VAALVEYEDR AILANEVKIR VRFGAPKHGT EVVDFRAASP FIDEDFNGEW QMFTPRPADA PRGIEFGKFQ LGNMVVGDII ECGSDVTDYA VGDSVCGYGP LSETIIINAV NNYKLRKMPE GSSWKNAVCY DPAQFAMSGV RDANVRVGDF VVVVGLGAIG QIAIQLAKRA GASVVIGVDP IAHRCDIARR HGADFCLNPI GTDVGKEIKT LTGKQGADVI IETSGYADAL QSALRGLAYG GTISYVAFAK PFAEGFNLGR EAHFNNAKIV FSRACSEPNP DYPRWSRKRI EETCWKLLMN GYLNCEDLID PVVTFANSPE SYMQYVDQHP EQSIKMGVTF
|
| |