Gene Information |
Locus tag | ECH74115_2500 |
Symbol | |
ID | 6968470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2368152 |
End bp | 2369228 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643386369 |
Product | oxidoreductase, zinc-binding dehydrogenase family |
Protein accession | YP_002270851 |
Protein GI | 209399082 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
|
|
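The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions on the replicon, and that Protein Length excludes the terminal stop codon. Both assumptions are consistent with the numbers in this record but are not stated by it. The function names are hypothetical.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above (locus tag ECH74115_2500).
gene_len = feature_length(2368152, 2369228)
print(gene_len)                           # 1077
print(expected_protein_length(gene_len))  # 358
```

Under these assumptions the computed values agree with the Gene Length (1077 bp) and Protein Length (358 aa) fields.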
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0119751 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCAC TGGCTCGGTT TGGCAAAGCC TTTGGCGGCT ACAAGATGAT CGATGTGCCA CAACCCATTT GTGGCCCGGA AGATGTAGTG ATTGAAATTA AAGCCGCGGC AATCTGCGGC GCAGACATGA AGCACTACAA TGTCGATAGC GGTTCTGATG AGTTTAACTC TATCCGCGGC CATGAGTTCG CAGGTTGTAT TGCGCAGGTT GGTGAAAAAG TCAAAGACTG GAAAGTGGGG CAACGCGTCG TGTCGGATAA CAGCGGTCAC GTCTGCGGCG TTTGTCCGGC CTGTGAACAA GGTGATTTTC TGTGTTGTAC AGAAAAAGTA AACCTTGGTC TGGATAACAA TACCTGGGGC GGTGGTTTTT CCAAATATTG TCTGGTTCCT GGTGAAATTC TCAAAATTCA TCGTCATGCG TTGTGGGAAA TCCCTGATGG TGTTGATTAT GAGGACGCAG CCGTACTTGA CCCTATCTGT AATGCCTACA AATCCATCGC GCAGCAATCG AAATTCCTCC CTGGTCAGGA TGTGGTCGTC ATCGGCACTG GCCCACTCGG GCTGTTCTCC GTACAAATGG CGCGAATTAT GGGGGCGGTA AATATCGTCG TCGTTGGTCT GCAAGAAGAT GTGGCGGTCC GCTTCCCGGT TGCAAAAGAA CTGGGTGCGA CGGCAGTAGT AAATGGTTCT ACCGAAGATG TGGTGGCGCG CTGCCAGCAA ATTTGTGGCA AAGACAATCT GGGACTGGTG ATTGAATGCT CCGGTGCCAA TATCGCACTG AAACAAGCCA TCGAAATGCT CCGTCCGAAC GGGGAAGTGG TACGCGTTGG AATGGGCTTC AAACCTCTTG ATTTCTCGAT TAATGACATT ACCGCCTGGA ACAAAAGCAT CATTGGGCAT ATGGCCTATG ACTCCACCTC ATGGCGTAAC GCTATCAGGC TATTAGCCAG CGGCGCTATC AAAGTCAAAC CGATGATCAC GCATCGTATC GGCCTGTCGC AATGGCGCGA AGGGTTTGAT GCGATGGTCG ATAAAACCGC AATCAAAGTG ATCATGACTT ACGACTTTGA TGAATAA
|
Protein sequence | MKALARFGKA FGGYKMIDVP QPICGPEDVV IEIKAAAICG ADMKHYNVDS GSDEFNSIRG HEFAGCIAQV GEKVKDWKVG QRVVSDNSGH VCGVCPACEQ GDFLCCTEKV NLGLDNNTWG GGFSKYCLVP GEILKIHRHA LWEIPDGVDY EDAAVLDPIC NAYKSIAQQS KFLPGQDVVV IGTGPLGLFS VQMARIMGAV NIVVVGLQED VAVRFPVAKE LGATAVVNGS TEDVVARCQQ ICGKDNLGLV IECSGANIAL KQAIEMLRPN GEVVRVGMGF KPLDFSINDI TAWNKSIIGH MAYDSTSWRN AIRLLASGAI KVKPMITHRI GLSQWREGFD AMVDKTAIKV IMTYDFDE
|
| |
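The relationship between the two sequences above can be sketched in code. The gene sequence as listed starts with ATG and ends with TAA, which suggests it is already given in coding orientation even though the Strand field is "-"; that reading is an inference, not something the record states. The example below is a minimal, dependency-free sketch: the codon table is the standard amino acid assignment (which translation table 11 shares for elongation codons), and the helper names `translate` and `reverse_complement` are hypothetical. Only the first and last few codons of the gene are used, to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Standard amino acid assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Reverse complement, e.g. to get a minus-strand gene from forward-strand DNA."""
    return dna.replace(" ", "").upper().translate(_COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate coding-orientation DNA codon by codon; spaces are ignored."""
    seq = dna.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 12 nt of the gene sequence listed above.
print(translate("ATGAAAGCAC TGGCTCGGTT TGGCAAAGCC"))  # MKALARFGKA
print(translate("TTTGATGAATAA"))                      # FDE*
```

The translated fragments match the start (MKALARFGKA) and end (FDE) of the listed protein sequence, with the final TAA read as the stop codon.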