Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4989 |
Symbol | tdh |
ID | 6967787 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4640257 |
End bp | 4641282 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643388670 |
Product | L-threonine 3-dehydrogenase |
Protein accession | YP_002273097 |
Protein GI | 209400006 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR00692] L-threonine 3-dehydrogenase [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.014 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCGT TATCCAAACT GAAAGCGGAA GAGGGCATCT GGATGACCGA CGTTCCTGTA CCGGAACTCG GGCATAACGA TCTGCTGATT AAAATCCGTA AAACAGCCAT CTGCGGGACT GACGTTCACA TCTATAACTG GGATGAGTGG TCGCAAAAAA CCATCCCGGT GCCGATGGTC GTAGGCCATG AATATGTCGG TGAAGTGGTA GGTATTGGTC AGGAAGTGAA AGGCTTCAAA ATCGGCGATC GCGTTTCTGG CGAAGGCCAT ATCACCTGCG GTCACTGTCG CAACTGTCGT GGCGGTCGTA CCCATCTGTG CCGCAACACC ATTGGTGTCG GTGTTAACCG TCCGGGCTGC TTCGCCGAAT ATCTGGTGAT TCCGGCGTTC AACGCTTTCA AAATCCCCGA CAATATCTCC GACGACCTGG CTTCCATTTT TGATCCCTTC GGTAACGCCG TGCATACCGC GCTGTCGTTC GATCTGGTGG GCGAAGATGT GCTGGTTTCT GGTGCAGGTC CGATAGGTAT TATGGCTGCG GCGGTGGCGA AACACGTTGG TGCACGCAAT GTGGTGATCA CCGATGTTAA CGAATACCGC CTTGAGCTGG CGCGCAAAAT GGGTATCACC CGTGCGGTTA ACGTCGCGAA AGAAAACCTT AATGATGTGA TGGCTGAACT GGGCATGACC GAAGGCTTTG ATGTCGGTCT GGAAATGTCC GGCGCGCCGC CAGCGTTTCG TACCATGCTC GATACCATGA ACCACGGCGG TCGTATTGCG ATGCTGGGTA TTCCACCGTC TGACATGTCT ATCGACTGGA CCAAAGTGAT CTTTAAAGGC TTGTTCATTA AAGGCATTTA CGGTCGTGAG ATGTTTGAAA CCTGGTACAA GATGGCGGCG CTGATTCAGT CTGGCCTCGA TCTCTCGCCG ATCATTACCC ATCGTTTCTC TATCGATGAT TTCCAGAAGG GCTTTGACGC TATGTGTTCG GGCCAGTCCG GGAAAGTTAT TCTGAGCTGG GATTAA
|
Protein sequence | MKALSKLKAE EGIWMTDVPV PELGHNDLLI KIRKTAICGT DVHIYNWDEW SQKTIPVPMV VGHEYVGEVV GIGQEVKGFK IGDRVSGEGH ITCGHCRNCR GGRTHLCRNT IGVGVNRPGC FAEYLVIPAF NAFKIPDNIS DDLASIFDPF GNAVHTALSF DLVGEDVLVS GAGPIGIMAA AVAKHVGARN VVITDVNEYR LELARKMGIT RAVNVAKENL NDVMAELGMT EGFDVGLEMS GAPPAFRTML DTMNHGGRIA MLGIPPSDMS IDWTKVIFKG LFIKGIYGRE MFETWYKMAA LIQSGLDLSP IITHRFSIDD FQKGFDAMCS GQSGKVILSW D
|
| |