Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4120 |
Symbol | tdh |
ID | 5590371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 4109452 |
End bp | 4110477 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640927739 |
Product | L-threonine 3-dehydrogenase |
Protein accession | YP_001465099 |
Protein GI | 157157199 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR00692] L-threonine 3-dehydrogenase [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.357417 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCGT TATCCAAACT GAAAGCGGAA GAGGGCATCT GGATGACCGA CGTTCCTGTA CCGGAACTCG GGCATAACGA TCTGCTGATT AAAATCCGTA AAACAGCCAT CTGCGGGACT GACGTTCACA TCTATAACTG GGATGAGTGG TCGCAAAAAA CCATCCCGGT GCCGATGGTC GTGGGCCATG AATATGTCGG TGAAGTGGTA GGTATTGGTC AGGAAGTGAA GGGCTTCATG ATTGGCGATC GCGTTTCTGG CGAAGGCCAT ATCACCTGCG GTCACTGTCG CAACTGTCGT GGCGGTCGTA CCCATCTGTG CCGCAACACG ATCGGCGTCG GCGTTAACCG CCCGGGCTGC TTCGCCGAAT ATCTGGTGAT TCCGGCGTTC AACGCCTTCA AAATTCCCGA CAATATCTCC GACGACCTGG CTTCCATTTT TGATCCCTTC GGTAACGCCG TGCATACCGC GCTGTCGTTC GATCTGGTTG GCGAGGATGT GCTGGTCTCT GGTGCAGGCC CGATTGGTAT TATGGCAGCG GCGGTGGCGA AACACGTTGG TGCACGCAAT GTGGTGATCA CTGATGTTAA CGAATATCGT CTCGAACTGG CGCGCAAAAT GGGTATCACC CGTGCGGTCA ACGTCGCTAA AGAAAACCTT AATGATGTGA TGGCAGAACT GGGCATGACC GAAGGCTTTG ATGTCGGTCT GGAAATGCCC GGTGCGCCGC CAGCGTTTCG TACCATGCTC GACACCATGA ACCACGGCGG TCGCATTGCG ATGCTGGGTA TTCCACCGTC CGATATGTCT ATCGACTGGA CCAAAGTAAT CTTTAAAGGC TTGTTCATTA AAGGTATTTA CGGTCGTGAG ATGTTTGAAA CCTGGTACAA GATGGCGGCG CTGATTCAGT CAGGTCTGGA TCTCTCGCCG ATCATTACCC ATCGTTTCTC TATCGATGAT TTCCAGAAGG GCTTTGACGC TATGCGTTCG GGCCAGTCCG GGAAAGTAAT TCTGAGTTGG GATTAA
|
Protein sequence | MKALSKLKAE EGIWMTDVPV PELGHNDLLI KIRKTAICGT DVHIYNWDEW SQKTIPVPMV VGHEYVGEVV GIGQEVKGFM IGDRVSGEGH ITCGHCRNCR GGRTHLCRNT IGVGVNRPGC FAEYLVIPAF NAFKIPDNIS DDLASIFDPF GNAVHTALSF DLVGEDVLVS GAGPIGIMAA AVAKHVGARN VVITDVNEYR LELARKMGIT RAVNVAKENL NDVMAELGMT EGFDVGLEMP GAPPAFRTML DTMNHGGRIA MLGIPPSDMS IDWTKVIFKG LFIKGIYGRE MFETWYKMAA LIQSGLDLSP IITHRFSIDD FQKGFDAMRS GQSGKVILSW D
|
| |