Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3953 |
Symbol | tdh |
ID | 6143740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4031537 |
End bp | 4032562 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641618779 |
Product | L-threonine 3-dehydrogenase |
Protein accession | YP_001745918 |
Protein GI | 170683839 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR00692] L-threonine 3-dehydrogenase [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0503034 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.824331 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCGT TATCCAAACT GAAAGCGGAA GAGGGCATCT GGATGACCGA CGTTCCTGTA CCAGAACTCG GGCATAACGA TCTGCTGATT AAAATCCGTA AAACAGCCAT CTGCGGGACT GACGTTCACA TCTATAACTG GGATGAGTGG TCGCAAAAAA CCATCCCGGT GCCGATGGTC GTGGGCCATG AATATGTCGG TGAAGTGGTA GGTATTGGTC AGGAAGTGAA AGGCTTCAAG ATTGGCGATC GCGTTTCTGG CGAAGGCCAT ATCACCTGCG GTCACTGTCG CAACTGTCGT GGCGGTCGTA CCCATCTGTG CCGCAACACG ATCGGCGTCG GCGTTAACCG TCCGGGCTGC TTTGCCGAGT ATCTGGTGAT TCCGGCGTTC AACGCTTTCA AAATCCCCGA CAATATCTCC GATGATCTGG CTTCCATTTT TGATCCCTTC GGTAACGCCG TGCATACCGC GCTGTCGTTT GATCTGGTGG GCGAGGATGT GCTGGTCTCT GGTGCAGGCC CGATTGGTAT TATGGCGGCG GCGGTGGCGA AACACGTTGG TGCACGTAAC GTGGTGATTA CTGATGTTAA CGAATACCGC CTTGAGCTGG CGCGTAAAAT GGGTATCACC CGTGCGGTTA ACGTCGCGAA AGAAAACCTT AATGATGTGA TGACAGAACT GGGCATGACC GAAGGCTTTG ATGTCGGTCT GGAAATGTCC GGTGCGCCGC CAGCGTTTCG TACCATGCTC GACACCATGA ACCACGGCGG TCGTATTGCG ATGCTGGGTA TTCCACCGTC CGATATGTCT ATCGACTGGA CCAAAGTAAT CTTTAAAGGC TTGTTCATTA AAGGTATTTA CGGTCGTGAG ATGTTTGAAA CCTGGTACAA GATGGCGGCG CTGATTCAGT CTGGCCTCGA TCTCTCGCCG ATCATTACCC ATCGTTTCTC TATCGATGAT TTCCAGAAGG GCTTTGACGC TATGCGTTCG GGCCAGTCCG GGAAAGTTAT TCTGAGCTGG GATTAA
|
Protein sequence | MKALSKLKAE EGIWMTDVPV PELGHNDLLI KIRKTAICGT DVHIYNWDEW SQKTIPVPMV VGHEYVGEVV GIGQEVKGFK IGDRVSGEGH ITCGHCRNCR GGRTHLCRNT IGVGVNRPGC FAEYLVIPAF NAFKIPDNIS DDLASIFDPF GNAVHTALSF DLVGEDVLVS GAGPIGIMAA AVAKHVGARN VVITDVNEYR LELARKMGIT RAVNVAKENL NDVMTELGMT EGFDVGLEMS GAPPAFRTML DTMNHGGRIA MLGIPPSDMS IDWTKVIFKG LFIKGIYGRE MFETWYKMAA LIQSGLDLSP IITHRFSIDD FQKGFDAMRS GQSGKVILSW D
|
| |