Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4044 |
Symbol | tdh |
ID | 6270320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3776329 |
End bp | 3777354 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641727884 |
Product | L-threonine 3-dehydrogenase |
Protein accession | YP_001882316 |
Protein GI | 187731381 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR00692] L-threonine 3-dehydrogenase [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.100127 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCGT TATCCAAACT GAAAGCGGAA GAGGGCATCT GGATGACCGA CGTCCCTGTA CCGGAACTCG GGCATAACGA TCTGCTGATT AAAATCCGTA AAACAGCCAT CTGCGGGACT GACGTTCACA TCTATAACTG GGATGAGTGG TCGCAAAAAA CCATCCCGGT GCCGATGGTC GTAGGCCATG AATATGTCGG TGAAGTGGTA GGTATTGGTC AGGAAGTGAA AGGCTTCAAA ATCGGCGATC GCGTTTCTGG CGAAGGCCAT ATCACCTGCG GTCACTGTCG CAACTGTCGT GGCGGTCGTA CCCATCTGTG CCGCAACACG ATCGGCGTCG GCGTTAACCG TCCGGGCTGC TTCGCCGAAT ATCTGGTGAT CCCGGCGTTC AACGCCTTCA AAATCCCAGA CAATATCTCC GACGACCTGG CTTCCATTTT TGATCCCTTC GGTAACGCCG TGCATACCGC GCTGTCGTTC GATCTGGTTG GCGAGGATGT GCTGGTCTCT GGTGCAGGTC CGATAGGTAT TATGGCTGCG GCGGTGGCGA AACACGTTGG TGCACGCAAT GTGGTGATCA CTGATGTTAA CGAATACCGC CTTGAGCTGG CGCGCAAAAT GGGTATCACC CGTGCGGTTA ACGTCGCGAA AGAAAACCTT AATGATGTGA TGGCTGAACT GGGCATGACC GAAGGCTTTG ATGTCGGTCT GGAAATGTCC GGTGCGCCGC CAGCGTTTCG TACCATGCTT GACACCATGA ACCACGGCGG CCGTATTGCG ATGCTGGGTA TTCCGCCGTC TGATATGTCT ATCGACTGGA CCAAAGTGAT CTTTAAAGGC TTGTTCATTA AAGGTATTTA CGGTCGTGAG ATGTTTGAAA CCTGGTACAA GATGGCGGCG CTGATTCAGT CTGGCCTCGA TCTCTCGCCG ATCATTACCC ATCGTTTCTC TATCGATGAT TTCCAGAAGG GCTTTGACGC TATGCGTTCG GGCCAGTCCG GGAAAGTAAT TCTGAGCTGG GATTAA
|
Protein sequence | MKALSKLKAE EGIWMTDVPV PELGHNDLLI KIRKTAICGT DVHIYNWDEW SQKTIPVPMV VGHEYVGEVV GIGQEVKGFK IGDRVSGEGH ITCGHCRNCR GGRTHLCRNT IGVGVNRPGC FAEYLVIPAF NAFKIPDNIS DDLASIFDPF GNAVHTALSF DLVGEDVLVS GAGPIGIMAA AVAKHVGARN VVITDVNEYR LELARKMGIT RAVNVAKENL NDVMAELGMT EGFDVGLEMS GAPPAFRTML DTMNHGGRIA MLGIPPSDMS IDWTKVIFKG LFIKGIYGRE MFETWYKMAA LIQSGLDLSP IITHRFSIDD FQKGFDAMRS GQSGKVILSW D
|
| |