Gene Information |
Locus tag | B21_03425 |
Symbol | tdh |
ID | 8112598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 3657597 |
End bp | 3658622 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644849598 |
Product | hypothetical protein |
Protein accession | YP_003001171 |
Protein GI | 251786867 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR00692] L-threonine 3-dehydrogenase; [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
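The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical and do not come from the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 3657597
end_bp = 3658622
gene_length_bp = 1026
protein_length_aa = 341


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the ones reported in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both comparisons hold for this record: the span covers 1026 positions, and 1026 / 3 gives 342 codons, one of which is the stop codon.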
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0415585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCGT TATCCAAACT GAAAGCGGAA GAGGGCATCT GGATGACCGA CGTTCCTGTA CCGGAACTCG GGCATAACGA TCTGCTGATT AAAATCCGTA AAACAGCCAT CTGCGGGACT GACGTTCACA TCTATAACTG GGATGAGTGG TCGCAAAAAA CCATCCCGGT GCCGATGGTC GTGGGCCATG AATATGTCGG TGAAGTGGTA GGTATTGGTC AGGAAGTGAA GGGCTTCAAG ATTGGCGATC GCGTTTCTGG CGAAGGCCAT ATCACCTGCG GTCACTGTCG CAACTGTCGT GGCGGTCGTA CCCATCTGTG CCGCAACACG ATCGGCGTCG GCGTTAACCG CCCGGGCTGC TTCGCCGAAT ATCTGGTGAT TCCGGCGTTC AACGCCTTCA AAATTCCCGA CAATATCTCC GACGACCTGG CTTCCATTTT TGATCCCTTC GGTAATGCCG TGCATACCGC GCTGTCGTTC GATCTGGTTG GCGAGGATGT GCTGGTTTCT GGTGCAGGCC CGATTGGTAT TATGGCAGCG GCGGTGGCGA AACACGTTGG TGCACGCAAT GTGGTGATCA CTGATGTTAA CGAATACCGC CTTGAGCTGG CGCGTAAAAT GGGTATCACC CGTGCGGTTA ACGTCGCCAA AGAAAATCTC AATGACGTGA TGGCGGAGTT AGGCATGACC GAAGGTTTTG ATGTCGGTCT GGAAATGTCC GGTGCGCCGC CAGCGTTTCG TACCATGCTT GACACCATGA ACCACGGCGG CCGTATTGCG ATGCTGGGTA TTCCGCCGTC TGATATGTCT ATCGACTGGA CCAAAGTGAT CTTTAAAGGC TTGTTCATTA AAGGTATTTA CGGTCGTGAG ATGTTTGAAA CCTGGTACAA GATGGCGGCG CTGATTCAGT CTGGCCTCGA TCTTTCGCCG ATCATTACCC ATCGTTTCTC TATCGATGAT TTCCAGAAGG GCTTTGACGC TATGCGTTCG GGCCAGTCCG GGAAAGTTAT TCTGAGCTGG GATTAA
|
Protein sequence | MKALSKLKAE EGIWMTDVPV PELGHNDLLI KIRKTAICGT DVHIYNWDEW SQKTIPVPMV VGHEYVGEVV GIGQEVKGFK IGDRVSGEGH ITCGHCRNCR GGRTHLCRNT IGVGVNRPGC FAEYLVIPAF NAFKIPDNIS DDLASIFDPF GNAVHTALSF DLVGEDVLVS GAGPIGIMAA AVAKHVGARN VVITDVNEYR LELARKMGIT RAVNVAKENL NDVMAELGMT EGFDVGLEMS GAPPAFRTML DTMNHGGRIA MLGIPPSDMS IDWTKVIFKG LFIKGIYGRE MFETWYKMAA LIQSGLDLSP IITHRFSIDD FQKGFDAMRS GQSGKVILSW D
|
| |
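The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own method, of stripping that grouping and computing a GC fraction. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the demo input is only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(grouped):
    """Remove the whitespace that splits the displayed sequence into blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence above, used only as a short demo input.
demo = clean_sequence("ATGAAAGCGT TATCCAAACT")
print(len(demo), gc_fraction(demo))
```

Pasting the complete Gene sequence value into `clean_sequence` gives a string whose length and GC fraction can be compared with the Gene Length and GC content fields of the record.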