Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1861 |
Symbol | |
ID | 5590957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1878233 |
End bp | 1879309 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640921003 |
Product | zinc-binding dehydrogenase family oxidoreductase |
Protein accession | YP_001458555 |
Protein GI | 157161237 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.000288752 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCAC TGGCTCGGTT TGGCAAGGCC TTTGGCGGCT ACAAGATGAT TGATGTCCCA CAACCCATTT GTGGCCCGGA AGACGTAGTG ATTGAAATTA AAGCCGCGGC AATCTGCGGC GCAGACATGA AGCACTACAA TGTCGATAGC GGTTCTGATG AGTTTAATTC TATCCGCGGC CATGAGTTCG CAGGTTGTAT TGCGCAGGTT GGTGAAAAAG TCAAAGACTG GAAAGTGGGG CAACGCGTCG TGTCGGATAA CAGCGGTCAC GTTTGCGGCG TTTGTCCGGC CTGTGAACAG GGTGATTTTC TGTGTTGTAC AGAAAAGGTA AACCTTGGTC TGGATAACAA TACCTGGGGC GGTGGTTTTT CCAAATATTG TCTGGTTCCT GGTGAAATTC TCAAAATTCA TCGTCATGCG TTGTGGGAAA TCCCTGATGG TGTTGATTAT GAGGACGCAG CCGTACTTGA CCCTATCTGT AATGCCTATA AATCCATCGC GCAGCAATCG AAATTCCTTC CTGGTCAGGA TGTTGTCGTC ATCGGCACTG GCCCACTCGG GCTGTTTTCC GTACAAATGG CGCGGATTAT GGGGGCGGTA AATATCGTCG TCGTTGGTCT GCAAGAAGAT GTGGCGGTCC GCTTCCCGGT TGCAAAAGAA CTGGGGGCGA CGGCAGTAGT AAATGGTTCT ACCGAAGATG TGGTGGCGCG CTGCCAGCAA ATTTGTGGCA AAGATAATCT GGGACTGGTG ATTGAATGCT CCGGTGCCAA TATCGCACTG AAACAAGCCA TCGAAATGCT CCGCCCGAAT GGGGAAGTGG TACGCGTTGG AATGGGCTTC AAACCTCTTG ATTTCTCGAT TAATGACATT ACCGCCTGGA ATAAAAGCAT CATTGGGCAT ATGGCCTATG ACTCCACCTC ATGGCGTAAC GCTATCAGGC TATTAGCCAG CGGCGCTATC AAAGTCAAAC CGATGATCAC GCATCGTATC GGCCTGTCGC AATGGCGCGA AGGGTTTGAT GCGATGGTCG ATAAAACCGC AATCAAAGTG ATCATGACTT ACGACTTTGA TGAATAA
|
Protein sequence | MKALARFGKA FGGYKMIDVP QPICGPEDVV IEIKAAAICG ADMKHYNVDS GSDEFNSIRG HEFAGCIAQV GEKVKDWKVG QRVVSDNSGH VCGVCPACEQ GDFLCCTEKV NLGLDNNTWG GGFSKYCLVP GEILKIHRHA LWEIPDGVDY EDAAVLDPIC NAYKSIAQQS KFLPGQDVVV IGTGPLGLFS VQMARIMGAV NIVVVGLQED VAVRFPVAKE LGATAVVNGS TEDVVARCQQ ICGKDNLGLV IECSGANIAL KQAIEMLRPN GEVVRVGMGF KPLDFSINDI TAWNKSIIGH MAYDSTSWRN AIRLLASGAI KVKPMITHRI GLSQWREGFD AMVDKTAIKV IMTYDFDE
|
| |