Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2697 |
Symbol | |
ID | 5592265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2712257 |
End bp | 2713351 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640921814 |
Product | zinc-binding dehydrogenase family oxidoreductase |
Protein accession | YP_001459338 |
Protein GI | 157162020 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 65 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATTA ATCGTATTGT TAATGAGGGA TTTATGAAAA CGATGCTGGC AGCTTATTTA CCAGGAAATT CGACCGTCGA TCTACGGGAA GTTGCGGTGC CGACACCAGG GATTAACCAG GTACTGATCA AAATGAAATC TTCCGGAATT TGCGGAAGCG ATGTCCACTA TATCTACCAC CAGCACCGCG CTACGGCAGC GGCACCCGAC AAACCGTTAT ACCAAGGATT TATCAACGGT CATGAACCGT GCGGGCAGAT TGTGGCGATG GGGCAAGGCT GCCGCCATTT TAAAGAGGGC GACCGCGTGC TGGTGTATCA CATTTCTGGC TGTGGTTTTT GCCCGAACTG CCGCCGCGGC TTTCCTATCT CTTGTACTGG CAAAGGAAAA GCGGCTTACG GCTGGCAGCG TGACGGCGGT CATGCCGAAT ATCTGCTGGC GGAAGAAAAA GATCTGATCC TCCTGCCGGA TGCGCTGAGC TACGAAGATG GTGCGTTTAT CAGTTGCGGC GTTGGCACGG CCTATGAAGG GATTTTGCGC GGCGAAGTTT CCGGCAGCGA TAACGTGCTG GTGGTCGGTC TGGGGCCGGT CGGCATGATG GCGATGATGC TGGCGAAAGG TCGCGGTGCA AAACGGATCA TCGGCGTTGA TATGCTGCCG GAACGTCTGG CGATGGCAAA ACAGTTAGGG GTGATGGATC ACGGCTATTT AGCAACCACC GAAGGTCTGC CGCAGATTAT CGCCGAACTC ACCCACGGTG GCGCGGATGT TGCGCTCGAT TGTTCCGGTA ATGCCGCAGG TCGCTTGCTG GCACTGCAAT CCACCGCTGA CTGGGGACGG GTGGTTTACA TTGGTGAAAC CGGAAAAGTG GAATTCGAGG TCAGCGCCGA TCTGATGCAC CATCAACGGC GGATTATCGG CTCCTGGGTG ACCAGTCTGT TCCATATGGA AAAATGCGCC CATGATCTGA CGGACTGGAA GCTGTGGCCG CGTAACGCCA TTACCCATCG CTTCTCGCTG GAACAGGCAG GTGATGCCTA TGCGCTGATG GCGAGCGGCA AATGCGGGAA AGTTGTGATT AACTTCCCGG ATTAA
|
Protein sequence | MKINRIVNEG FMKTMLAAYL PGNSTVDLRE VAVPTPGINQ VLIKMKSSGI CGSDVHYIYH QHRATAAAPD KPLYQGFING HEPCGQIVAM GQGCRHFKEG DRVLVYHISG CGFCPNCRRG FPISCTGKGK AAYGWQRDGG HAEYLLAEEK DLILLPDALS YEDGAFISCG VGTAYEGILR GEVSGSDNVL VVGLGPVGMM AMMLAKGRGA KRIIGVDMLP ERLAMAKQLG VMDHGYLATT EGLPQIIAEL THGGADVALD CSGNAAGRLL ALQSTADWGR VVYIGETGKV EFEVSADLMH HQRRIIGSWV TSLFHMEKCA HDLTDWKLWP RNAITHRFSL EQAGDAYALM ASGKCGKVVI NFPD
|
| |