Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3296 |
Symbol | yqhD |
ID | 6143233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3370449 |
End bp | 3371612 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618126 |
Product | alcohol dehydrogenase yqhD |
Protein accession | YP_001745276 |
Protein GI | 170682079 |
COG category | [C] Energy production and conversion |
COG ID | [COG1979] Uncharacterized oxidoreductases, Fe-dependent alcohol dehydrogenase family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.495379 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAACT TTAATCTGCA CACCCCAACC CGCATTCTGT TTGGTAAAGG CGCAATCGCT GGTTTACGCG AGCAAATTCC TCACGATGCA CGCGTATTGA TTACCTACGG CGGCGGCAGC GTGAAAAAAA CCGGCGTTCT CGATCAAGTT CTGGATGCCC TGAAAGGCAT GGACGTGCTG GAATTTGGCG GTATTGAGCC AAACCCGGCT TATGAAACGC TGATGAACGC CGTGAAACTG GTTCGCGAAC AAAAAGTGAC TTTCCTGCTG GCGGTTGGCG GCGGTTCTGT ACTGGACGGC ACCAAATTTA TCGCCGCAGC GGCTAATTAT CCGGAAAATA TCGATCCGTG GCACATTCTG CAAACGGGCG GCAAAGAGAT TAAAAGCGCC ATCCCGATGG GCTGTGTGCT GACGCTGCCA GCAACCGGTT CAGAATCCAA CGCAGGCGCG GTGATCTCCC GTAAAACCAC AGGCGACAAG CAGGCATTCC ATTCCGCCCA TGTTCAGCCG GTATTTGCCG TGCTCGATCC GGTTTATACC TACACCCTGC CGCCGCGTCA GGTGGCTAAC GGTGTAGTGG ATGCCTTTGT GCACACCGTG GAACAGTATG TTACCAAACC GGTTGATGCC AAAATTCAGG ACCGTTTCGC AGAAGGCATT TTGTTGACGC TGATCGAAGA TGGTCCGAAA GCCCTGAAAG AGCCAGAAAA CTACGATGTG CGCGCCAACG TTATGTGGGC GGCGACTCAG GCGCTGAACG GTTTGATCGG CGCTGGCGTA CCGCAGGACT GGGCAACGCA TATGCTAGGC CACGAACTGA CTGCGATGCA CGGTCTGGAT CACGCACAAA CACTGGCTAT CGTCCTGCCT GCACTGTGGA ATGAAAAACG CGATACCAAG CGCGCTAAGC TGCTGCAATA TGCTGAACGC GTCTGGAACA TCACTGAAGG TTCCGACGAT GAGCGTATTG ACGCCGCGAT TGCCGCAACC CGCAATTTCT TTGAGCAATT AGGCGTGCCG ACCCACCTCT CCGACTACGG TCTGGACGGC AGCTCCATCC CGGCTTTGTT GAAAAAACTG GAAGAGCACG GTATGACCCA ACTGGGCGAA AATCATGACA TTACGCTGGA TGTCAGCCGC CGCATATACG AAGCCGCCCG CTAA
|
Protein sequence | MNNFNLHTPT RILFGKGAIA GLREQIPHDA RVLITYGGGS VKKTGVLDQV LDALKGMDVL EFGGIEPNPA YETLMNAVKL VREQKVTFLL AVGGGSVLDG TKFIAAAANY PENIDPWHIL QTGGKEIKSA IPMGCVLTLP ATGSESNAGA VISRKTTGDK QAFHSAHVQP VFAVLDPVYT YTLPPRQVAN GVVDAFVHTV EQYVTKPVDA KIQDRFAEGI LLTLIEDGPK ALKEPENYDV RANVMWAATQ ALNGLIGAGV PQDWATHMLG HELTAMHGLD HAQTLAIVLP ALWNEKRDTK RAKLLQYAER VWNITEGSDD ERIDAAIAAT RNFFEQLGVP THLSDYGLDG SSIPALLKKL EEHGMTQLGE NHDITLDVSR RIYEAAR
|
| |