Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1516 |
Symbol | |
ID | 4601112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1463999 |
End bp | 1464985 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639774291 |
Product | alcohol dehydrogenase |
Protein accession | YP_920916 |
Protein GI | 119720421 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.997072 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGGCCG CGTTACTCTA CGGTCCGGGA GACCTGAGGG TCGAGGACGT ACCGCTCCCA GAGAAGCCGG AGGGATGGGC ACTCGTAAAG ACCCTCGCGG TCGGTATCTG CGGGACGGAC AAGGCGTTCT ACAAAGGCAC GTACCCGCTC TTCAAGAAGC CCCTCATCCC CGGGCACGAG GTCTCGGGGG TCGTCGTGGA GGGGCCGGAG GAACTCGTGG GCAGGCTCGT GGTCTCCGAG ATCAATTTCC CGTGCTGGAG GTGCGAGTAC TGCAGGTCGG GCCTCTACAC TCACTGCCCC TACAAGAAGA CTCTCGGCAT AGACTTCGAT GGGGGCCTCG CCGAGTACTT CGTCGCCCCT GCAACCGCGC TTCACGCCGC CGAGGGCTTA GACCCCGTAG TCGCCACCGA GGTCGAGCCC CTAGCGGCCT TGCTCAACGC GCTGAGGCTC AAGCCGCCCG CCCCGGGCGA CAGCGTAGCC GTGGTGGGAA CCGGGAACCT CGCCGTGATG CTCGTCCAGG TGCTCAAGGA CGCGGGCTTC AGGCCCGTCG TCGTCTCGCG GGCTGGTAGC TCCAAGGCAG AGATACTCAG GAGCCTAGAC GTTGAGGTGG TGACGGCAGA GCAGGCGTCC AGGCTGGGCG GCGAGTCCGG GGACGGCGTC GGGTTCGACG TGGTGTTCGA GGTCTCCGGG GACCCCTCAG CGCTTAACCT CGCGGTCGAC CTCGTAAAGC CGAGGGGCAC TGTTCACTTA AAGTCGACGC CCGGCAACCC TGGGAGCGTT AACCTCACCA AGGCCGTAGT CAAGGAGGTC GCCGTCATCG GCTCCAGGTG CGGTACCTTC AGGGAGTTCC GCGAAGCCAT AAGGCTGTTG AGAGAGGGGA GGGTGAGGCC CGTGATTACC AGCGTCTACG GCGGGTTGGA GAAGGCGAGG GAAGCCTTCG AGAGGTCGTT CAGGCCCGGA GAGGTAAAGG TCGTAGTCAA GCCATAG
|
Protein sequence | MKAALLYGPG DLRVEDVPLP EKPEGWALVK TLAVGICGTD KAFYKGTYPL FKKPLIPGHE VSGVVVEGPE ELVGRLVVSE INFPCWRCEY CRSGLYTHCP YKKTLGIDFD GGLAEYFVAP ATALHAAEGL DPVVATEVEP LAALLNALRL KPPAPGDSVA VVGTGNLAVM LVQVLKDAGF RPVVVSRAGS SKAEILRSLD VEVVTAEQAS RLGGESGDGV GFDVVFEVSG DPSALNLAVD LVKPRGTVHL KSTPGNPGSV NLTKAVVKEV AVIGSRCGTF REFREAIRLL REGRVRPVIT SVYGGLEKAR EAFERSFRPG EVKVVVKP
|
| |