Gene Information |
Locus tag | Tpen_0821 |
Symbol | |
ID | 4601992 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 774143 |
End bp | 775339 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639773598 |
Product | alcohol dehydrogenase |
Protein accession | YP_920225 |
Protein GI | 119719730 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
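The length fields in the record above follow from the coordinates. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed values) and that the final codon of the unspliced CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(774143, 775339)
print(length)                     # 1197
print(protein_length_aa(length))  # 398
```

Both results agree with the Gene Length and Protein Length rows.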
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.973197 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGCAG CCTTAGTGAC CGCGGACTTT GCGCCTAGAC CCGGCTACAA GATCACGCAG GACGAGATAA GGACCCACAA GGTGAGAGAG GGGGCGAAGG TCTGGCGGAA CCCGAAGCTC GTGCTGAGAA CAGACTACCC TGTACCGGAG CCGAAGCCCG ACGAGATACT CATACGCGTC AAGGCGGTAG GCATATGCGG GTCCGATATA CACTTCCTCG AGACGGATAG CGAAGGCTAT ATCCTTTACC CCGGCCTCAC GAGGTTCCCC GTGGTTATCG GGCACGAGTT CAGCGGCGTG GTCGAGAAGG TAGGAACCAA CGTCAAGACG TTCAAGCCGG GAGACATGGT GACGAGCGAG GAGATGTTCT GGTGCGGGGA ATGCGATGCG TGTAGAAGCG TGGACTTCAA CCACTGCCTG CGGCTAAACG ACCCGGCGGA CCTCGAGTTC GGCGAGCTCG GCTTCACGCA CGACGGCGCG ATGGCAGACT ACGTAGTCGT CAAGGCCAAG TATGCCTGGA AGATAGACTC GCTCCTCGAC AGGTACGGTA GCGAGGACAA AGCGTTCGAG GCGGGGAGCC TCGTAGAGCC TACCAGCGTA GCCTACCACG CCATGTTCAC GAGGGCAGGC GGCTTCAAAC CGGGAGCCTA CGTCGCGGTC TGGGGCGCAG GCCCCATAGG GCTTGCAGCC ATAGCTCTCG CGAAGGCCGC GGGCGCGGGC AAGGTCATCG CGTTCGAGGT AAGCCCGACG AGAAGAGAGC TCGCAAAGAA GGTGGGCGCA GACTACGTGT TCAACCCGGT CGAGCTCTCG AAGAACGGCG TTGAGCCTTG GGAGAAGATA ATGGAGGTGA CCGGGGGCCA GGGCGCGGAC TTCCACGTTG AAGCTGCCGG CGCGCCGAGA CACACCATAC CGCAGATGCA GAAAAGCCTC GCCATTAACG GCAAGATAGT GCAGATAGGG CGGGCGGCTG AGGACGTCCC GATATACCTT GAAGTGTTCC AGGTTAGGAG AGGCCAGATA TTCGGGAGCC AGGGGCACTC CGGCTTCGGC AACTTCGGCA ACGTCATACG GCTGATGGCC GCCGGCAAGA TAGACATGAC CCAGATCATA ACCGCCAGGT TCTCTCTCGA CGAGGTCTAC AAGGCTTTCG AGAGGGCCCA CCAGAGGATA GACGGCAAGA TAACTCTCAA ACCGTAA
|
Protein sequence | MRAALVTADF APRPGYKITQ DEIRTHKVRE GAKVWRNPKL VLRTDYPVPE PKPDEILIRV KAVGICGSDI HFLETDSEGY ILYPGLTRFP VVIGHEFSGV VEKVGTNVKT FKPGDMVTSE EMFWCGECDA CRSVDFNHCL RLNDPADLEF GELGFTHDGA MADYVVVKAK YAWKIDSLLD RYGSEDKAFE AGSLVEPTSV AYHAMFTRAG GFKPGAYVAV WGAGPIGLAA IALAKAAGAG KVIAFEVSPT RRELAKKVGA DYVFNPVELS KNGVEPWEKI MEVTGGQGAD FHVEAAGAPR HTIPQMQKSL AINGKIVQIG RAAEDVPIYL EVFQVRRGQI FGSQGHSGFG NFGNVIRLMA AGKIDMTQII TARFSLDEVY KAFERAHQRI DGKITLKP
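The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how the two relate: it strips the block spacing, computes a GC fraction, and translates codons into residues. The helper names are hypothetical. The codon assignments used are those of the standard genetic code, which translation table 11 shares; the alternative start codons that table 11 allows are not handled here, which does not matter for this gene because it begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
fragment = clean_sequence("ATGAGAGCAG CCTTAGTGAC CGCGGACTTT")
print(translate(fragment))  # MRAALVTADF
```

The translated fragment matches the first block of the protein sequence. Applying `gc_content` to the full cleaned gene sequence gives the value to compare against the GC content row.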
|
| |