Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0861 |
Symbol | |
ID | 4602204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 810940 |
End bp | 813174 |
Gene Length | 2235 bp |
Protein Length | 744 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639773639 |
Product | DEAD/DEAH box helicase domain-containing protein |
Protein accession | YP_920265 |
Protein GI | 119719770 |
COG category | [R] General function prediction only |
COG ID | [COG1205] Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCCTCGG ACGACGTACT CGAGGCTTTG AGGAACGACA GCGGCCACAG GGTAGTCTAC GTCTTCTTCG AGAGCCCTAG GGAGCCTGAG CCCGGGCCCC CCGTGGAGGA GGTCGTAAAG GAAGAGGTGC TCGTCAAGGC TTTGAAAAGC AGGGGTATAG CGAGGCTCTA CAGGTTTCAG GCTGAGGCTA TCGAGTCGAT AAGGTCCGGG AACAACACGC TCATAATAGC GGGTACCGGT ACTGGGAAGA CGGAGGCGTT CCTGGTACCA ATCCTGGAGA GGCTCTATAG GGAGCCCGAG GAGACCGGCA TCCTGGTGTA CCCTACGAAG GCGCTCGCAA GGGACCAGGT GGAGAGGATA TACTCGTACG CCGGGGCTGT CTTCGGCTTC AGGGTCTCCG TGTACGACGG GGATACCCCT GAGAAGGAGC GCGAGATGCT ACAAGTGTAC CCGCCCAGGC TACTCGTAAC GAACCCGGAC ATGCTCCACC TGAGCCTGCG CAGAGGGGAA GGCGTGAAGC CTATACTCGA GAAAGCCAAG TTCGTCGTGT TGGACGATGC GCATATATAT AGCGGGGTTT TCGGGTCCCA CGTTCACTAC GTCTTAAAGC GCATGAAGAG GTTCATGCCA GCGGACGCAG TCTTCGTGGC CTCCTCAGCC ACCATAGGAA ACCCGCGCGA GTTCTCCCAA AAGCTCTTCG GAGAGGAGTG CAGGGTTGTC GAAGCCGGTC TGACCCGGCG TGCCCCCGTG TACCACGTGA TGGTAAGGCC CGCGGCGAGG TCACGCATGG CGGAGGCGCT GCGCCTCTTG AAGATCCTCG CTGAGCGGCG TCTGCGGACG TTACTTTTCG TGGACAGCCA CAAGGTGGCG GAGAGCCTCA GCGTCCTAGC GCAACGCGAG GGGCTTAAAG TCTCCGTGCA CAGGGCGGGG CTTCTCCCGA GCCACAGAAG GAGCGTCGAG GAGAAGCTCA GGAAGGGGGA GCTCGACGCC GTCGTGACGA CCCCTACGCT GGAGCTAGGG ATAGACATAG GCGAGCTCGA CGCCGTCGTG CTGTACGGCG TGCCTCCCAC GTTCAGCAAG TACGTGCAGA GAGCCGGCAG GGTGGGGCGG CGGGGCAGGA CGGGGTACGT CGTCATGATC CTCGGGGACG ACCCGATAAG CTCGTACTAT GAGAGAAACC CCGCGGAGTT TTTCAACCAA AAGCCGGACC CGGTGTTCCT CGATCCCCTA AACGAGGAAG TTATGCGCGT ACACCTGGTG GCTATGGCTC AGGATGCGCC GTTCAGGCTG GACAGCCTAG GCGGGGCTGA GCGCAGAGTA TGCGAGAAGC TACTAGAAGA GGGGTTGCTG AGGCTAGCCC CGGGAGGGTT CGTCGAGCCG ACAAGCAGGG GGTTAAGGTT CCTGGCGTCC CGCGACAACA TCAGGGGTAT AGGGGAGCAG GTGAAGATAG TAACGGATAC AGGCAAGGTC ATCGGGTTCC GGGAGATGCC CCAGGCGATA AAGGAGCTCT ACCCGGGAGC TATCTATATC CATGGGGGAA CTCCCTACAT CTCCCTGGGC ATAGAAGGTA GGAAGGCGAG GGTCAAGCTT CTACCGACGT CCACGATACC CGTGACGACC TCGCCGCTAT ACTACACCTG GCCGTTCGAG AATGAGGTCC TCGACGAGAG GAGCGTTTTC GGGATACGCG TCTCCTACCT GGAGCTCGAG GTTGCGGATC ACGTCTACGG CTACGTCACG AAGAGCTTCC CAGAGGGCCA CGTCATTTCG CAGAAGATAC TCGAGGAGGA GCTCACCTAC AGGTTCAAGA CGAAGGGGTT GCTCCTGGAG ATGCAGCCTA ACCCCCAGTG GAGCGAGCTT CAAAACGCGG AAGCCTTCCA CGCAGTAGAA CACGCCCTGA TCTACGCGGG GCAGCTCGTC GTAGGCGCTG CGCCGACAGA CATGGGTGGC ATCAGCTTCC CCTCCGGCCA CATATACATC TACGACGCCT TCCCGGGAGG GTCGGGCGTA ACGAAGCAAC TCTTGGAGAG GCTCAAAGAA GCTCTGCACA AGGCGTTCGA CATAGTCTCG AGGTGCACCT GCGAGGATGG ATGCCCCAAG TGCATTTTCT CTCCTTACTG CGGCAACAAC AACAGGATAC TCTCGAGGCG GAAAGCCTCC CAGGTTCTCG GCGAGGTACT ATCGCTGAGG ATCGCGGGCA GAAGCCCCGA GAGGTACGGC AAGCCTCTCG TGTAG
|
Protein sequence | MSSDDVLEAL RNDSGHRVVY VFFESPREPE PGPPVEEVVK EEVLVKALKS RGIARLYRFQ AEAIESIRSG NNTLIIAGTG TGKTEAFLVP ILERLYREPE ETGILVYPTK ALARDQVERI YSYAGAVFGF RVSVYDGDTP EKEREMLQVY PPRLLVTNPD MLHLSLRRGE GVKPILEKAK FVVLDDAHIY SGVFGSHVHY VLKRMKRFMP ADAVFVASSA TIGNPREFSQ KLFGEECRVV EAGLTRRAPV YHVMVRPAAR SRMAEALRLL KILAERRLRT LLFVDSHKVA ESLSVLAQRE GLKVSVHRAG LLPSHRRSVE EKLRKGELDA VVTTPTLELG IDIGELDAVV LYGVPPTFSK YVQRAGRVGR RGRTGYVVMI LGDDPISSYY ERNPAEFFNQ KPDPVFLDPL NEEVMRVHLV AMAQDAPFRL DSLGGAERRV CEKLLEEGLL RLAPGGFVEP TSRGLRFLAS RDNIRGIGEQ VKIVTDTGKV IGFREMPQAI KELYPGAIYI HGGTPYISLG IEGRKARVKL LPTSTIPVTT SPLYYTWPFE NEVLDERSVF GIRVSYLELE VADHVYGYVT KSFPEGHVIS QKILEEELTY RFKTKGLLLE MQPNPQWSEL QNAEAFHAVE HALIYAGQLV VGAAPTDMGG ISFPSGHIYI YDAFPGGSGV TKQLLERLKE ALHKAFDIVS RCTCEDGCPK CIFSPYCGNN NRILSRRKAS QVLGEVLSLR IAGRSPERYG KPLV
|
| |