Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0331 |
Symbol | |
ID | 4601698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 297750 |
End bp | 300908 |
Gene Length | 3159 bp |
Protein Length | 1052 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639773091 |
Product | DEAD/DEAH box helicase domain-containing protein |
Protein accession | YP_919743 |
Protein GI | 119719248 |
COG category | [R] General function prediction only |
COG ID | [COG1205] Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAACT CAGAGGAGAT ATTGCAAAAG CTGGGCTACA GGTACTATGC GTTTAGAGAG CCCCCCTCGG AGCCCGAGAC AGCGGACGTC AGCTTCGCGG ACATATTGGG CGTTGAGAAA GCCTCGGGGA GGCTTGCAGC CTTGTTGGGC AGAAAGCTCT ACAAGCACCA GGTAGAAGCC TTCAAGGCGT TAAGCGAAGG GAAGAACGTC GTTCTGAAAG CCGGTACGGG TAGCGGGAAA ACAGAGGCAT GGTTCCTGTA CGTGGTGAAG AGTAAGAAGC GTGCCTTGGC AGTCTACCCT ACGCTGGCAC TAGCCTCTGA CCAGCTGGCG AGGCTGAAGG ACTACTCGCA GAGCCTGGGG GTAAGGGTCT CCCAGATAGA TGCTACGTCG AAGTCCGAGA TGTTCGAGAA GGGTTTAAAG TCCTCGGGCA TAAGGTCTGC TCTCTCGGAG TCTGACATAG TGGTAACGAA CCCCGCCTTC CTGTTGATGG ACCTTAAACG CCTAGCTACA AAGCCGTCCA CCTGTACTCT TTTCAGCTTT TTTCAGCAGT TGGATCTTCT CGTCCTAGAC GAGATCGACT TCTACAGCCC CAGAGAGCTA GCCCTGCTTG TCTCGATTAT GAAGATTCTC TCGGAGCTAC GCCCTACCCT ACAGTTCGCA GTCCTCACAG CCGGGCTTTC GAACCCCTCG GAGTTCTGCG ACATCATGCG GGGAATGAAC GGAAGGGAGT GCGCCGTGGT GGAGGGGAAA CCTTTCAAGA GAGAAAACAG GTACATCCTC GTCCTCGGAA AGAACCTGGA CAAACTCTGG GAATTCGCCA GGGAACACTA CGAGGAGTTA CTGAACGCCG GCGCGGGCGA GGACATCAGG AAGGCCCTAG AAGACTTCGA CTACTTTCAG CGCAACCTCT ACAAGGTGCT CGAAGCCCTG AGAGCTGTGG GTGTGGATCC TCCCTCGCCG AGCGTAGACC CCGTGGAGAT ACTGGAGAAC TACCTCTACG ACGACGGCGT CACGATAGTG TTCACGGGTA GCATTAACAA GGCGGAGGAG GTTTACCGAA AGCTCAGGCA CAGAGTTGGA GAAGAGTTAG CGGGGCGCGT GGCTGTTCAC CACCACCTCG TCGCTAAGTC GAAGAGGAGG GAGGTAGAGG AGAAGATAAG GGAGGGAGCC GTACGCGTAG TACTCTCGCC GAGAACGCTG AGCCAGGGTA TTGACATAGG TAACGTTGTC CGCGTTGTTC ACCTTGGGTT ACCGGAGGAG GTTCGCGAAT TCTACCAGAA GGAAGGCAGG AAGGGTAGAC GGTTGGAGCA GGAGTTCACG GAGTCTGTAA TCATACCGCA CACGAGGTGG GACCGCGAGC TCCTATCGAG GGGCGTCGAC CTTTTTGTGA AGTGGGTTAA CGCGCCGCTA GAGGTAACGC TGGTAAACAA GGACAACAAG TACTCTCTAC TCTTCTACTT GATCTTCAAG CTGAAATCGG GCAGAAGCCT CTCCAAGGAG GAGGCGGAGT TCCTGGAAGG GTTGGGGCTC CTTGAGAACG GGGAGCTGAC CAAGAAGGGC GAACAAGCAT GGTACTACAT TAACTTCTAC GAGTATGCAC CCCCGTTCGG CGTCAAGAGG GTTTACAGGG ACGAGTCCGG GGAGAAGTAC CTTGAGGATG TATCGTTCTC TGACCTCGTG GAGAAGTTCC AGGTTGGCTG CTTCGACTAT ACCTCCGACG GGTTGGTTAC CGGGATCCTG CTGGGTGGAA GCAGGGGGCG AGCAGTAAGG AGGATAGAGG TATCCCCTAT AAGGGAGAGC GTGCTTTACG CGCACGACGC TATAGCTTAT GCTCTGGAGG AGTACAAGAA GACGAAGCTC CAGTGGGGAG AAAAGCCCAG CCTCTACTCG GACTACATGA AGGGGAGCGT TCGATCGGAG GCGATAAGCA ACGTTATACC GCCGACTGGA GGCTTTGGCA TGTACATAAA GCTCCCGTAC AAGGTGGTTT GGCACGTTGA AAGCGAACGA GGATCTCTGC TTAGCCTCTC CGGCAAGACG TTCATTTACC GCAGGAAGAG GACAATTGAG GTTCCGGGCT TCGTCGCCGG GAGGTACAGC GACTTCACGT ACGGCGAGCT CTACGAGCTG AACCCCGGCG ACGAGCTGAA GAACATAAGG CTCGGGGCTG CTTTCCTATC GGTCTTTCTG AGGGAACGCT ACAACATGCC GCTACATACG TTTTCCTTCT CATTCTCGGC GCTGGGCGGC AGGAAAACAG TCGTAATATG GGAGGAGGAG TGCGCGGGCT TCATCGAGAA GATGGACTGG TTCAAGGTAT ACAGGGAGAT AGACGACTAC AAGCCTTCCG AGCTGGCAGA GATATACCTC CTACTGCGCG ACGAAGAAGC TTACAGCGAG TGGCTTTCAT TCTCGGGGGA CTGGGAGGTA GCGAAGGCGT TCACCAAAAG GCTCCTCGAA TACGTGTTGC AGAAGAGGAG GATTAAGGTT GTGTTTTATG GCGAAGAAAT GTTTGTTCCG AAGCCTGGCA GGCATCTGAA GCTGGTAGCG CTGGACACAC TGCTCGTACC CATCCGTGAG GGTGGAGAGG TGTCGAAAGC CTACGTAGGA ATATTCGATG GAGATGAGTC CGCGGTCTCG GTTTTCACGA AGGAGTTCTA CAAGGTAAGC GGTCCCGGAG ACGAGTTGAA CAACAAACTC ATGAACTTAG TGAACGATGG CTTCAGAGTC CTCATTTACG ATTTGGACAG AGTCCGCGGA GAACTGCACG CAGCGGGGTT AACCTACCAT GCGGCCACCC TCGCCGGCCT ACTCCAGCTG GGCGCAGTTA TCGACGTGAA GAAAGTTGTC GAAGAGAAAA CGGGTCTTCA GCTACCGTTA TCCACGGCTA AACAGCTACT GAGTAAGGAC GCGTTAAGCT CGCTAGGTAT CGAAAAGACG GTAGACCTAG GAGATCTAGA ACTAGAGCTA GCCGGTTTTT ACTCGAGGCT AAGAACGCCT GGGAGGAGCC CAAAGCGGCT ACCCTCCATG AAATTCCTCG ACGAAGCCTC CAGGAGGTTC ATCGACGAAA ACGTTAGGGT CATCTACCTC TTATGGCTCA TCTTCAACAG CCAGGCTGGT CAAAAAGCCT TAGAGAAAAT GATGCAAGTA CGCGAATAG
|
Protein sequence | MLNSEEILQK LGYRYYAFRE PPSEPETADV SFADILGVEK ASGRLAALLG RKLYKHQVEA FKALSEGKNV VLKAGTGSGK TEAWFLYVVK SKKRALAVYP TLALASDQLA RLKDYSQSLG VRVSQIDATS KSEMFEKGLK SSGIRSALSE SDIVVTNPAF LLMDLKRLAT KPSTCTLFSF FQQLDLLVLD EIDFYSPREL ALLVSIMKIL SELRPTLQFA VLTAGLSNPS EFCDIMRGMN GRECAVVEGK PFKRENRYIL VLGKNLDKLW EFAREHYEEL LNAGAGEDIR KALEDFDYFQ RNLYKVLEAL RAVGVDPPSP SVDPVEILEN YLYDDGVTIV FTGSINKAEE VYRKLRHRVG EELAGRVAVH HHLVAKSKRR EVEEKIREGA VRVVLSPRTL SQGIDIGNVV RVVHLGLPEE VREFYQKEGR KGRRLEQEFT ESVIIPHTRW DRELLSRGVD LFVKWVNAPL EVTLVNKDNK YSLLFYLIFK LKSGRSLSKE EAEFLEGLGL LENGELTKKG EQAWYYINFY EYAPPFGVKR VYRDESGEKY LEDVSFSDLV EKFQVGCFDY TSDGLVTGIL LGGSRGRAVR RIEVSPIRES VLYAHDAIAY ALEEYKKTKL QWGEKPSLYS DYMKGSVRSE AISNVIPPTG GFGMYIKLPY KVVWHVESER GSLLSLSGKT FIYRRKRTIE VPGFVAGRYS DFTYGELYEL NPGDELKNIR LGAAFLSVFL RERYNMPLHT FSFSFSALGG RKTVVIWEEE CAGFIEKMDW FKVYREIDDY KPSELAEIYL LLRDEEAYSE WLSFSGDWEV AKAFTKRLLE YVLQKRRIKV VFYGEEMFVP KPGRHLKLVA LDTLLVPIRE GGEVSKAYVG IFDGDESAVS VFTKEFYKVS GPGDELNNKL MNLVNDGFRV LIYDLDRVRG ELHAAGLTYH AATLAGLLQL GAVIDVKKVV EEKTGLQLPL STAKQLLSKD ALSSLGIEKT VDLGDLELEL AGFYSRLRTP GRSPKRLPSM KFLDEASRRF IDENVRVIYL LWLIFNSQAG QKALEKMMQV RE
|
| |