Gene Tpen_0331 details


Gene Information

Locus tag: Tpen_0331
Symbol:
ID: 4601698
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 297750
End bp: 300908
Gene Length: 3159 bp
Protein Length: 1052 aa
Translation table: 11
GC content: 54%
IMG OID: 639773091
Product: DEAD/DEAH box helicase domain-containing protein
Protein accession: YP_919743
Protein GI: 119719248
COG category: [R] General function prediction only
COG ID: [COG1205] Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster
TIGRFAM ID:
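
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS whose final codon is a stop codon (the gene sequence in this record ends in TAG). The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(297750, 300908)
print(length)                  # 3159
print(protein_length(length))  # 1052
```

Both values match the Gene Length and Protein Length fields of the record.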


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGAACT CAGAGGAGAT ATTGCAAAAG CTGGGCTACA GGTACTATGC GTTTAGAGAG 
CCCCCCTCGG AGCCCGAGAC AGCGGACGTC AGCTTCGCGG ACATATTGGG CGTTGAGAAA
GCCTCGGGGA GGCTTGCAGC CTTGTTGGGC AGAAAGCTCT ACAAGCACCA GGTAGAAGCC
TTCAAGGCGT TAAGCGAAGG GAAGAACGTC GTTCTGAAAG CCGGTACGGG TAGCGGGAAA
ACAGAGGCAT GGTTCCTGTA CGTGGTGAAG AGTAAGAAGC GTGCCTTGGC AGTCTACCCT
ACGCTGGCAC TAGCCTCTGA CCAGCTGGCG AGGCTGAAGG ACTACTCGCA GAGCCTGGGG
GTAAGGGTCT CCCAGATAGA TGCTACGTCG AAGTCCGAGA TGTTCGAGAA GGGTTTAAAG
TCCTCGGGCA TAAGGTCTGC TCTCTCGGAG TCTGACATAG TGGTAACGAA CCCCGCCTTC
CTGTTGATGG ACCTTAAACG CCTAGCTACA AAGCCGTCCA CCTGTACTCT TTTCAGCTTT
TTTCAGCAGT TGGATCTTCT CGTCCTAGAC GAGATCGACT TCTACAGCCC CAGAGAGCTA
GCCCTGCTTG TCTCGATTAT GAAGATTCTC TCGGAGCTAC GCCCTACCCT ACAGTTCGCA
GTCCTCACAG CCGGGCTTTC GAACCCCTCG GAGTTCTGCG ACATCATGCG GGGAATGAAC
GGAAGGGAGT GCGCCGTGGT GGAGGGGAAA CCTTTCAAGA GAGAAAACAG GTACATCCTC
GTCCTCGGAA AGAACCTGGA CAAACTCTGG GAATTCGCCA GGGAACACTA CGAGGAGTTA
CTGAACGCCG GCGCGGGCGA GGACATCAGG AAGGCCCTAG AAGACTTCGA CTACTTTCAG
CGCAACCTCT ACAAGGTGCT CGAAGCCCTG AGAGCTGTGG GTGTGGATCC TCCCTCGCCG
AGCGTAGACC CCGTGGAGAT ACTGGAGAAC TACCTCTACG ACGACGGCGT CACGATAGTG
TTCACGGGTA GCATTAACAA GGCGGAGGAG GTTTACCGAA AGCTCAGGCA CAGAGTTGGA
GAAGAGTTAG CGGGGCGCGT GGCTGTTCAC CACCACCTCG TCGCTAAGTC GAAGAGGAGG
GAGGTAGAGG AGAAGATAAG GGAGGGAGCC GTACGCGTAG TACTCTCGCC GAGAACGCTG
AGCCAGGGTA TTGACATAGG TAACGTTGTC CGCGTTGTTC ACCTTGGGTT ACCGGAGGAG
GTTCGCGAAT TCTACCAGAA GGAAGGCAGG AAGGGTAGAC GGTTGGAGCA GGAGTTCACG
GAGTCTGTAA TCATACCGCA CACGAGGTGG GACCGCGAGC TCCTATCGAG GGGCGTCGAC
CTTTTTGTGA AGTGGGTTAA CGCGCCGCTA GAGGTAACGC TGGTAAACAA GGACAACAAG
TACTCTCTAC TCTTCTACTT GATCTTCAAG CTGAAATCGG GCAGAAGCCT CTCCAAGGAG
GAGGCGGAGT TCCTGGAAGG GTTGGGGCTC CTTGAGAACG GGGAGCTGAC CAAGAAGGGC
GAACAAGCAT GGTACTACAT TAACTTCTAC GAGTATGCAC CCCCGTTCGG CGTCAAGAGG
GTTTACAGGG ACGAGTCCGG GGAGAAGTAC CTTGAGGATG TATCGTTCTC TGACCTCGTG
GAGAAGTTCC AGGTTGGCTG CTTCGACTAT ACCTCCGACG GGTTGGTTAC CGGGATCCTG
CTGGGTGGAA GCAGGGGGCG AGCAGTAAGG AGGATAGAGG TATCCCCTAT AAGGGAGAGC
GTGCTTTACG CGCACGACGC TATAGCTTAT GCTCTGGAGG AGTACAAGAA GACGAAGCTC
CAGTGGGGAG AAAAGCCCAG CCTCTACTCG GACTACATGA AGGGGAGCGT TCGATCGGAG
GCGATAAGCA ACGTTATACC GCCGACTGGA GGCTTTGGCA TGTACATAAA GCTCCCGTAC
AAGGTGGTTT GGCACGTTGA AAGCGAACGA GGATCTCTGC TTAGCCTCTC CGGCAAGACG
TTCATTTACC GCAGGAAGAG GACAATTGAG GTTCCGGGCT TCGTCGCCGG GAGGTACAGC
GACTTCACGT ACGGCGAGCT CTACGAGCTG AACCCCGGCG ACGAGCTGAA GAACATAAGG
CTCGGGGCTG CTTTCCTATC GGTCTTTCTG AGGGAACGCT ACAACATGCC GCTACATACG
TTTTCCTTCT CATTCTCGGC GCTGGGCGGC AGGAAAACAG TCGTAATATG GGAGGAGGAG
TGCGCGGGCT TCATCGAGAA GATGGACTGG TTCAAGGTAT ACAGGGAGAT AGACGACTAC
AAGCCTTCCG AGCTGGCAGA GATATACCTC CTACTGCGCG ACGAAGAAGC TTACAGCGAG
TGGCTTTCAT TCTCGGGGGA CTGGGAGGTA GCGAAGGCGT TCACCAAAAG GCTCCTCGAA
TACGTGTTGC AGAAGAGGAG GATTAAGGTT GTGTTTTATG GCGAAGAAAT GTTTGTTCCG
AAGCCTGGCA GGCATCTGAA GCTGGTAGCG CTGGACACAC TGCTCGTACC CATCCGTGAG
GGTGGAGAGG TGTCGAAAGC CTACGTAGGA ATATTCGATG GAGATGAGTC CGCGGTCTCG
GTTTTCACGA AGGAGTTCTA CAAGGTAAGC GGTCCCGGAG ACGAGTTGAA CAACAAACTC
ATGAACTTAG TGAACGATGG CTTCAGAGTC CTCATTTACG ATTTGGACAG AGTCCGCGGA
GAACTGCACG CAGCGGGGTT AACCTACCAT GCGGCCACCC TCGCCGGCCT ACTCCAGCTG
GGCGCAGTTA TCGACGTGAA GAAAGTTGTC GAAGAGAAAA CGGGTCTTCA GCTACCGTTA
TCCACGGCTA AACAGCTACT GAGTAAGGAC GCGTTAAGCT CGCTAGGTAT CGAAAAGACG
GTAGACCTAG GAGATCTAGA ACTAGAGCTA GCCGGTTTTT ACTCGAGGCT AAGAACGCCT
GGGAGGAGCC CAAAGCGGCT ACCCTCCATG AAATTCCTCG ACGAAGCCTC CAGGAGGTTC
ATCGACGAAA ACGTTAGGGT CATCTACCTC TTATGGCTCA TCTTCAACAG CCAGGCTGGT
CAAAAAGCCT TAGAGAAAAT GATGCAAGTA CGCGAATAG
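
The gene sequence above is printed in blocks of ten bases, so it needs light cleaning before any computation such as the GC content reported in the record. The following is a minimal sketch, assuming the sequence is available as plain text with spaces and line breaks. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as a short excerpt, so the printed fraction describes that excerpt rather than the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used only as a short excerpt.
excerpt = "ATGCTGAACT CAGAGGAGAT ATTGCAAAAG CTGGGCTACA GGTACTATGC GTTTAGAGAG"
seq = clean_sequence(excerpt)
print(len(seq))                   # number of bases in the excerpt
print(f"{gc_fraction(seq):.2f}")  # GC fraction of the excerpt only
```

Passing the full gene sequence text to the same two functions would give the figure to compare with the GC content field.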
 
Protein sequence
MLNSEEILQK LGYRYYAFRE PPSEPETADV SFADILGVEK ASGRLAALLG RKLYKHQVEA 
FKALSEGKNV VLKAGTGSGK TEAWFLYVVK SKKRALAVYP TLALASDQLA RLKDYSQSLG
VRVSQIDATS KSEMFEKGLK SSGIRSALSE SDIVVTNPAF LLMDLKRLAT KPSTCTLFSF
FQQLDLLVLD EIDFYSPREL ALLVSIMKIL SELRPTLQFA VLTAGLSNPS EFCDIMRGMN
GRECAVVEGK PFKRENRYIL VLGKNLDKLW EFAREHYEEL LNAGAGEDIR KALEDFDYFQ
RNLYKVLEAL RAVGVDPPSP SVDPVEILEN YLYDDGVTIV FTGSINKAEE VYRKLRHRVG
EELAGRVAVH HHLVAKSKRR EVEEKIREGA VRVVLSPRTL SQGIDIGNVV RVVHLGLPEE
VREFYQKEGR KGRRLEQEFT ESVIIPHTRW DRELLSRGVD LFVKWVNAPL EVTLVNKDNK
YSLLFYLIFK LKSGRSLSKE EAEFLEGLGL LENGELTKKG EQAWYYINFY EYAPPFGVKR
VYRDESGEKY LEDVSFSDLV EKFQVGCFDY TSDGLVTGIL LGGSRGRAVR RIEVSPIRES
VLYAHDAIAY ALEEYKKTKL QWGEKPSLYS DYMKGSVRSE AISNVIPPTG GFGMYIKLPY
KVVWHVESER GSLLSLSGKT FIYRRKRTIE VPGFVAGRYS DFTYGELYEL NPGDELKNIR
LGAAFLSVFL RERYNMPLHT FSFSFSALGG RKTVVIWEEE CAGFIEKMDW FKVYREIDDY
KPSELAEIYL LLRDEEAYSE WLSFSGDWEV AKAFTKRLLE YVLQKRRIKV VFYGEEMFVP
KPGRHLKLVA LDTLLVPIRE GGEVSKAYVG IFDGDESAVS VFTKEFYKVS GPGDELNNKL
MNLVNDGFRV LIYDLDRVRG ELHAAGLTYH AATLAGLLQL GAVIDVKKVV EEKTGLQLPL
STAKQLLSKD ALSSLGIEKT VDLGDLELEL AGFYSRLRTP GRSPKRLPSM KFLDEASRRF
IDENVRVIYL LWLIFNSQAG QKALEKMMQV RE