Gene Tpen_0985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0985 
Symbol 
ID4601961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp934943 
End bp937384 
Gene Length2442 bp 
Protein Length813 aa 
Translation table11 
GC content50% 
IMG OID639773763 
Producthelicase domain-containing protein 
Protein accessionYP_920388 
Protein GI119719893 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1061] DNA or RNA helicases of superfamily II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.033377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTATCCC TGATTGAGAT ACTGAGGTCT GCCGGCAAGG ATCTCTACAA GCACCAGCTC 
GACTTTGTGT CTGATGCTCT GTGGTTTGAC GAGCCCCGTA TTCTCTTGGC TGACGACGTA
GGGCTGGGTA AAACAATCCA GGCTCTCCTC TACATTAAGG CTCTCCTCGA GCTGGGGAGG
GTGAACCATG TTCTAGTAAT AGTGCCTCGT GCTGTCGTGG AGCAGTGGGC ATCTGAGCTG
GAGATGTTCG AGATACCATT CTACATCGTG GAGTCGCCTG ACTTCCCGCT GGGACACAGG
GTCTACCTAG TCACCCTGGA TAGAGCTAAG GTGGACAGCT ACATGGAGGC TTTGGACAAG
ATAAACTGGG ACTTGGTAGT TGTTGATGAG GCTCACAAGC TGAGGCTAGA GACGCTCAGG
TCTAAGGTGG CTATTCTGTG TAGAAGGGCG AGGGGATGTC TTTTACTCAC TGCTACTCCG
CATACGGGCG ACGAGAGTAG CTTCAAGTTC CTGATAGGAC TCGCGAACAG CTATGTTGTA
CGGCGTGAAA AGAAGGATGT GGAGGAGTAT GAGGGGCGCA AGATATTCCC GTCTTTGGGG
TACTGGATAG TGCAAGTGAA AGCAACGAAA GAGGAGAGTG ACGCGCTTCA TAAAGTGCTT
AAATTCCTGG AAAACGATCA GATAGAGCAG ATTGTGCGCG TAGTAGTGGA GAAGAGGGCG
ATGTCGAGCC CAACAGCGTT TTTCAAGACA CTCGGAAAGG TTGTAGGAGG GTATTGCAGC
GAAGAGTTAC TGGAAGAGGG GGAGCTCGAT GCCTGCATTG GTAACGTTGC CGGTGTAAAG
AAGCTTGAGG AGCTCGCAAA GAAATATGCT AGCGCTGCCG ACAGGAAGTT GGATGCTCTC
GAAAAGTTGC TGAAGGATCA CCTCAAGGGG AGAAAAGTAC TTGTCTTCAC AGAATATGCG
ACTACCGCCG AGTACCTATT TGAGAAGCTT GTTGAGAAGC TTGAGGGTTG TAAGATCGTA
GATAGTGGGG AGGGCTACGC AAAGGCTGAC TGTAGCGAGT TTGGAGTTAT GTATGTTACA
GCTAAGGCTA GGGACAGGAT TGACGTAAGC AGGGAAGGCG CGCTCCTAGC GTCCGCGTAT
CCTACAGCTG TACTTATATC CACGGACATA ATGTCCGAGG GCGTAAACCT CCAGGCTTTC
GATGTTGTTG TTAACTACGA GGTTGTATGG AGCCCGACAA AGCATGTGCA AAGGATAGGT
AGAATCTGGA GGTTTGGACA GAAGGCAGAG AAAATACTAG TTATCGACAT GGTTCTAAAG
ACTACTCTCT CGCAGGACGA ATACAGCAAC TATTTGACTT TGTTGGAGAA ATTGTACGAG
ATATCTCTCG CTGCCCTACC ACCCCAGAGC TACGGCGAGT TCGAGATCTA TGAGGTCGAC
CAAGAGCTCA GGAAGATAGT GGAGATAGGG TCTTCGGCTT ATCTAGGAGA GGAGGAGGTC
TACGAGGCTT TAAGGTCTGG TAGGCTGGAG GACCTGCGGA GCAGGATAAA GCGGATTCTC
AAAGCTAAAG AGAACATGAG GTGGAAGAGC AGGAATGAGG TGGATGAGGG TTTACGCGTA
AAGCTTGGCT ACCCACCGGA GAAGAAGCCG GAGCCCGGGG GAGGATACTA TGTTGCCAAC
GTTACTTTTG AGCGTAACGG AGTGAAGCTT TACTCCGAGC GTATACTCCT AAGGTTGCCA
ACACCGCTGA GTAGAAGTAG GTCGGTACAG GAGGGGGTCT TCAGAGAGCT GGAAGTTCCC
TGGGATGCAG TAGTAGAGGA TACTGGGTCG CTCAAGGAAG ACGAGAGAGA GGAGGTTAAT
CGTATGGTGT GGATCGAGGT ATGGCATCCA CTGCAGCAGT ACCTTTCAAG AAACAACCTT
CCCGAAGGTG GTATTAACGT AGAAGTAAAA CGTGCAAGGG TAGAGTCTAT AGGGGTTGCA
GAGCTAATAC CCGTTACCTT GAGCTTCGAG GAGCTTGTCG AGAGAGAAGT GAGGTACAGT
AGGAACAGGG AGCGCACAGA AAGGGCTGCT GCTAGATGCA TCAGGAGTTT ACTCGAAAAT
CTGGGCTATA CAATTGTGGA GGAATATGCG AGTATTCCTC GACCCTTCGA CATGGTGGTG
AAAAAGGACG GCATTCTTTA CACGGTGGAA GTCAAAGGTA AATGGGTTGG GAAGCGTGAT
GAACCGTTAT CCTTCACTGC GAATGAGATT GACTGGGCTT CGAGGTTCCC AGATAGGCAC
ATTGTGTGCA TAGCATACGT GGACAGAGAT TACTGTGAAG ACGTGGAGTG CTACTACTTC
AATGAATTTC AGAAGAAATG GGTTCTGGAA ACTGTGAGAG GAATAGAGTA CAAGTATAAC
GCTAGGAAAA AAAAGGGAGC CGACAAAACC GAACCTCAAT AG
 
Protein sequence
MVSLIEILRS AGKDLYKHQL DFVSDALWFD EPRILLADDV GLGKTIQALL YIKALLELGR 
VNHVLVIVPR AVVEQWASEL EMFEIPFYIV ESPDFPLGHR VYLVTLDRAK VDSYMEALDK
INWDLVVVDE AHKLRLETLR SKVAILCRRA RGCLLLTATP HTGDESSFKF LIGLANSYVV
RREKKDVEEY EGRKIFPSLG YWIVQVKATK EESDALHKVL KFLENDQIEQ IVRVVVEKRA
MSSPTAFFKT LGKVVGGYCS EELLEEGELD ACIGNVAGVK KLEELAKKYA SAADRKLDAL
EKLLKDHLKG RKVLVFTEYA TTAEYLFEKL VEKLEGCKIV DSGEGYAKAD CSEFGVMYVT
AKARDRIDVS REGALLASAY PTAVLISTDI MSEGVNLQAF DVVVNYEVVW SPTKHVQRIG
RIWRFGQKAE KILVIDMVLK TTLSQDEYSN YLTLLEKLYE ISLAALPPQS YGEFEIYEVD
QELRKIVEIG SSAYLGEEEV YEALRSGRLE DLRSRIKRIL KAKENMRWKS RNEVDEGLRV
KLGYPPEKKP EPGGGYYVAN VTFERNGVKL YSERILLRLP TPLSRSRSVQ EGVFRELEVP
WDAVVEDTGS LKEDEREEVN RMVWIEVWHP LQQYLSRNNL PEGGINVEVK RARVESIGVA
ELIPVTLSFE ELVEREVRYS RNRERTERAA ARCIRSLLEN LGYTIVEEYA SIPRPFDMVV
KKDGILYTVE VKGKWVGKRD EPLSFTANEI DWASRFPDRH IVCIAYVDRD YCEDVECYYF
NEFQKKWVLE TVRGIEYKYN ARKKKGADKT EPQ