Gene Tpen_1812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1812 
Symbol 
ID4602049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1753782 
End bp1755635 
Gene Length1854 bp 
Protein Length617 aa 
Translation table11 
GC content61% 
IMG OID639774585 
ProductDEAD_2 domain-containing protein 
Protein accessionYP_921210 
Protein GI119720715 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1199] Rad3-related DNA helicases 
TIGRFAM ID[TIGR00604] DNA repair helicase (rad3) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0429091 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGTTACCG GCAGGTACGC GTTTCCCTAC AAGCCCAGGA GGCACCAGCT GGAGGTTGCC 
GAGAAGATCG TAGAGCTGTT GAAACGCGGG AACGTCGTGC TGGAGGCGCC TACGGGCTTC
GGCAAGACAC CGGTCGTCAT CTACGCCCTT CTACCATTCA TGGAGCGCGG GGGGAGGGTC
GTATGGGCGG TGAGGACGGG TAGCGAGACC GACAGGCCGG TCGAGGAGTT CAGGGTCTTC
AGGGAGAAGT CGGGCGCGAG GTTCGTCGCA GTCGGGTTGC GCGGGAAGAA GGACATGTGC
CTGCTGGCCG GGGAGAGGGG CGGGCAACTC GACTACAGCG AGGTATCGTA CATATGTAGC
CGCGAAAGGA GGAGGTGCAA GTACTACAGG AGGCTGCAGG AAGGCGTGGA CTACTCCGAG
CTCCTTGAGA GGGGCGCCCT AACTTACAGG GATGTGTTCG AGTGGGCGAG AAGAGAGGGG
GTCTGCCCGT ACTTCCTCCA GCGGGAGTTG CTCAAAGTCG CGGACCTCGT CGCCCTGAGC
TACAACTACG TCGTGGACGA AGATCTCTCG TGGACCCTTA GAGGAGCTTT CCCGACTAGC
CAGTCCATAC TCGTAGTCGA CGAGGCGCAC AACCTCCAAG AACTCAACCT CGGCGGAGAC
ACCGTGACGG AGGGGACTAT CGCGAGGGCC AAGGAGGAAG CCGCCGAGAT GGGGGACGAA
GAGGTGTACG GGTTCGTGGA GGCGCTCGAG GCGAAGGTTA GGGAGAAGTA CGGCTCCCTA
CCGGAGGAGG GGTCCGAGGT CTTCCGTGCG GAGGACCTCC TCGACGGCTT CGACGAAGAC
ATCATCGAAA AGACTCTGCG GCTCGGCGAA CTTGTCAGGG AGAAGAGGTA CAGGAGCGGC
GCGAGGCCCC GCAGTAGCGC CCACCACTTG GCGGAATTCC TCCGGAAGGC CGTCGACCTC
GAAGGCGTGA AGGGCATAGC GCTCATAGCC GAGAAGGTCG ACGGCAGAGT CCACCTCAAT
ATATGGGACA TGAGGTCCGG GGAGATACTC TCGGGCGTCT GGAGGAGGTT TAGAAGGGTG
ATCTTCATGT CCGGGACTCT CTCGCCGATC GAAGCCTTCG CCGAGACTGT GGGGTTAAGC
GACTACACCC CGGTAACCGT TCCGAGCCCC TACGACGAGA CAAACGCCTC TGTCTACCTC
GTAAAGGACC TGACGACGCG GGGTGAGGAG CTCTCCGAGG AGATGGCCGA GAGGTACGTC
TCGGCTGTCG GCAGGGTTTT AGGCAGGGTT AAGAGGAACA GCGCCGTGTT CACTGCTAGC
TACAGGGTTC TGCAGAGGCT TCTCGAGGCT GGCTTAAAGG AGGAGGCCGA AAGGCTCGGC
TACGAGGTTG TCGTTGAGCG GAGGGATATG TCCGGGCAGG AGGCAGGCGC AGCTCTCGAG
AAGTTCAAAA ACCTCGCCGC GGAGGGGAGG GGGCTCCTCC TGGCCCCGAT GGGAGGCAGG
TTCGCAGAGG GCGCGGACTT CCCGGGAGAG GAGCTAATGT GCGTCTTCCT GGCGGGTATC
CCGTTCGAGA AGCCTACGAC GAAGACAAAC CTATACATAA AGTACTACGA GGAGCTCTAC
GGCCCCGAGA AGGGGAGGCT TTACGCGTAC GTGTACCCAG CCCTGAGGAG GGCTAGCCAG
GCTATCGGGA GGGCGTTGAG AAGCCCGAGG GACCAAGCGG TCATAGTTCT AGGCGATTCA
AGGTATAGGA ACTACATGGG GTTGCTTCCG GACTACGTGC GGGAGCTCGC GGTGGAGGTA
AAGTCGCAGG ATCTCGACTC TGTTGAACCT CCATGGGAAA AGATAAAGCT TTGA
 
Protein sequence
MVTGRYAFPY KPRRHQLEVA EKIVELLKRG NVVLEAPTGF GKTPVVIYAL LPFMERGGRV 
VWAVRTGSET DRPVEEFRVF REKSGARFVA VGLRGKKDMC LLAGERGGQL DYSEVSYICS
RERRRCKYYR RLQEGVDYSE LLERGALTYR DVFEWARREG VCPYFLQREL LKVADLVALS
YNYVVDEDLS WTLRGAFPTS QSILVVDEAH NLQELNLGGD TVTEGTIARA KEEAAEMGDE
EVYGFVEALE AKVREKYGSL PEEGSEVFRA EDLLDGFDED IIEKTLRLGE LVREKRYRSG
ARPRSSAHHL AEFLRKAVDL EGVKGIALIA EKVDGRVHLN IWDMRSGEIL SGVWRRFRRV
IFMSGTLSPI EAFAETVGLS DYTPVTVPSP YDETNASVYL VKDLTTRGEE LSEEMAERYV
SAVGRVLGRV KRNSAVFTAS YRVLQRLLEA GLKEEAERLG YEVVVERRDM SGQEAGAALE
KFKNLAAEGR GLLLAPMGGR FAEGADFPGE ELMCVFLAGI PFEKPTTKTN LYIKYYEELY
GPEKGRLYAY VYPALRRASQ AIGRALRSPR DQAVIVLGDS RYRNYMGLLP DYVRELAVEV
KSQDLDSVEP PWEKIKL