Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1812 |
Symbol | |
ID | 4602049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1753782 |
End bp | 1755635 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639774585 |
Product | DEAD_2 domain-containing protein |
Protein accession | YP_921210 |
Protein GI | 119720715 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1199] Rad3-related DNA helicases |
TIGRFAM ID | [TIGR00604] DNA repair helicase (rad3) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0429091 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGTTACCG GCAGGTACGC GTTTCCCTAC AAGCCCAGGA GGCACCAGCT GGAGGTTGCC GAGAAGATCG TAGAGCTGTT GAAACGCGGG AACGTCGTGC TGGAGGCGCC TACGGGCTTC GGCAAGACAC CGGTCGTCAT CTACGCCCTT CTACCATTCA TGGAGCGCGG GGGGAGGGTC GTATGGGCGG TGAGGACGGG TAGCGAGACC GACAGGCCGG TCGAGGAGTT CAGGGTCTTC AGGGAGAAGT CGGGCGCGAG GTTCGTCGCA GTCGGGTTGC GCGGGAAGAA GGACATGTGC CTGCTGGCCG GGGAGAGGGG CGGGCAACTC GACTACAGCG AGGTATCGTA CATATGTAGC CGCGAAAGGA GGAGGTGCAA GTACTACAGG AGGCTGCAGG AAGGCGTGGA CTACTCCGAG CTCCTTGAGA GGGGCGCCCT AACTTACAGG GATGTGTTCG AGTGGGCGAG AAGAGAGGGG GTCTGCCCGT ACTTCCTCCA GCGGGAGTTG CTCAAAGTCG CGGACCTCGT CGCCCTGAGC TACAACTACG TCGTGGACGA AGATCTCTCG TGGACCCTTA GAGGAGCTTT CCCGACTAGC CAGTCCATAC TCGTAGTCGA CGAGGCGCAC AACCTCCAAG AACTCAACCT CGGCGGAGAC ACCGTGACGG AGGGGACTAT CGCGAGGGCC AAGGAGGAAG CCGCCGAGAT GGGGGACGAA GAGGTGTACG GGTTCGTGGA GGCGCTCGAG GCGAAGGTTA GGGAGAAGTA CGGCTCCCTA CCGGAGGAGG GGTCCGAGGT CTTCCGTGCG GAGGACCTCC TCGACGGCTT CGACGAAGAC ATCATCGAAA AGACTCTGCG GCTCGGCGAA CTTGTCAGGG AGAAGAGGTA CAGGAGCGGC GCGAGGCCCC GCAGTAGCGC CCACCACTTG GCGGAATTCC TCCGGAAGGC CGTCGACCTC GAAGGCGTGA AGGGCATAGC GCTCATAGCC GAGAAGGTCG ACGGCAGAGT CCACCTCAAT ATATGGGACA TGAGGTCCGG GGAGATACTC TCGGGCGTCT GGAGGAGGTT TAGAAGGGTG ATCTTCATGT CCGGGACTCT CTCGCCGATC GAAGCCTTCG CCGAGACTGT GGGGTTAAGC GACTACACCC CGGTAACCGT TCCGAGCCCC TACGACGAGA CAAACGCCTC TGTCTACCTC GTAAAGGACC TGACGACGCG GGGTGAGGAG CTCTCCGAGG AGATGGCCGA GAGGTACGTC TCGGCTGTCG GCAGGGTTTT AGGCAGGGTT AAGAGGAACA GCGCCGTGTT CACTGCTAGC TACAGGGTTC TGCAGAGGCT TCTCGAGGCT GGCTTAAAGG AGGAGGCCGA AAGGCTCGGC TACGAGGTTG TCGTTGAGCG GAGGGATATG TCCGGGCAGG AGGCAGGCGC AGCTCTCGAG AAGTTCAAAA ACCTCGCCGC GGAGGGGAGG GGGCTCCTCC TGGCCCCGAT GGGAGGCAGG TTCGCAGAGG GCGCGGACTT CCCGGGAGAG GAGCTAATGT GCGTCTTCCT GGCGGGTATC CCGTTCGAGA AGCCTACGAC GAAGACAAAC CTATACATAA AGTACTACGA GGAGCTCTAC GGCCCCGAGA AGGGGAGGCT TTACGCGTAC GTGTACCCAG CCCTGAGGAG GGCTAGCCAG GCTATCGGGA GGGCGTTGAG AAGCCCGAGG GACCAAGCGG TCATAGTTCT AGGCGATTCA AGGTATAGGA ACTACATGGG GTTGCTTCCG GACTACGTGC GGGAGCTCGC GGTGGAGGTA AAGTCGCAGG ATCTCGACTC TGTTGAACCT CCATGGGAAA AGATAAAGCT TTGA
|
Protein sequence | MVTGRYAFPY KPRRHQLEVA EKIVELLKRG NVVLEAPTGF GKTPVVIYAL LPFMERGGRV VWAVRTGSET DRPVEEFRVF REKSGARFVA VGLRGKKDMC LLAGERGGQL DYSEVSYICS RERRRCKYYR RLQEGVDYSE LLERGALTYR DVFEWARREG VCPYFLQREL LKVADLVALS YNYVVDEDLS WTLRGAFPTS QSILVVDEAH NLQELNLGGD TVTEGTIARA KEEAAEMGDE EVYGFVEALE AKVREKYGSL PEEGSEVFRA EDLLDGFDED IIEKTLRLGE LVREKRYRSG ARPRSSAHHL AEFLRKAVDL EGVKGIALIA EKVDGRVHLN IWDMRSGEIL SGVWRRFRRV IFMSGTLSPI EAFAETVGLS DYTPVTVPSP YDETNASVYL VKDLTTRGEE LSEEMAERYV SAVGRVLGRV KRNSAVFTAS YRVLQRLLEA GLKEEAERLG YEVVVERRDM SGQEAGAALE KFKNLAAEGR GLLLAPMGGR FAEGADFPGE ELMCVFLAGI PFEKPTTKTN LYIKYYEELY GPEKGRLYAY VYPALRRASQ AIGRALRSPR DQAVIVLGDS RYRNYMGLLP DYVRELAVEV KSQDLDSVEP PWEKIKL
|
| |