Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0829 |
Symbol | |
ID | 5055915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 736470 |
End bp | 739217 |
Gene Length | 2748 bp |
Protein Length | 915 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640468390 |
Product | DEAD/DEAH box helicase domain-containing protein |
Protein accession | YP_001153067 |
Protein GI | 145591065 |
COG category | [R] General function prediction only |
COG ID | [COG1201] Lhr-like helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.468511 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTTTTCC GCCTTCTGCA CCCCAAGCTG ACGGAGGCCG TTGAGGAGCT CGGCTACTCG GAGCCCACGC CGGCGCAGTC TGCCGCGATC CCGGAGATTC TGTCCGGCTC CCACGTGCTT CTAATCGCGC CTACTGGTTC CGGGAAGACG GAGGCGGCGC TTTTCCCCGT GATGTCGAGG CTTCTGGAGG CGGAAGAGAG AAGGGGCGTA AGCGCTCTTT ACATCACCCC CCTCCGGTCT TTAAACAGGG ACCTCCTCAG AAGGCTTCTC GCCATTGCGG ACAAGATCGG CCTCACCGTC GCGGTGCGCC ACAGCGATAC GCCGGAGGCT GAGAGGAGGC TCCAGGCGGC GAAGCCGCCG GACATACTCA TAACGACGCC GGAGACTCTG CAGATTCTCC TCCTCCACAA AAGCATGCGC CAGGCGTTGA GGGGGGTCCG CTTCGTGGTG GTGGACGAGG TGCACGAGCT AGTCAACAGC AAGAGGGGGG TGCAACTGGC GGTCGGCCTC GAGAGGCTGG TGGAGCTTGC GGGGGAGTTC CAGCGGGTTG GCCTCTCGGC CACTGTAGGC GCGCCGGAGC TTGTGGCGGG CTTTCTAGGA GGCGGGAGGC CCGTCAAGAT CGTCGACGTT TCGGCGGAGA AGAGGTATGA GATAGACGTG GTGTGGCCGC TCCCCTCAGA CGAGGACTAC GTAGACGCTG AGAAGTTCGA CGCCACCCCG GAGGCTGTGG CGAGGATAAA GAGGGTGGCC GAGTACGTGA AGTCGCCTAG GGGGCCCGTC CTCGTCTTCA CTAACACCAG AGACGGCGCC GAGTTCTTGG CCTCAAGGCT TAAGCAGATA CTGGGCGACG TGGTAGAGGT GCACCACTCC TCCCTTTCTA GGGAGCACAG GATCTCCGTA GAGGAGAGGC TTAAGAGGGG CGAGCTCAAG GCGGTTGTGG CCACGTCCAG CCTAGAGCTG GGCATAGACA TCGGCGACGT AGACCTCGTG GTGCAGTACG GCTCCCCGAG GCAGGTCTCG AAGCTGGTGC AACGTGTGGG GAGGGCTGGG CACAGACTCG GTCTCGTCTC CCGCGGCATC GTCGTGGCGG CCGACCTGGA GGACTACCTA GAGTCGGAGG TAATTGCAGA GAGGGCGGCG AAGGGGCTTT TGGAAAGGGA GGTGGAGTAC CACGAAAATG CCCTCGACGT GTTGCTCCAC CAAGTGGTGG GCATCGCCCT GGAGGCTAGG CTGGACGGGA GGGAGGTAGA GGTTAACTAC GTAATGCGCA TTGTTAGGAG GGCCCACCCC TACCGGAACC TAACAGAGGA GGACCTACGG CTCGTCTTGG ACTTCGCCGA GAGGCACGGC TTGCTTAAGG GGCTGAGGCC GAGGAAGGGC AGTATAAGGT ACTACTTCGA GAATGTGTCG ACTATCCCAG ACGAGAAGAG CTACAGGGCT GTGGACGACT CCACCGGGAG GGCGGTGGGG GAGCTAGATC GGGAGTTTGT CTACTCCATC GAGCCGGGGA CGAAGATAGT GTTGTCGGGG AGGGTGTGGA CCTTCGCCAG GCGGGAGGGC GACGTGGTTT ACCTCTACCC GGACTACGAC GTATCTGGCG CCCTCCCGGC GTGGCTCGGC GAGCAGATAC CTGTGCCGTA TGAAATAGCC CAGGAGGTTT GCAAGAGGCG GGCCGAGGTT CTGCTAAGGG CGTTGAGGGG AGAGGAGGGG CTGCCGGTGG ACGTGGAGGG CCTCACGCCG GAGCTAGTGC CGGCGCCGGA TAGGCTACAT GTCCACATTG TGGAGAATAG ATACGCAGTG GTCCACAGCT GTCTTGGCCA CAAGGGGAAC GAGGCGCTGG GGGCCTACCT CTCCCACGCC CTATCAGGAT ATGTGGGTCC CGTAGGGTAC AGGTCAGACG CCTACCGCGT ACTGCTCATT TTTAGAGACT TCGTCCCTCT AAACGCCTTG GAGGAGGTGT TGAGGAGGCC GCAGTGGTTT GTCTACACGA CGCTTAAAAA CGCGGTTCGG AGCTCTAAGC TGTTTCGATA CCGCTTCCTC CAAGTGGCGC GGAGGACGGG GCTTGTGTCG AAAGACGCGG AGGACGTGCC GTCGAGGCTA CCGGAGGTGT ACGCCGACGA CCTCCCCGGC GTGGAGACAC TAAACGAGAT ATTTGTGGAG AGGCTCGACG CAAAGTCCCT ACTATCCCTC CTAGAAAAAA TCGCCAACGG CGAGGTCCCA CTGGTTGTGA AGAGGCTGGC GAAGCCCACC GCCCTTGAGA AGCCGATACT GGAGGAGGCC CTCCGCCTAG ACTTCTCCTT CAAGGGGCTG TCTAGGGAGT CCCTCGCCGA TCTGGTGAGG AGGCGGATTT TGAACAAATA CGCCACGCTT CTCTGCCTGA ACTGCGGCTG GGTCTACGTA GCGAGAGCGG CCGCCCTCCC GGAAGACGTC TCGTGTCAGA AGTGCGGCGT GAGGGCCCTA GCAGTCGTGA AGGGCGTCGA CGTGGAGAAG GCGCGGCAAG TTCTCAGCAA ATACAAGATA AGACAGAAGA TGAGCAAGGA GGAGGCCAAG ATCCTCGAGC ACCTCCAGCT CAGCGCCTCA CTTGTCCTGG AGCACGGGAA GCTCGGCGTG TTGGTACAAC TCGCCCACGG TGTGGGGCCC AAGACTGCCG TCAAGATTCT GAACAAGCTC GTGGAAGGCG CCGACTTGTG GATTGCAATT ATGGACGCGG AGCGGCAGTT CGCCACCACC AGGGCGTTCT GGGACTAG
|
Protein sequence | MVFRLLHPKL TEAVEELGYS EPTPAQSAAI PEILSGSHVL LIAPTGSGKT EAALFPVMSR LLEAEERRGV SALYITPLRS LNRDLLRRLL AIADKIGLTV AVRHSDTPEA ERRLQAAKPP DILITTPETL QILLLHKSMR QALRGVRFVV VDEVHELVNS KRGVQLAVGL ERLVELAGEF QRVGLSATVG APELVAGFLG GGRPVKIVDV SAEKRYEIDV VWPLPSDEDY VDAEKFDATP EAVARIKRVA EYVKSPRGPV LVFTNTRDGA EFLASRLKQI LGDVVEVHHS SLSREHRISV EERLKRGELK AVVATSSLEL GIDIGDVDLV VQYGSPRQVS KLVQRVGRAG HRLGLVSRGI VVAADLEDYL ESEVIAERAA KGLLEREVEY HENALDVLLH QVVGIALEAR LDGREVEVNY VMRIVRRAHP YRNLTEEDLR LVLDFAERHG LLKGLRPRKG SIRYYFENVS TIPDEKSYRA VDDSTGRAVG ELDREFVYSI EPGTKIVLSG RVWTFARREG DVVYLYPDYD VSGALPAWLG EQIPVPYEIA QEVCKRRAEV LLRALRGEEG LPVDVEGLTP ELVPAPDRLH VHIVENRYAV VHSCLGHKGN EALGAYLSHA LSGYVGPVGY RSDAYRVLLI FRDFVPLNAL EEVLRRPQWF VYTTLKNAVR SSKLFRYRFL QVARRTGLVS KDAEDVPSRL PEVYADDLPG VETLNEIFVE RLDAKSLLSL LEKIANGEVP LVVKRLAKPT ALEKPILEEA LRLDFSFKGL SRESLADLVR RRILNKYATL LCLNCGWVYV ARAAALPEDV SCQKCGVRAL AVVKGVDVEK ARQVLSKYKI RQKMSKEEAK ILEHLQLSAS LVLEHGKLGV LVQLAHGVGP KTAVKILNKL VEGADLWIAI MDAERQFATT RAFWD
|
| |