Gene Pars_2321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2321 
Symbol 
ID5054542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp2074588 
End bp2077971 
Gene Length3384 bp 
Protein Length1127 aa 
Translation table11 
GC content56% 
IMG OID640469873 
ProductDNA-directed RNA polymerase subunit B 
Protein accessionYP_001154517 
Protein GI145592515 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR03670] DNA-directed RNA polymerase subunit B 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.388752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.000074653 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTTGATC TACTTCCTCT ACCCGTGATA TCACCCTCTG GAGACGGAGG GCTTCTTACA 
AAAGACGATA GGTGGGCCTT GGTGGAGAGG TTTATAAAAG ACAAGGGGCT CGCCAGCCAC
CAGATAAAGT CTTTCAACGA CTTCCTAGAC AAGAAGCTCC CCCGCATTGT CGAGGATTTC
AAGGTGGTTG ATACAGAGAT TAAGGGGCTG AAACTGGTGT TGGAGAAAAT CGAGGTTGGG
TGGCCGAGGA TTAAGGAGTC GGACGGTTCC GAGTCCCTTA TCTACCCCAT GGAGGCTCGG
CTCCGCAACG CCACCTACTC GGCGCCCTTG TACCTAACCG CGGTTTTGTA CGTAGATGAT
GAGCCCTACG CCACAGAAAC GTTCTACATA GGGGAGTTGC CCATAATGGT TAAGTCGAAA
CGTTGCAACT TGACCCGGCT GAGGCCAAGT GAGTACCCTA AGAGGTTTGA GGACCCCCAA
GACTTCGGCG GCTACTTTAT CATAAACGGG AGCGAGCGGG TTATTATAAG CCAGGAGGAC
CTCGTCGCTG ATAGGCCAAT TTACGACAAG GGCGACAAGC CCTCCGTGAA GTTCTTGGCC
AAGACTATAT CCACCGGCAT AGGGTACAGA AGCACGTTGA CTGTTGAGCT GAACAAAGAC
GGGGTGATTT ACGCCACGCT TTCGGCAATA CCCGTCAAAA TACCCTTCCC CATTTACATG
AAGGCTCTCG GCCTCGAGAC AGACGAGGAC GTGGTGAAGG CCGTGTCGGA CGACCCAGAC
ATACAGAAAG AGCTCCTCCC CTCACTCGTG GTTGCCAACC AGATAGCGAT AACCCGCGAA
GACGCGCTTG ACTACATAGG CGGCAAGGTG GCAGTGGGGC AACCCCGGCC CGTCAGAGTG
GAGCGGGCGT TGCAACTGCT AGATAGGTAC TTCCTGCCTC ACCTTGGCAC CACGGTGCCG
GATGAGAAGA AGCAACAAGA AATTAGGCTG AAAAAGGCGC TGATGCTGGG GCAAATTGTT
AAGGGTCTTG TGGAGCTCCA GCTGGGGAGG AGGAAGCCCG ACGATAAGGA CCACGTGGCA
AACAAGAGGG TGCGCTTAGT TGGCGACTTG ATGACCCAGC TCTTCCGCAC GGTGTTTAAG
CAGTTGCTCC AGGAGCTGAG GAGCCAGCTG GAGAAGTACT ACGCCAGGGG GAGGATACCC
CACCTGCAGA CAATTGTAAG GCCGGACATA ATAACCGAGC GCGTCAGGCA AGCCCTAGCT
ACGGGTAACT GGGTAGGGGG CAAGACGGGT GTCTCCCAGA TTTTAGACCG CACCAACTAC
CTCTCCACTC TGAGCTACCT GAGGCGTGTT GTGTCCTCCC TATCCAGGAC CCAGCCCCAC
TTCGAGGCCC GCGACCTCCA CCCAACCCAG TGGGGGAGGC TGTGCGCAGT TGAGACGCCT
GAGGGCCAAA ACGTGGGGCT TGTGAAGAAC CTCGCCCTCC TCGCCGAGAT AACCACTGGT
GTGGACGAAA ACGATGTTGA ACAGATGCTC TTACAGCAGG GCGTCGTGCC TATACTAAAA
GCTAGAGAGG AGGGGGTCCG TGGCGCCGAG GTCTACCTAA ACGGGAGGCT GATAGGTATC
CACCCAGAAC CGGAAGAATT GGTAAAGACG GTAAGGAGCT TGAGGAGGCA GGGCAAGATA
AGCGATGAGA TAAACATCGC TTACCTCAAC GGCGTCGTTT ACGTGAACAG CGACGGGGGG
CGCATAAGGA GGCCTCTCCT AGTCGTGGAA GACGGGAAGC TGAAACTCAC AAAGGACATT
GTTGAGAGGG TGAAGAGGGG GGAGCTCACC TGGGACGACC TATTAAAAAT GGGCGTAGTG
GAGTACCTAG ACGCCGACGA AGAAGAAAAC GCCCACATAG CTGTTGACCC CGAGGGCGAC
CTAAGCAATT ACACCCATGT GGAGATTATA CCATCCTCCA TCCTGGGGGC AATTGCCTCG
ATTATACCCT TCCTAGAGCA CAACCAGTCG CCGAGAAACC AATACGAGGC CGCGATGGCC
AAGCAGAGTC TGGGCTTGCC GCAGTCCAAC TTCTTGTACA AGCTGGACTC CAGAGGTCAT
ATGTTGTACT ACCCAGAGAG GCCGATAGTG ACTACCAGGG GTCTGGAGTT GGTGGGTTAT
TCGAAGCGGC CTGCAGGCCA GAACGCCGTA GTGGCTCTGC TCACCTATAC GGGGTACAAC
ATCGAGGATG CCGTCATCCT AAACAAGGCG TCAGTGGAGC GCGGCATGTT CCGCTCGGTG
TTCTACCGCA CATACGAGAC GGAGGAGCAG AGATACCCAG GCGGCGAGGA GGACAAAATC
GAAATACCTG ACAGCTCGGT CAAGGGGTAT AGGGGGCCAG AGGCCTACAG CCACCTAGAC
GAAGACGGCA TAGCCCCGCC TGAGGTGTAT GTGAGTAGCA GCGAGGTGTT GATAGGCAAA
ACATCTCCGC CGAGGTTCTA CACCACGCTG GAGACAGAGC GGATACTGAA AGAGAGACGT
GACGCCTCGG TGGCCGTAAG GCGCGGGGAA AAGGGTATAG TGGACAGGGT CATAGTCACG
GAGTCTCCCG AGGGCAACAA GCTTGTTAAG GTGAGGCTGA GGGAGCTCAG AATTCCGGAG
CTCGGCGACA AGTTCGCAAG CCGGCACGGC CAGAAGGGAG TTGTGGGGAT GTTGCTTAGA
CAAGAGGATA TGCCATTCAC AGAGGAGGGC ATTGTCCCCG ACATAATCGT AAACCCACAT
GCCCTGCCCT CGCGTATGAC CGTCGCCCAG TTGCTAGAAA GCATGGCCGG GAAGGTCGGC
GCCGCCACAG GGAACCTAGT AGACGCCACA CCCTTTGAGG GGGTAAAGGA GGAGGACTTG
AGGAAGCTGT TGCTAAAACT CGGCTACAAG TGGGACGGCA AAGAGGTCAT GTACAGCGGC
ATAACCGGCG AGAAGCTCGT AGCGGACATC TTCATAGGCA TTGTGTACTA CCAGAAGCTA
CACCACATGG TAGCCGACAA GATACACGCC CGCGCTAGGG GCCCCGTGCA GATCCTCACG
AGACAACCCA CCGAGGGCCG CTCCCGCGAA GGCGGCCTAA GGCTGGGAGA GATGGAGCGC
GACGTCTTGA TAGCGCACGG CGCCTCGGCG TTGCTCTACG AGAGGCTTGT GGAGTCGAGC
GATAAGTACA CGATGTATGT CTGCGAGCTG TGCGGCCTGC CGGCGTATCT GGATGCCAAG
AGTAATAAGC CGAAGTGCCC AATCCACGGC GATACGGGGC AGTTCGCCAA GGTCACGGTG
CCCTATGCCT TTAAGCTTCT GCTTCAAGAG CTGATTGCGC TGGGTATATA CCCCAAGCTG
GAGCTCTCTG AGGTGCTGGA CTAA
 
Protein sequence
MVDLLPLPVI SPSGDGGLLT KDDRWALVER FIKDKGLASH QIKSFNDFLD KKLPRIVEDF 
KVVDTEIKGL KLVLEKIEVG WPRIKESDGS ESLIYPMEAR LRNATYSAPL YLTAVLYVDD
EPYATETFYI GELPIMVKSK RCNLTRLRPS EYPKRFEDPQ DFGGYFIING SERVIISQED
LVADRPIYDK GDKPSVKFLA KTISTGIGYR STLTVELNKD GVIYATLSAI PVKIPFPIYM
KALGLETDED VVKAVSDDPD IQKELLPSLV VANQIAITRE DALDYIGGKV AVGQPRPVRV
ERALQLLDRY FLPHLGTTVP DEKKQQEIRL KKALMLGQIV KGLVELQLGR RKPDDKDHVA
NKRVRLVGDL MTQLFRTVFK QLLQELRSQL EKYYARGRIP HLQTIVRPDI ITERVRQALA
TGNWVGGKTG VSQILDRTNY LSTLSYLRRV VSSLSRTQPH FEARDLHPTQ WGRLCAVETP
EGQNVGLVKN LALLAEITTG VDENDVEQML LQQGVVPILK AREEGVRGAE VYLNGRLIGI
HPEPEELVKT VRSLRRQGKI SDEINIAYLN GVVYVNSDGG RIRRPLLVVE DGKLKLTKDI
VERVKRGELT WDDLLKMGVV EYLDADEEEN AHIAVDPEGD LSNYTHVEII PSSILGAIAS
IIPFLEHNQS PRNQYEAAMA KQSLGLPQSN FLYKLDSRGH MLYYPERPIV TTRGLELVGY
SKRPAGQNAV VALLTYTGYN IEDAVILNKA SVERGMFRSV FYRTYETEEQ RYPGGEEDKI
EIPDSSVKGY RGPEAYSHLD EDGIAPPEVY VSSSEVLIGK TSPPRFYTTL ETERILKERR
DASVAVRRGE KGIVDRVIVT ESPEGNKLVK VRLRELRIPE LGDKFASRHG QKGVVGMLLR
QEDMPFTEEG IVPDIIVNPH ALPSRMTVAQ LLESMAGKVG AATGNLVDAT PFEGVKEEDL
RKLLLKLGYK WDGKEVMYSG ITGEKLVADI FIGIVYYQKL HHMVADKIHA RARGPVQILT
RQPTEGRSRE GGLRLGEMER DVLIAHGASA LLYERLVESS DKYTMYVCEL CGLPAYLDAK
SNKPKCPIHG DTGQFAKVTV PYAFKLLLQE LIALGIYPKL ELSEVLD