Gene Pars_1884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1884 
Symbol 
ID5055604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1690356 
End bp1692716 
Gene Length2361 bp 
Protein Length786 aa 
Translation table11 
GC content57% 
IMG OID640469430 
ProductDNA polymerase II 
Protein accessionYP_001154087 
Protein GI145592085 
COG category[L] Replication, recombination and repair 
COG ID[COG0417] DNA polymerase elongation subunit (family B) 
TIGRFAM ID[TIGR00592] DNA polymerase (pol2) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.115615 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGATCC GGGTTTGGCC TCTTGACGTC ACTTACAGCG TAGTAGGCGG TGTGCCCGAG 
GTGAGAGTTT TCGGCATAGC AGAGGGGGGC GAGAGGGTGG TCCTCGCGGA TAGGTCTTTT
AGGCCGTATT TCTACGTAGA TTGTGCCGGT TGTGATCCTC ATGCCGTGAA GACGCACCTC
GGCAGGACGG CCCCAGTGGA GGGTGTGGAA TTGGTTGAGA GGCGATTTCT GGGTCGGCCG
CGGCAGTTTT TGAAAGTAGT GGCCAAGATC CCGGAGGATG TCAGAAGGCT GAGGGAGGTC
GCCTTTACAA TTCCGGGGGT GCGGGGGGTG TACGAGGCCG ACATACGGTT CTACATGAGG
TATATGATAG ACATGGGGGT TGTCCCTTGT AGCTGGAACG TCGCCGAGGT GGAAGTGGCA
GATGAGAAGC TGGGGTTTCT TCCGGTGTAC AGGGTGGTGA AGTGGGGCGG CGCCGTTGAG
GGGTTTCCCC CACCTCTCCG GGTCTTGGCG TTTGACATAG AGGTTTACAA CGAGCGGGGG
ACCCCCGACC CGGCGAGGGA TCCCATCGTC ATGATCGCTG TGCAGTCCAG CGACGGCCGG
CTTGAGGTTT TCGAGGCCTC TGGCAGGGAT GACAGAGGAG TGCTCCGCTC GTTTGTGGAG
TACGTGAGGT CTTTCGACCC CGATGTCGTT GTCGGCTACA ACTCGAACAA CTTTGACTGG
CCCTACCTCG CGGAGCGCGC CAAGGCTGTC GGCGTACCGC TGAGAGTAGA TAGGCTTGGC
GGCGCGCCTC AGCAGAGCGT CTACGGCCAC TGGTCGGTGC TGGGGAGGGC CAACGTCGAC
CTCTATAACA TCGTCGACGA GTTCCCCGAA ATTAAGCTGA AGACTTTGGA CAGGGTGGCG
GAGTACTTCG GCGTAATGCG GCGGGATGAA AGGGTCTTAA TACCGGGGCA CAAGATATAC
GAGTACTGGC GGGACCCCTC CAAGAGGCCG CTGCTCCGGC AGTACGTAAT AGACGACGTG
AGGTCCACAT ACGGCCTTTC CGAGAAGCTG TTGCCGTTTC TCATACAGCT CTCCTCCGTT
TCCGGCCTCC CGCTTGACCA AGTAGCGGCG GCGAGCGTTG GGAACAGGGT TGAGTGGATG
CTCCTCAGAT ACGCCTACGG GCTGGGCGAG GTGGCGCCAA ACCGCGAGGA GCGGGAGTAC
GAGCCGTACA AAGGCGCAAT CGTGCTGGAG CCCAAGCCCG GCCTCTACGG CGATGTACTC
GTCCTTGACT TCTCCTCGAT GTACCCCAAC GTCATGATGC GGTACAACCT CTCTCCGGAC
ACCTACCTAG AGCCCGGCGA GCCTGATCCG CCGGAGGGCG TATACGTGGC GCCGGAGGTG
GGGCATAGGT TTAGGAAGGA GCCCCCCGGC TTTATACCAC AAGTGTTGAG ACGGCTTGTG
ACGCTCCGGA GGGCGGTGAG GGAGGAGATG AAGAAGTACC AGCCAGAAAC CTCGGAGTAC
AGAATACTCG ATGAGAGGCA GAAGGCGTTG AAGATTATGG CAAACGCCAT GTATGGCTAC
ACGGGGTGGG TCGGCGCGAG GTGGTACAAG AAAGAAGTAG CCGAGTCAGT CACCGCCTTT
GCCAGGGCCA TACTAAAAGA CGTGATCGGC TACGCGAGGC GGCTGGGGAT CATGGTGATA
TACGGCGACA CTGACAGCCT CTTTGTGAAG AAGGGCGGCG ATCTGGAGAA ACTTGCGAGG
TACGTAGATG AGAAGTACGG AATAGAGATC AAGGCGGATA AGGACTACGA GAAGGTCCTA
TTTACCGAGG CCAAGAAGAG GTACGCCGGC CTTCTCAGAG ATGGGAGAAT AGACATCGTG
GGATTCGAAG TGGTGAGAGG GGATTGGAGC GAGCTGGCCA AGGAGGTCCA GCTCAAGGTA
ATCGAGCTTA TACTCAAGGC GCGGGACCTC TCTGAGGCAA GGCAGAGAGT AATTAAGTAC
GTAAAGGATG TTATCGAGAG GCTGAAAAGC GGAAAGTTCG ACCTAGACGA CCTCATAATC
TGGAAAACAC TTGACAAGGA ACTAGGCGAA TACAAGGCCC ATCCCCCGCA TGTGCGGGCG
GCGCTCATAT TGAAGAAGAA GGGGTACAAG GTGGGGAGGG GGACCACCAT CGGCTACGTC
ATCGTTAAGG GAGGCGAGAA AGTATCCGAG AGGTCGCTTC CCTACATTCT GGTAGACGAC
TTGGCGAAAA TAGACATTGA CTACTACATA GAGAAGCAGG TGATACCAGC TGCTCTGAGA
ATTGCAGAAG TCATAGGGGT GAAAGAAGGG GACTTGAGAG CAGGTAGGAG CGAAAAATCG
TTGCTTGACT TCTTTGAGTG A
 
Protein sequence
MEIRVWPLDV TYSVVGGVPE VRVFGIAEGG ERVVLADRSF RPYFYVDCAG CDPHAVKTHL 
GRTAPVEGVE LVERRFLGRP RQFLKVVAKI PEDVRRLREV AFTIPGVRGV YEADIRFYMR
YMIDMGVVPC SWNVAEVEVA DEKLGFLPVY RVVKWGGAVE GFPPPLRVLA FDIEVYNERG
TPDPARDPIV MIAVQSSDGR LEVFEASGRD DRGVLRSFVE YVRSFDPDVV VGYNSNNFDW
PYLAERAKAV GVPLRVDRLG GAPQQSVYGH WSVLGRANVD LYNIVDEFPE IKLKTLDRVA
EYFGVMRRDE RVLIPGHKIY EYWRDPSKRP LLRQYVIDDV RSTYGLSEKL LPFLIQLSSV
SGLPLDQVAA ASVGNRVEWM LLRYAYGLGE VAPNREEREY EPYKGAIVLE PKPGLYGDVL
VLDFSSMYPN VMMRYNLSPD TYLEPGEPDP PEGVYVAPEV GHRFRKEPPG FIPQVLRRLV
TLRRAVREEM KKYQPETSEY RILDERQKAL KIMANAMYGY TGWVGARWYK KEVAESVTAF
ARAILKDVIG YARRLGIMVI YGDTDSLFVK KGGDLEKLAR YVDEKYGIEI KADKDYEKVL
FTEAKKRYAG LLRDGRIDIV GFEVVRGDWS ELAKEVQLKV IELILKARDL SEARQRVIKY
VKDVIERLKS GKFDLDDLII WKTLDKELGE YKAHPPHVRA ALILKKKGYK VGRGTTIGYV
IVKGGEKVSE RSLPYILVDD LAKIDIDYYI EKQVIPAALR IAEVIGVKEG DLRAGRSEKS
LLDFFE