Gene Pars_0255 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0255 
Symbol 
ID5054174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp223730 
End bp225388 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content66% 
IMG OID640467834 
ProductDNA polymerase B region 
Protein accessionYP_001152521 
Protein GI145590519 
COG category[L] Replication, recombination and repair 
COG ID[COG0417] DNA polymerase elongation subunit (family B) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.604411 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.249553 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTTGTAA TTGGGCCGAG GCCGCCTGTA AAAGCCGAGC GGGTAGACTT AATCCCCCTA 
GTGAGGAGGG GCTTTGTATA CGTAGAGGCC AGCGGCGCCG TGTGGCGCAT ATGCGAGCCG
GCTGGGGACA GAGATCTGCT GAGAGCTAAG CTGGTGTCGA AGGGGTATAA GGTGTCCCAG
TCCTACATTC CCCAGTCTGT GCGGCCCTAC CTAGAGGGCA GGAGGCAGCC GAAGCTCGGC
GGCCGCGTCG CCGCGGTCTA CACGGAGGGC GGAGAGACGG GGTGGCTTAT GGAAGGCGAG
ATCGGCGTGG GGTGCCCGCC GGCTGACTAC GTGGCTTACT GGGGCGAGAG GCCGAGATGC
CCCGGCGTTT TGGTAGACCT CAAGAAGATG TACAGAAGGG GCTTGTGGGC CGAGGCGGCT
GAGAGGTTCG GCGACGGCTG GCTCCTAAAG GCGCTGAGGA GGAGGCCGGA GGACGTGCTG
GAGGCCGCCT ACGCCCTGGC CGTCGAGGCG GTCAGCTTCT TGGAGGCGCT GGCCGCGGCG
ACGGGGATAG GCGTAGACGC AATCCTGGCC GTCTTCGAGA CCAACTCAGT GGCGGCGCTG
GCTGAGGCGG TTTTCCACTT CGAGGCGGAA AGGCGGGGTT ACGTCCTGGA GGACTCGCGG
CGGCTTTTCG ACATGCCCTA CTTCGACATG GTCAGGGGCA GGGCGCCTGG GGTGTACAGG
AGGGTGGCGG AGCTGGACTT CAAGTCCCTC TTCCCCTCCC TCGTGGTTAA GTACGGCATA
GACCCCACCA CGGCAAGGCC CTGCGCGGAG GGCTTGCCGG CGGGGCCTCT TGGGAGGTAC
TGCTTCGACG GCGGGCCCGT CGCCTATGTG GTGAGGAGGT GGCTGGAGGC CAGGCTTGCC
GCGGCGGACA AGGCTCTGTC TGACGCGCTT AAGTGGCTCA TGAACGCGGG CATCGGGGCC
TGGGGCAAGG CGGGGTGGGG GATCATATGC GAGCCGTGCC TCCTCGCAGT GAGATCGGAG
GCGGCACGAC TCTTCGACGA GGCTTGGAGG AGGTTTTCCC CGATATACGG CGACACAGAC
TCCGTATACG TCGAGGAGGA GAAGGCGGGG GAGGTGCTCC GGTGGGGCTC CTCATTGGGC
GGCCTGAGGC TGGAGCTTAA GGGGGTTTGG GACGTCTTCG TGCTGGCGTG GGCGAAGGGC
GGCGGCGTTG CTGAGAAGAA CTACGTAAAG GCCAAGGGGG ATCGCTGGGA GGTAAAGGGA
GGCTTGCTGA GGCCACACGA CCTCCCCCTA GCTGTGAGGC TTAGGTACAG GGAGGTGGTG
GAGGCGGCGG CCGGCGGCGG AGATCCGCTC GGAGCCGCGG TGAAGATAGT CAAGGAGGCT
CCCCCCGAGC AGCTCTTCGT CTACAAGCCC GTCTGGAGGC GGAGGCTCGG CGAGGTGGAG
AGGGAGACCC CCCTCCACGT GGCGGCGCGG TACCTGGCCG GGAGGTGTAG ATGCGACCCG
GTGGACGTGT TTTACCTCCC CGGCAACGGC GTGGTATACG TCCCCTGGGC CGCTGTGGAC
AAGAGGTTGA GGGCAAGGCC GGCGCGGGCG GAGGAGGTTA GGCGGGCCGC CGTGGAGTAC
GTGGGGAGGC TGTGGAAGCT ACGGGCCCTG GCCCCGTGA
 
Protein sequence
MFVIGPRPPV KAERVDLIPL VRRGFVYVEA SGAVWRICEP AGDRDLLRAK LVSKGYKVSQ 
SYIPQSVRPY LEGRRQPKLG GRVAAVYTEG GETGWLMEGE IGVGCPPADY VAYWGERPRC
PGVLVDLKKM YRRGLWAEAA ERFGDGWLLK ALRRRPEDVL EAAYALAVEA VSFLEALAAA
TGIGVDAILA VFETNSVAAL AEAVFHFEAE RRGYVLEDSR RLFDMPYFDM VRGRAPGVYR
RVAELDFKSL FPSLVVKYGI DPTTARPCAE GLPAGPLGRY CFDGGPVAYV VRRWLEARLA
AADKALSDAL KWLMNAGIGA WGKAGWGIIC EPCLLAVRSE AARLFDEAWR RFSPIYGDTD
SVYVEEEKAG EVLRWGSSLG GLRLELKGVW DVFVLAWAKG GGVAEKNYVK AKGDRWEVKG
GLLRPHDLPL AVRLRYREVV EAAAGGGDPL GAAVKIVKEA PPEQLFVYKP VWRRRLGEVE
RETPLHVAAR YLAGRCRCDP VDVFYLPGNG VVYVPWAAVD KRLRARPARA EEVRRAAVEY
VGRLWKLRAL AP