Gene Pars_1208 details

Gene Information

Locus tag: Pars_1208
Symbol:
ID: 5055302
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1094646
End bp: 1096346
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 60%
IMG OID: 640468755
Product: acylphosphatase
Protein accession: YP_001153428
Protein GI: 145591426
COG category: [C] Energy production and conversion; [E] Amino acid transport and metabolism
COG ID: [COG0709] Selenophosphate synthase; [COG1254] Acylphosphatases
TIGRFAM ID:
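
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


length = gene_length(1094646, 1096346)
print(length)                  # 1701
print(protein_length(length))  # 566
```

Both results agree with the Gene Length and Protein Length fields of the record.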


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCATGG AGAGGTTTAG GGAGAGGGTG AGGCTGTATA GGGAGGCGGG CATTGCCCTG 
GAGTCGCTGT CGCTGGGGTG CTCGGTGAAG GTAGACCTCT ACAACGTGTT GTACCCAGCC
CTCCAGCTGT TGAAAGACGA AGTGTACAAG CTCAACCTCG TCATCGCGCC GAGGGAAGAC
GCGGCGATAA TGCCAGGTGA GGGCGCCTAC CTAAGGCGGT ATTTCCTCAA CGCGGAGGAG
CCGTGGCTAG AGCCCAGCGA GATTGAGAAG CTGGCCCCCA CGGTGGCTAT TGTGCTGGCC
CAGCTCTACA TGGGAAAGGC CGCCTCTGCA GACGTCTTTG CTAAGTACGT CGCCAAGCTC
TACAAGGCGC TGGGCTCCTC GAGGCACAAG GTGTGGCTTG GCAAGGGGCA TAGCATAGTC
AGTACCAAGA AGGGGGCTGA GTTCTTCATG GTTGATTTCA TCAAGGCGGA GGGGTCTAGG
GGCTACGTCG TCGCCAACAA CGACACGATA CAAGTCATCG ACCCTTCTGA GGATCTAGAT
TCGCAACTAC AAATAGCCGT GGCGGTGAAC AACGCGCTTA ACGATCTCTT CACCAAGGGA
GCCTGGAAGG ACTTGCACAT AGCCCCGGTA TACGATGGGC CCAGCGCCTA TAAGGCCTCC
ATAAAGGCGA AGGTGGAGGG GTACGCCTCG TCGCTGGGGA AGCTGGTGGA GGCGCCGCAG
CCTGATATGG GCTACCTCCT CCTCGGCGCC ACGGCTTACG CCTATCTAGA CAGGGAGCCC
CCCCTCTTCT ACAAACAGCT AGACGAGGGC TTCGTGGTGG TCGTCACGAG ACCCTTCGGC
GAGCTGGCAT TCTTCACCAC CTACGTCGCT GTCCACACAG ACGAATTCTT GTTGCAGAGG
TTTGAAAGGG AGGTTATGTC ACTGGAGCAA TTCGAGAGGG AGAAGCGGAG GGTGCTGGAG
GTCATGGCCA CGCCCAACTT GGAGGTGGCT AAGGCGATCT ACGAGTTCCT CCCCGACCTG
GGCGAGGCCT TTGACCCCGC AAGCCACATC GCCGCCACCA TCGACGTGTC GGGGCCCGGC
GTATTTGTGT TTAAGGAGGT GGCTGAAAAA GCCGGCGTCG ATATAAGGCT ACTCGACGTG
CCCCTCATGT CCGATCGAAT CTCGGCATTC GCCGCGGAGA ACTACATCAT GCCTGACGCC
ACTGCCGGCA CCAACGGCGC AATCGCCATC TTTGCCCACA AGCGGCTAGC CGACGAGTTG
ATCCAGAGGC TGTCCAAGGC GCCGCACGCA AGGCCTTTGG TAATAGGCGA AGTCGTGGGC
AAGGGGGAGG GGAAGCTTGT AGTGCCCGAG TGGGCACTTA AATACATCTC CAGCAACAAG
CTGAGGGAGA AGCTGGGCGC CCGCCAAATC CTCGGCGGCC TTTCCAGCGT GGTATCGAGG
CCTGTGAGGG CTGTGGCGTA TGTAGAGGGG AGAGTCCAGG GCGTGGGATT CCGGCCCATG
GCCCGGGCCA GGGCAAAGGC CCTATCCCTC GTGGGCTACG CCAAGAACCT CCCCGACGGC
AGGGTTGAGG TGGTTGTGGA AGGCGACGAG GAGAGGGTTA GGAAGTTCGT GGAGGAGCTG
TGCCGCGGCT TCGACGACTG CCGCGTCTCG GCGACTTACA GCCCCGCCAC GGGCAAGTTC
AAGGATTTCG AAATTTCATG A
 
Protein sequence
MSMERFRERV RLYREAGIAL ESLSLGCSVK VDLYNVLYPA LQLLKDEVYK LNLVIAPRED 
AAIMPGEGAY LRRYFLNAEE PWLEPSEIEK LAPTVAIVLA QLYMGKAASA DVFAKYVAKL
YKALGSSRHK VWLGKGHSIV STKKGAEFFM VDFIKAEGSR GYVVANNDTI QVIDPSEDLD
SQLQIAVAVN NALNDLFTKG AWKDLHIAPV YDGPSAYKAS IKAKVEGYAS SLGKLVEAPQ
PDMGYLLLGA TAYAYLDREP PLFYKQLDEG FVVVVTRPFG ELAFFTTYVA VHTDEFLLQR
FEREVMSLEQ FEREKRRVLE VMATPNLEVA KAIYEFLPDL GEAFDPASHI AATIDVSGPG
VFVFKEVAEK AGVDIRLLDV PLMSDRISAF AAENYIMPDA TAGTNGAIAI FAHKRLADEL
IQRLSKAPHA RPLVIGEVVG KGEGKLVVPE WALKYISSNK LREKLGARQI LGGLSSVVSR
PVRAVAYVEG RVQGVGFRPM ARARAKALSL VGYAKNLPDG RVEVVVEGDE ERVRKFVEEL
CRGFDDCRVS ATYSPATGKF KDFEIS
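
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as starts), and it builds the codon table by hand rather than relying on a library. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first and last blocks of the gene sequence are used here, to keep the example short.

```python
from itertools import product

# Codon table in TCAG order. Assumption: table 11 shares these
# amino acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


first_block = "ATGAGCATGG AGAGGTTTAG GGAGAGGGTG"  # first 30 bases of the gene
last_block = "AAGGATTTCG AAATTTCATG A"           # last 21 bases, in frame

print(translate(first_block))  # MSMERFRERV
print(translate(last_block))   # KDFEIS
```

The two outputs match the start and end of the protein sequence shown above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.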