Gene Pars_0984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0984 
Symbol 
ID5055465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp875815 
End bp876969 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content58% 
IMG OID640468540 
Productmetallophosphoesterase 
Protein accessionYP_001153216 
Protein GI145591214 
COG category[L] Replication, recombination and repair 
COG ID[COG0420] DNA repair exonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000094413 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAATTC TACATATGTC AGATGCCCAC TTGGGCAGAG CGCAGTACGG CCTCCCCGAG 
AGGGAGGAGG ATTACTACAA GGCGTTCCGC GAGGGCCTTA TGAGGGGGAA AACGGCAGAC
GCCGTTTTAA TAACCGGCGA CGTTTTCGAC TCCAGGAGGC CCTCCACCAG GGCTATGCTA
CGCTTTGTCG AGGCTGTGGA GGGGGTAATG CTACCCATCT ACGTCATTGG AGGGAATCAC
GACTTCAGCT ACGTGAGGTA CAGGGCAGAG GCGGAGCAGT GCGGGGGGAG TTGCGCCGGC
GATACGGTGT TGAGACTTCT AGACCGCGTA AAACTTGCAA AGCTCCTCTG CTGGAGCCCC
GCCGATCTCG GCGGCTTGGT CCTCTTCGGC GCCTGCGCTA CGCCGAGGGA TTACGCGGCG
GAGTACCGAA AGCTACTGCA GAAGGCCCCT CCAGGCGCTT TGTTGGCAAT TCACCAAGCT
GTTGAAGGCG TGCAGGCGAG GTACCCTGCA GAGGAAGATG ACTACACTAT GCCTCAGTCA
GTCTTCCAAG GGCTTTCCTA CATCCATATA GCGGCTGGTC ACATACATGA CCACTTCGCT
AAGCACCCCA TTGGCGCCGT GTGGGCCGGG TCAATAGAGG TGTGGGACTC AGGCGAGTTC
GAGACCTGGG AATACGCGGG GAGGTTTAAT AAGGTCCAAG AAAGGGCAAC TAAGGGCGTA
GTCTTGATAG ACGCGGCGGG GAGGGGGGTC TCGGTCAAGT CTATACCTCT TCCGGATTCG
AGGCCGTTGT ATAGGCTGAG AATCTACGTC AGAGAGGGCA AGGAGCTGGC CGGCGCGGTA
GAAGAGGCGG CAAGGCTCTT CGACAAGCCC GGGGCAGTTG TTAGGCTCGA GATCATGGGC
ACCGCCGAGG AGGGGGTAAA AACAAGGCAA TTGGCAGCAG CGTTTACAAA GGCGCTTTAT
GTCGACGTAG TGGACCGCAC ATCAGCCCCC CAGAGGGCCG TGGCACTGCG CGGATCTGCC
TTTGAGGAGC TGTGGCGGTT ACTTAGGGAG AAGCTGGGGG AACACGCAGA GGTGGTGATA
AAGGCGATGG AGCTGGTGAG GGAGGGCGAG AGGGAAGCGG CGTATAGACT AGTGCTTAAG
GCGCTGTATG ATTAG
 
Protein sequence
MKILHMSDAH LGRAQYGLPE REEDYYKAFR EGLMRGKTAD AVLITGDVFD SRRPSTRAML 
RFVEAVEGVM LPIYVIGGNH DFSYVRYRAE AEQCGGSCAG DTVLRLLDRV KLAKLLCWSP
ADLGGLVLFG ACATPRDYAA EYRKLLQKAP PGALLAIHQA VEGVQARYPA EEDDYTMPQS
VFQGLSYIHI AAGHIHDHFA KHPIGAVWAG SIEVWDSGEF ETWEYAGRFN KVQERATKGV
VLIDAAGRGV SVKSIPLPDS RPLYRLRIYV REGKELAGAV EEAARLFDKP GAVVRLEIMG
TAEEGVKTRQ LAAAFTKALY VDVVDRTSAP QRAVALRGSA FEELWRLLRE KLGEHAEVVI
KAMELVREGE REAAYRLVLK ALYD