Gene Pars_2292 details


Gene Information

Locus tag: Pars_2292
Symbol:
ID: 5054129
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 2052504
End bp: 2053967
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 52%
IMG OID: 640469844
Product: metal dependent phosphohydrolase
Protein accession: YP_001154488
Protein GI: 145592486
COG category: [R] General function prediction only
COG ID: [COG1078] HD superfamily phosphohydrolases
TIGRFAM ID:
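
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are inclusive and that the final codon is an untranslated stop codon. The variable names are hypothetical; the numbers are copied from the fields above.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2052504
end_bp = 2053967
gene_length_bp = 1464
protein_length_aa = 487

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Each codon is 3 bp. Assumption: the last codon is a stop codon
# and does not contribute an amino acid to the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span (bp):", span_bp)
print("codons:", codons)
print("expected protein length (aa):", expected_protein_aa)
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus one matches the listed protein length.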


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0322637
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGTTTA TAAATTGGTA TATTGTCCAC CGCGTGCAGT TTAGGAAAGC GATTAGAGAT 
CCGGTGCATG GGTTCATAAA GCTCACGGAG GAGGAGGTGA GGTTTATTGA CGGCGAGCCC
ATCATCCAGA GGCTTAGGTA CGTTAAGCAA CTTGGCTTTG TCTACCTCGT ATACCCAACC
GCCACACACA CCAGGTTTGA CCACTCCCTG GGCGTCATGC ACATAGCCAC TCAGCTCGGG
CACCGGATAA TGGAGCAGAG GGGGGAGTTC GACGAGGTTT TGTTGAAGCA TCTCCGCATG
GCAGCTCTGT TACACGACGT GGGCCACCTG CCTTTTTCCC ACTCTTTTGA GATCCTCACT
AGAGAGCTCC TCCACATGGC AACTGTGAGG GGCTGTTTGG AGGTGGATCT GGCCTTGTTT
GACAGGTCTC AGAAACCGCA CGAAGTAACC ACAAAGTTGC TTGTAGAGAA ACTGAGCGAT
AGGCTCTCGG CCCTTGGGTA CAATCCCTCT CTTGTCTTAG GGCTCTTGTT TGAGCCGCCG
AGTAAGTATC GTCTCTACAG TAGTATTCTA TCTGGCGTAT TCGACGCAGA TAGGCTTGAC
TATATTATGC GCGACATGTA CTTTACAGGC GCGGCGGTGG GGACCAGCTT TACCCACATA
GATCTTGAGC GGATAGTGGA AAATCTCGAA GTGGTGGGCG ATAGCTTCCA GTTTAACGAA
AAGGCGAGGG TAAATTTGGA GGGCTATATA ATTACGCGGT ACAACCTCTA CCGCCACGTT
TACCTGCACC ACAAAACTGT GCTTTTCACA GAGCTGGCCC GCGACATCTT AGCCGACAAT
ATAGAGAAGT GTTCCGAGGG TGCGGGGGAT CCCGAAATTT GCCGCTACCT CTGCGAGCTG
GCCCAGTTCG TGACTGGAAG TGCAGACGAG GTGAGTATCT GGAAGGCCAC TGATGACTAC
TTTGTCTCAG TCTTCATGCG TGACCCTCGT TTTAGGGATC TTCTTTCGAG GAAGCCGCCT
GGCTACATAC CGCTGTGGAA GAGGGAAAAG GACTTTATGG AAATTTTCCA AAACCCGGTG
AGGGTAAACG AGTCTGTGGA TAGGATAGGC CCTCTGCACT GGGCTTTGAT AAATAGGCTC
AAGAGGAAGT TCGTCGAGCG GCTGAACATC GAGGTAGGAG AGGTAGGCGG CTGCCGCCTA
GCTCCTAACG ACGTGATTCT GTCGTATGTG AGCTTTGACC CGTACGCGGA GGATATATAC
ATCACCACGG CCATGGGCCC CATCCCTATA TCGAAAATAT CGCCGCTGGT GGAGGCGGTG
AACGAGGCGT GGAAAAGGGC TCCACACGTA TTTCTCTATG TAAAGAGGGA TGTTATCGAA
AAATGCGGGG GTGATGTCTT GCAGAAGATC CTGACGGTAT TGGAGCCCCT GATGGAGCTC
GCCGTGCGTA GGCTAGGAGC GTAG
 
Protein sequence
MLFINWYIVH RVQFRKAIRD PVHGFIKLTE EEVRFIDGEP IIQRLRYVKQ LGFVYLVYPT 
ATHTRFDHSL GVMHIATQLG HRIMEQRGEF DEVLLKHLRM AALLHDVGHL PFSHSFEILT
RELLHMATVR GCLEVDLALF DRSQKPHEVT TKLLVEKLSD RLSALGYNPS LVLGLLFEPP
SKYRLYSSIL SGVFDADRLD YIMRDMYFTG AAVGTSFTHI DLERIVENLE VVGDSFQFNE
KARVNLEGYI ITRYNLYRHV YLHHKTVLFT ELARDILADN IEKCSEGAGD PEICRYLCEL
AQFVTGSADE VSIWKATDDY FVSVFMRDPR FRDLLSRKPP GYIPLWKREK DFMEIFQNPV
RVNESVDRIG PLHWALINRL KRKFVERLNI EVGEVGGCRL APNDVILSYV SFDPYAEDIY
ITTAMGPIPI SKISPLVEAV NEAWKRAPHV FLYVKRDVIE KCGGDVLQKI LTVLEPLMEL
AVRRLGA