Gene Pars_0524 details


Gene Information

Locus tag: Pars_0524
Symbol:
ID: 5056231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 472508
End bp: 473893
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 62%
IMG OID: 640468086
Product: selenium-binding protein
Protein accession: YP_001152771
Protein GI: 145590769
COG category:
COG ID:
TIGRFAM ID:
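
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The function name `check_lengths` is hypothetical.

```python
def check_lengths(start_bp: int, end_bp: int, gene_length_bp: int,
                  protein_length_aa: int) -> dict:
    """Cross-check coordinate, gene-length and protein-length fields."""
    # Assumption: coordinates are 1-based and inclusive, so the span
    # covers end - start + 1 bases.
    span_bp = end_bp - start_bp + 1
    # Assumption: the last codon is a stop codon and is not translated.
    expected_protein_aa = gene_length_bp // 3 - 1
    return {
        "span_bp": span_bp,
        "span_matches_gene_length": span_bp == gene_length_bp,
        "length_is_multiple_of_3": gene_length_bp % 3 == 0,
        "expected_protein_aa": expected_protein_aa,
        "protein_length_matches": expected_protein_aa == protein_length_aa,
    }


# Values copied from the Gene Information fields above.
result = check_lengths(472508, 473893, 1386, 461)
for key, value in result.items():
    print(f"{key}: {value}")
```

Under these assumptions the span is 473893 - 472508 + 1 = 1386 bp, and 1386 / 3 - 1 = 461 codons are translated, which agrees with the reported gene and protein lengths.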


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.377432
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGTC TAACCCCGGA CCCGACGCTT TATCCCACGG TTCGAGACGC TATTAAGAGC 
CCGCCGGAGG ATGTGGCGTA TGTGGCCGCG CTGTATGTGG GCACCGGCGT GGAGGAGCCC
GATTTCCTCG CGGTGGTCGA CGTGGACCCC AACTCGGCAA CATACGGCAA GATCGTGTAC
AAGCTACCGA TGCCGGATAT AGGCGACGAG TTGCACCACT TCGGCTGGAA CGCCTGCTCC
TCCGCCTACT GCCCCAACGC GAGGCCTTTC CTCGAGAGGC GGTACCTCAT AGTGCCTGGG
CTTAGGTCCT CCAGGATATA TATCGTGGAT ACGAAGCCGG ACAAGAGGAA GCCCTCTATA
CATAAAATAA TCGAGCCCAA AGTCGCCGTG GAGAAGACCG GCTACACGAA GTACCACACG
GTCCACTGCG GCCCAGACGC CATATACATA TCGGCGCTGG GCGGGCCAGA CGGGAGGGGA
GGCCCCGGCG GAGTCCTGTT GCTAGACCAC AACACCTTCG AGCCGCTGGG CCGGTGGGAG
GTCCACAGAG GGCCGCAGTT CTACGCCTAC GACTTCTGGT GGAACCTCCC CTCCGGCGTC
ATGATCTCCA GCGAGTGGAC GACGCCGGAG TGCTTCGAAA ACGGCTTCAG CCCCGAGTGC
CTCGCCGCCG GTAAATACGG CCACAAGCTC CACGTCTGGG ACCTCGGCAG ACGTAGGCAC
CTCTACTCCA TAGACCTGGG GGAGGAGCAC CGCATGGTGC TTGAGGTGAG GCCCCTCCAC
GACCCCACCA AGCTGATGGG CTTCGTCAAC GTGGTCCTCA ACACGAAGGA CCTCTCCAGC
TCCATATGGC TCTGGTTCCC AGAAGACGGC AGGTGGCACG CCGAGAAGGT GATAGAGATA
GAGGCGCAGC CCTCCGAGGG CCCCCTCCCG CCGCCGCTGA AGGACCTGAA GACCGTGCCT
CCGCTGGTCA CCGACATAGA CGTCTCTCTG GACGACAGAT TCCTCTACGT CTCCCTCTGG
GGCCTCGGGG AGCTGAGGCA GTACGACATC ACGAACCCCC ACCAGCCGCG GCTGGCGGGG
CGGGTCAAGA TAGGCGGCAT ATTCCACAGA GAGCCCCACC CCTCCGGCGC AGAGGCGACC
GGCGCCCCGC AGATGATATC AGTCAGCAGA GACGGGAGGA GGGTCTACAT AACCAACAGC
CTCTACAGTA GCTGGGATAA CCAGTTCTAC CCCGGCCTCA GGGGCTGGAT GGCGAAGATC
AACGTAAATC CCGAGGGCGG TTTGGAGCTC GACAAGTCTT TCTTCGTGGA CTTCGGCCAG
GCGAGGACCC ACCAAGTACG CCTCTGGGGC GGCGACGCCT CCACAGACAG CTTCTGCTTC
CCGTAA
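
The GC content field can in principle be recomputed from the gene sequence above. The following is a minimal sketch of one way to do that. The function name `gc_percent` is hypothetical, and how the source database rounds or computes its reported value is not stated in this record.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks that group the bases in blocks of ten.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Example on the first line of the gene sequence only; paste the full
# sequence to compare against the GC content reported in the record.
first_line = "ATGGCGCGTC TAACCCCGGA CCCGACGCTT TATCCCACGG TTCGAGACGC TATTAAGAGC"
print(round(gc_percent(first_line), 1))
```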
 
Protein sequence
MARLTPDPTL YPTVRDAIKS PPEDVAYVAA LYVGTGVEEP DFLAVVDVDP NSATYGKIVY 
KLPMPDIGDE LHHFGWNACS SAYCPNARPF LERRYLIVPG LRSSRIYIVD TKPDKRKPSI
HKIIEPKVAV EKTGYTKYHT VHCGPDAIYI SALGGPDGRG GPGGVLLLDH NTFEPLGRWE
VHRGPQFYAY DFWWNLPSGV MISSEWTTPE CFENGFSPEC LAAGKYGHKL HVWDLGRRRH
LYSIDLGEEH RMVLEVRPLH DPTKLMGFVN VVLNTKDLSS SIWLWFPEDG RWHAEKVIEI
EAQPSEGPLP PPLKDLKTVP PLVTDIDVSL DDRFLYVSLW GLGELRQYDI TNPHQPRLAG
RVKIGGIFHR EPHPSGAEAT GAPQMISVSR DGRRVYITNS LYSSWDNQFY PGLRGWMAKI
NVNPEGGLEL DKSFFVDFGQ ARTHQVRLWG GDASTDSFCF P
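
The relationship between the gene sequence and the protein sequence can be illustrated by translating the DNA codon by codon. The following is a minimal sketch in plain Python, not the method used by the source database. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard table and differs only in its set of permitted start codons. The sketch does not handle alternative start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, with the
# third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str, to_stop: bool = True) -> str:
    """Translate a DNA sequence into one-letter amino acid codes."""
    # Remove the spaces and line breaks that group the bases in blocks of ten.
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        # Unknown codons (for example, ones containing N) become "X".
        aa = CODON_TABLE.get(bases[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCGCGTC TAACCCCGGA CCCGACGCTT TATCCCACGG TTCGAGACGC TATTAAGAGC"
print(translate(first_line))  # MARLTPDPTLYPTVRDAIKS
```

The output matches the first twenty residues of the protein sequence shown above. The final six bases of the gene, CCGTAA, translate to P followed by a stop codon, which is consistent with the protein ending in P.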