Gene Pars_0351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0351 
Symbol 
ID5054714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp301359 
End bp302618 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content54% 
IMG OID640467924 
Productsulfatase 
Protein accessionYP_001152611 
Protein GI145590609 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAAT ACAACGTCGT CCTCATCGTC CTCGACACCC TAAGGGCCGA CCACGCCCAG 
GGCCTAGACA AGCTACTCGA CCTCGGCTTC GTCAAATACG AAGACGTCTA CGCCACCGCC
CCCTGGACCC TCCCCAGCCA CGCCTCCATG TTCACCGGCA TGTACCCCTC CGAACACGGG
ATACACGAGA CAAGGGAATA CCAGTTGGAT GTAGCCAAGA TTGCCAGGTT GCGCATGGCT
AAGCTAAACG GCGGCATATT GGGCCAGCTA AAAGAGGAAG GATACAACAC GTATCTTATA
TCGGCGAATC CAATAGTCTC AAGCAACTTC GGCTTTAACG CCGACTACGA ATACATCATA
GATCCCATAT ATACCTTGCT CATTACGTCA ATCGACATAA TACTAGATAA AATCTATGCC
GAGAGCGGCT CCAGAGCAAA GGTATTATCA AAACTAATTG AAGAACGTAG ACTCGATATG
TTACTTCATG GGATCAAGAT ATTTGTAGAA AGAAGAATTC GCGTAATCCC AAAATATCTT
TCTGAAAAGG CGACCAGAAA TAAGGGAGGC GGAAAAATTG TAGGGTTACT AGGGAGATTA
AAATTGGAAA CTCCATTTTT CCTCTTTGTC AACATAATGG AAGCACATGA CCCCTATAAT
AAACCACTCG TTGATAGGCG TAGGCTGAAA TACATCGGCA AGTGGCTCGC AACAGGGTTG
ATAGACCCCG AAGCGGTGAG GTTGTGGCGG AACTACCCGG CCCACGCCGA GGAAGCCGTC
AAGAGGGCCC TGGAGGCCGT GGAGACGCTG AAGGCGAGGG GCTACTGGGA CGATACACTG
ATAATCGCGA CGTCCGACCA CGGGGAGCTA CTGGGAGACG GCGGGCTCTA TCACATCTAT
TCGCTCCTCG ACGGGAACCT CCGGGTCCCC CTCTACGTCA AGTACCCAGG GAAGCCCAAG
AAGCAGAGAG GTCCCATCAC GCTCGCCGAC GTGCCCCGGC TGATCGACCC CTCGGCGGAG
GAGGTAGGGC GCCCCCTAGT CATGGCGGAA ACGTTCGGCA TAAGCTCTCC GCCTAAAGCC
CTCGGCATAG AGCCGGAGGA GAGGTTCTTC CACCACAAGA TAAGGGTAAT CGGCCGCAAG
CTCGACTTCA TATACGACGC AACGGCCGGC GTCGTGGAGA GGGTCTTCCG CGGCGATAAG
GAGGACGCGG CGAGGCTGCT CGAGGACGCG GGGGCCAAGC TCGGCGGCCG TAGTATATAA
 
Protein sequence
MRKYNVVLIV LDTLRADHAQ GLDKLLDLGF VKYEDVYATA PWTLPSHASM FTGMYPSEHG 
IHETREYQLD VAKIARLRMA KLNGGILGQL KEEGYNTYLI SANPIVSSNF GFNADYEYII
DPIYTLLITS IDIILDKIYA ESGSRAKVLS KLIEERRLDM LLHGIKIFVE RRIRVIPKYL
SEKATRNKGG GKIVGLLGRL KLETPFFLFV NIMEAHDPYN KPLVDRRRLK YIGKWLATGL
IDPEAVRLWR NYPAHAEEAV KRALEAVETL KARGYWDDTL IIATSDHGEL LGDGGLYHIY
SLLDGNLRVP LYVKYPGKPK KQRGPITLAD VPRLIDPSAE EVGRPLVMAE TFGISSPPKA
LGIEPEERFF HHKIRVIGRK LDFIYDATAG VVERVFRGDK EDAARLLEDA GAKLGGRSI