Gene Pisl_1506 details


Gene Information

Locus tag: Pisl_1506
Symbol:
ID: 4618079
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum islandicum DSM 4184
Kingdom: Archaea
Replicon accession: NC_008701
Strand:
Start bp: 1374604
End bp: 1375911
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 55%
IMG OID: 639784589
Product: sulfatase
Protein accession: YP_931005
Protein GI: 119872998
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
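
The length fields above are consistent with the start and end coordinates. A minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon (the function name `cds_lengths` is hypothetical, not part of any database API):

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive, so add 1.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and is not translated.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


print(cds_lengths(1374604, 1375911))  # (1308, 435)
```

With the coordinates from this record, the result matches the listed Gene Length (1308 bp) and Protein Length (435 aa).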


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000053867
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.747943
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAATGTTT TTCTGATTGT GGTGGATTCG CTTCGGCTGG ATTTTGCGGG GGAGCTTTTG 
TCGGGTTTGA AGCGGCGTGG GTTTAGGGTG TATGAGAGGG CTGTGGCGGC TTCTAACTGG
ACCATTCCGT CTTTTGGGTC GATGCTTACG GGGCTTTACC CCTCTCTCCA CGGGGGGCAT
GAGGAGGGCG ATAGGGTTTT TCCCGTGAGG TGGGGGGATA TGGTGTCTTG TAGGTTGGGT
GAGCTTGGGT TTCACCCGGT TGTGCTTACT GAAAACCTGC TTCTCTCGCC GGCATATGGC
TTTAAGTGTT TTGAGGTGTG GGAGTATTTC AACTGGTGGT TTTTTGTGTT TAAGTTGAGC
CGTGAGGAGT ATGGAAGAGC GATTGGCGAG TTTGTTAGGA ACGGCAATAA CGCGGTTAGG
GCTGGTCTGA GTTTACTGAG GCAGGGCCGT CTTGGTTTGC TTTCTAAGCT CTTTGTTAAC
TATTTGGCTT ACAGGGCTGT GGCGCTGAGG CGTGGACCGG TGGATAGGTG TTCTCGGTGT
ATTATTAGGG ATGTGACGAA GATAAAGACA CCGGCCTTTG TGGTGGTGAA TTTTATGGAG
GCGCATGAGC CTTATACTTA TACAGAGTTA GGCACCCCGT ATTTACCTAC CTACGACTTT
GTGGAGATGT TTAGGGAGGG TCGTGCGCCG CGTGAGTTGG TGGATTTGTG GAGGAGGTGG
TATCCGCGGG CGGTTGGGCT GGCGTCTCGT CGGGTTTTTG AGCTTTTGGA TGTGTTGGAG
GATGGGGGGC TCTTGGACGA TAGTCTTGTG GTTGTGGCTA GTGACCATGG GCAGCTTCTT
GGCGAGTTTG GGCTGGTGGG GCATCTGGCT CTTCTTTCTG ATGAGCTTGT GCGGGTTCCG
CTTGCGGTTA GGTTTCCGTC GGGGGTGGAG GTGGTTGGGG GTGGTGGGTC TGGCTGGGTT
TCTAACACGG CTGTCAAGCG GCTTGTGTTG GAGGTGGCGC GTGGCGTGAG GAGGTTTGAT
GAGGGGGTTC TCTATTCGGA TGTGGTGTTT TCTGAGACTT TTGGGCTTGG CTTCACGTCG
TGGCCTCGGG TGTGTAGAGA CGGGGGCTGT AGGCTTCTGC CTAAGCGTAG GGTGGCGGTG
TATAAGGGGG ATTTTAAGCT TGTGTATAAC GTGACTGATG GGGTTGTGGA GGAGGTGAGG
GGGTACGGGG GGCGGCCGGA TGGGGATGTG GCTGGGGATC TTCTGAGGGA GGTGTTTGGG
TTTTTAAAGG TAGCCGAGGG CCTCCAGTTT TCTCCTGAGG GCCTCTAG
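
The GC content field in this record can in principle be recomputed from the gene sequence above. A minimal sketch, assuming the sequence is pasted as a string with the space-separated blocks of 10 bases shown here (the helper name `gc_percent` is hypothetical):

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring spaces and line breaks."""
    # Keep only unambiguous nucleotides; whitespace between blocks is dropped.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found in input")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First block of the gene sequence only; paste the full sequence to compare
# the result against the GC content listed in the record.
print(gc_percent("GTGAATGTTT"))
```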
 
Protein sequence
MNVFLIVVDS LRLDFAGELL SGLKRRGFRV YERAVAASNW TIPSFGSMLT GLYPSLHGGH 
EEGDRVFPVR WGDMVSCRLG ELGFHPVVLT ENLLLSPAYG FKCFEVWEYF NWWFFVFKLS
REEYGRAIGE FVRNGNNAVR AGLSLLRQGR LGLLSKLFVN YLAYRAVALR RGPVDRCSRC
IIRDVTKIKT PAFVVVNFME AHEPYTYTEL GTPYLPTYDF VEMFREGRAP RELVDLWRRW
YPRAVGLASR RVFELLDVLE DGGLLDDSLV VVASDHGQLL GEFGLVGHLA LLSDELVRVP
LAVRFPSGVE VVGGGGSGWV SNTAVKRLVL EVARGVRRFD EGVLYSDVVF SETFGLGFTS
WPRVCRDGGC RLLPKRRVAV YKGDFKLVYN VTDGVVEEVR GYGGRPDGDV AGDLLREVFG
FLKVAEGLQF SPEGL
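
The protein sequence corresponds to the gene sequence read in frame from its first base. A minimal sketch of that translation follows, under two assumptions. First, translation table 11 assigns amino acids to codons as the standard genetic code does. Second, the initiator codon (GTG in this gene) is read as methionine. The names `CODON_TABLE` and `translate` are hypothetical and not taken from any library.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, listed in T, C, A, G order.
# Assumption: table 11 shares these assignments and differs only in which
# codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, initiator: bool = True) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        if i == 0 and initiator:
            aa = "M"  # an alternative start codon such as GTG is read as Met
        protein.append(aa)
    return "".join(protein)


print(translate("GTGAATGTTT TTCTGATTGT GGTGGATTCG"))  # MNVFLIVVDS
```

The example translates the first 30 bases of the gene sequence and reproduces the first 10 residues of the protein sequence. The sketch raises a `KeyError` on ambiguous bases such as N.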