Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pisl_1506 |
Symbol | |
ID | 4618079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum islandicum DSM 4184 |
Kingdom | Archaea |
Replicon accession | NC_008701 |
Strand | + |
Start bp | 1374604 |
End bp | 1375911 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639784589 |
Product | sulfatase |
Protein accession | YP_931005 |
Protein GI | 119872998 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000053867 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.747943 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATGTTT TTCTGATTGT GGTGGATTCG CTTCGGCTGG ATTTTGCGGG GGAGCTTTTG TCGGGTTTGA AGCGGCGTGG GTTTAGGGTG TATGAGAGGG CTGTGGCGGC TTCTAACTGG ACCATTCCGT CTTTTGGGTC GATGCTTACG GGGCTTTACC CCTCTCTCCA CGGGGGGCAT GAGGAGGGCG ATAGGGTTTT TCCCGTGAGG TGGGGGGATA TGGTGTCTTG TAGGTTGGGT GAGCTTGGGT TTCACCCGGT TGTGCTTACT GAAAACCTGC TTCTCTCGCC GGCATATGGC TTTAAGTGTT TTGAGGTGTG GGAGTATTTC AACTGGTGGT TTTTTGTGTT TAAGTTGAGC CGTGAGGAGT ATGGAAGAGC GATTGGCGAG TTTGTTAGGA ACGGCAATAA CGCGGTTAGG GCTGGTCTGA GTTTACTGAG GCAGGGCCGT CTTGGTTTGC TTTCTAAGCT CTTTGTTAAC TATTTGGCTT ACAGGGCTGT GGCGCTGAGG CGTGGACCGG TGGATAGGTG TTCTCGGTGT ATTATTAGGG ATGTGACGAA GATAAAGACA CCGGCCTTTG TGGTGGTGAA TTTTATGGAG GCGCATGAGC CTTATACTTA TACAGAGTTA GGCACCCCGT ATTTACCTAC CTACGACTTT GTGGAGATGT TTAGGGAGGG TCGTGCGCCG CGTGAGTTGG TGGATTTGTG GAGGAGGTGG TATCCGCGGG CGGTTGGGCT GGCGTCTCGT CGGGTTTTTG AGCTTTTGGA TGTGTTGGAG GATGGGGGGC TCTTGGACGA TAGTCTTGTG GTTGTGGCTA GTGACCATGG GCAGCTTCTT GGCGAGTTTG GGCTGGTGGG GCATCTGGCT CTTCTTTCTG ATGAGCTTGT GCGGGTTCCG CTTGCGGTTA GGTTTCCGTC GGGGGTGGAG GTGGTTGGGG GTGGTGGGTC TGGCTGGGTT TCTAACACGG CTGTCAAGCG GCTTGTGTTG GAGGTGGCGC GTGGCGTGAG GAGGTTTGAT GAGGGGGTTC TCTATTCGGA TGTGGTGTTT TCTGAGACTT TTGGGCTTGG CTTCACGTCG TGGCCTCGGG TGTGTAGAGA CGGGGGCTGT AGGCTTCTGC CTAAGCGTAG GGTGGCGGTG TATAAGGGGG ATTTTAAGCT TGTGTATAAC GTGACTGATG GGGTTGTGGA GGAGGTGAGG GGGTACGGGG GGCGGCCGGA TGGGGATGTG GCTGGGGATC TTCTGAGGGA GGTGTTTGGG TTTTTAAAGG TAGCCGAGGG CCTCCAGTTT TCTCCTGAGG GCCTCTAG
|
Protein sequence | MNVFLIVVDS LRLDFAGELL SGLKRRGFRV YERAVAASNW TIPSFGSMLT GLYPSLHGGH EEGDRVFPVR WGDMVSCRLG ELGFHPVVLT ENLLLSPAYG FKCFEVWEYF NWWFFVFKLS REEYGRAIGE FVRNGNNAVR AGLSLLRQGR LGLLSKLFVN YLAYRAVALR RGPVDRCSRC IIRDVTKIKT PAFVVVNFME AHEPYTYTEL GTPYLPTYDF VEMFREGRAP RELVDLWRRW YPRAVGLASR RVFELLDVLE DGGLLDDSLV VVASDHGQLL GEFGLVGHLA LLSDELVRVP LAVRFPSGVE VVGGGGSGWV SNTAVKRLVL EVARGVRRFD EGVLYSDVVF SETFGLGFTS WPRVCRDGGC RLLPKRRVAV YKGDFKLVYN VTDGVVEEVR GYGGRPDGDV AGDLLREVFG FLKVAEGLQF SPEGL
|
| |