Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_0566 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | - |
Start bp | 512940 |
End bp | 514694 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | glycoside hydrolase 15-related protein |
Protein accession | ACX90842 |
Protein GI | 261601239 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.941265 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACAAGAT ATTACGAATT TCTTTCGAAC GGTATCACCT CAGCTTTAAT AAAAAATGGC AGTGTTGAAT GGTTACCATT ACCTAGATTT GATTCTTCAT CAATTTTCAC AAAAATTCTG GATGAAGAAA GAGGAGGGTA TTTCAGTATA AGTCCGGTAG ATGAAAAATA TGAGGTAATT CATGAAGATT ATATTGAAAA TACATTGATT CTAAGAACTA TCTTCAAAAG CGAGGAGGGT AAAGCTGAAG TAACGGACTT CTTACCTTTA TCGTTAGCAG GGATTATACG AATTTATGAA AGTGAAATAG ATCTGAAAGT AGAGATAAAG CCATTCTTTG AATACGGCCT TATAACACCA GCTATAATTA AAGCTAGAAG AGGCTTGATA TTTAGAAATC CTAAATCCAA AGAGGGCGTG GAAATACTAA TAGGTGGGGA ATACATACCT ATTGATGATA CTGAATTCGT TTTGAAAAAG GGTAAAGGTT ATATATACTT GCTTTACTCG AAGGATTTGA GATATGGTCT GTTCAGTAAT AAGGGCTTTG TTTATTCTGA ACCGTATGAA GCCTTAAATA AGGCTATAAG ATATTGGAGA AGAAGGTTAG AGAACGCTAA AAAGGTTAAC ATGTATGTAG ATGCATATTA TAGATCTTTG TTAGTACTTC TTGGCTTAAT TTATGAACCA TCAGGAGGAA TTATAGCAGC TCCAACTACT TCTCTTCCAG AGATAATCGG CGGGTCGAGA AATTGGGATT ATAGATATGT ATGGGTTAGA GATGCTTCGT ACTCTGCAGA AGCGTTAATA AAAGCAGGAT TATTGGTAAA GGCCAGGGAT ATAATAGGAT TCCTAACTGC TATGATAGAT CCTTCGTCTA AAAGTTTTGA TCATCCACTT TATACAATAG ATGGAACAGC GCCACCAGCA GAGGAAATCC TAGATTGGCT AAAAGGTCAT AAAAAATCGT TTCCGGTAAG AGTTGGAAAT GCAGCGTATA TGCAAATACA AATGGATGTT GAAGGTGCTT ATATGAATGC ACTATACGAA TATTATAATG CAAGTAAAGA TTCAGAATAT ATTAGTGAAA TATGGTGGGC CATAGAGGCA ATAGCGGAAT GGACTAAGAA GAGTTGGAAA TGGAAGAGTA CTGATTTATG GGAACAAAGA GGAGTAGAGG AGCATTTCAC GCACACTAAG GTTATGGATT GGGTAGCTCT CGATAGGGCT AGCAAACTTG CAAATGAATT GGGATATAAA AATGAAGCTG AGGAATGGAT TAATGTTGCT GAAGAAATTA AAAAGGACGT TTTTGATAAC GGATATTCAG AAAAGTTAGG TTATTTTACG AGATATTACG GGAGTAATTC AGTAGATTCT GCTTTACTTA CCTTACCGCT ATACGGCTTT ATAGATGCTA AAGATCCTAG ATTTTTGAGG ACATTTGAGA AGATCGAGAG GGATTTAATG ATTTCAGATG GTTTATTACT AAGATATAAG GATGACTTTA TGGGAAATGT AAAACATCCT TTTGCCCTAG TCTCGACATG GTTAAGCCGG GTTTATATTA GATTAGGAGA AGTAGACAGA GCGAAAAAAG TAATTGAGAA GCTAATTAAC TGTTCTACAT CATCATTATT ACTCGCTGAA CACTTAGATC AGAACACTTG CGAACCTAGG GGTAACTTTC CTCAAGCATT TCCCCATGCA GGATTAATAG TGGCAATAGT GGAACTAGAA GAGAGGTTAG TTTAA
|
Protein sequence | MTRYYEFLSN GITSALIKNG SVEWLPLPRF DSSSIFTKIL DEERGGYFSI SPVDEKYEVI HEDYIENTLI LRTIFKSEEG KAEVTDFLPL SLAGIIRIYE SEIDLKVEIK PFFEYGLITP AIIKARRGLI FRNPKSKEGV EILIGGEYIP IDDTEFVLKK GKGYIYLLYS KDLRYGLFSN KGFVYSEPYE ALNKAIRYWR RRLENAKKVN MYVDAYYRSL LVLLGLIYEP SGGIIAAPTT SLPEIIGGSR NWDYRYVWVR DASYSAEALI KAGLLVKARD IIGFLTAMID PSSKSFDHPL YTIDGTAPPA EEILDWLKGH KKSFPVRVGN AAYMQIQMDV EGAYMNALYE YYNASKDSEY ISEIWWAIEA IAEWTKKSWK WKSTDLWEQR GVEEHFTHTK VMDWVALDRA SKLANELGYK NEAEEWINVA EEIKKDVFDN GYSEKLGYFT RYYGSNSVDS ALLTLPLYGF IDAKDPRFLR TFEKIERDLM ISDGLLLRYK DDFMGNVKHP FALVSTWLSR VYIRLGEVDR AKKVIEKLIN CSTSSLLLAE HLDQNTCEPR GNFPQAFPHA GLIVAIVELE ERLV
|
| |