Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_2683 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | - |
Start bp | 2459368 |
End bp | 2461038 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | |
Product | Thermopsin |
Protein accession | ACX92776 |
Protein GI | 261603173 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTAAAGC ATATAGTGTT AGTCCTTCTT TTGCTCTTAT TAACACCGTT AGTTGCCATT TCATTTCCAA CTGGAGTAGT AGCTTATAAT GGTCCTATAT GTACAAATGA AGTACTAGGT TATGCAAATA TATCATCGCT GTTGGCTTAT AACACTTCTG CATCACAGCT TGGAGTTCCG CCTTATGGCG CTTCGCTTCA ATTAAACGTT ATGTTAGAAG TAAATACTAG CGGTGGAGAA TACTATTTCT GGTTACAAAA TGTAGCTGAT TTCATTACAA ATGAGAGTAA GGTATTCTTT GGCGACAATA TTTGGAACTC GACTACTCCC TTTGCTGGAA TAAACAATAT AGTTGGCAAA GGTGAAATAT ACTCTACTTC AGACTTTTTC TCTCATTCCT CATACTACGC TTATGGGACT TATTATATTA AATATAATTT CCCCTTTTCG TTCTACCTTA TAATAAATGA GAGCTATGAT ACTCAAGGAG TATATGTTAG TTTCGGTTAT GTTATTCTTC AAAACGGAAA TATAAGTCCA CCTAACCCAA TATTTTACGA TACGGTCTTC ATTCCAATTC AAAATTTATC ATTTGCTTCA ATTATAATAG CTAATCAAAC CACCCCCAGC GCGAATTTTG GTATTGTTAC ATATCTGGGA AATTATTTAG ATGCTGAGTT AGTATGGGGA GGATTTGGGA ATGGTGAAAG CACAACTTTC TTAAACATGT CTTCTTACTT AGCATTACTC TATATGAAAA GTGGCGAATG GGTTCCATTT TCACAAGTAT ACAATTACGG AAGTGATACC GCAGAATCCA CTAATAATTT GCAAGTTTTG ATAGGTAAAA ACGGTGATGC TTACGTTACA ATAGGCAGAC AGAACCCTGG TCTATTGACT ACAAAATTTA ACCCTTCATA TCCAAGTTTC CTATACTTAA ACATTAGTAG CAAAATACCA TTTCTACTAA ATAAAAGCCT TTCACATGCA TTCTCCGGCT ACGTTACCAC CCAAATTAAA TTAGGATTCT TTAAGAACTA TTCAATTAAC TCATCGTCAT TTGCAGTGCT TAATGGAAAC TATCCCAGCC TAATAGAACC TAACGTTAGT TGGTTTAAGG TTTTGAATAT TATTCCCAAT TATACATATT ACTATCTGGT GAAAGTAAAC TCACAAATTC CAGTTATTGC CAATGTGAAT GGTAAACAAA TAACTTTGAA CAGTACAGAT TGGTTTGCTC AAGGCACTCA AATCAGCATA CTCAATTATA CATATTACAA CGGTAGCAAT GAGAGGTACA TAATATCATC AATTTTACCG TCATCGTCAT TCAACGTTAG TCTACCTTTA AACATAACCT TAAGCACAAT AAAACAATAT CGGGTTTTAG TAGACTCCAA TCTACCCGTA TATTTAAATG GTGAAAGAGT GAATGGAAGT GTATGGATTA ACGCGGGTTC CTCCATTCAA TTAAGTGCTA ACGTTCCCTT TTACGAAAAG GGCATATTTA CGGGGACTTA TAACGTAACA CCAGGGAGCA TTATAACGGT AAATGGGCCA ATAGTTGAGA CCTTAATATT ATCCATCAAT ACTGAACTAA TGGGTATAGT GGCAGTAATA GTAATAGCAG TAGTAGCAAT TGCCATATTG GTATTGAGGC GAAGAAGATG A
|
Protein sequence | MLKHIVLVLL LLLLTPLVAI SFPTGVVAYN GPICTNEVLG YANISSLLAY NTSASQLGVP PYGASLQLNV MLEVNTSGGE YYFWLQNVAD FITNESKVFF GDNIWNSTTP FAGINNIVGK GEIYSTSDFF SHSSYYAYGT YYIKYNFPFS FYLIINESYD TQGVYVSFGY VILQNGNISP PNPIFYDTVF IPIQNLSFAS IIIANQTTPS ANFGIVTYLG NYLDAELVWG GFGNGESTTF LNMSSYLALL YMKSGEWVPF SQVYNYGSDT AESTNNLQVL IGKNGDAYVT IGRQNPGLLT TKFNPSYPSF LYLNISSKIP FLLNKSLSHA FSGYVTTQIK LGFFKNYSIN SSSFAVLNGN YPSLIEPNVS WFKVLNIIPN YTYYYLVKVN SQIPVIANVN GKQITLNSTD WFAQGTQISI LNYTYYNGSN ERYIISSILP SSSFNVSLPL NITLSTIKQY RVLVDSNLPV YLNGERVNGS VWINAGSSIQ LSANVPFYEK GIFTGTYNVT PGSIITVNGP IVETLILSIN TELMGIVAVI VIAVVAIAIL VLRRRR
|
| |