Gene Ssol_2683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2683 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp2459368 
End bp2461038 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content35% 
IMG OID 
ProductThermopsin 
Protein accessionACX92776 
Protein GI261603173 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTAAAGC ATATAGTGTT AGTCCTTCTT TTGCTCTTAT TAACACCGTT AGTTGCCATT 
TCATTTCCAA CTGGAGTAGT AGCTTATAAT GGTCCTATAT GTACAAATGA AGTACTAGGT
TATGCAAATA TATCATCGCT GTTGGCTTAT AACACTTCTG CATCACAGCT TGGAGTTCCG
CCTTATGGCG CTTCGCTTCA ATTAAACGTT ATGTTAGAAG TAAATACTAG CGGTGGAGAA
TACTATTTCT GGTTACAAAA TGTAGCTGAT TTCATTACAA ATGAGAGTAA GGTATTCTTT
GGCGACAATA TTTGGAACTC GACTACTCCC TTTGCTGGAA TAAACAATAT AGTTGGCAAA
GGTGAAATAT ACTCTACTTC AGACTTTTTC TCTCATTCCT CATACTACGC TTATGGGACT
TATTATATTA AATATAATTT CCCCTTTTCG TTCTACCTTA TAATAAATGA GAGCTATGAT
ACTCAAGGAG TATATGTTAG TTTCGGTTAT GTTATTCTTC AAAACGGAAA TATAAGTCCA
CCTAACCCAA TATTTTACGA TACGGTCTTC ATTCCAATTC AAAATTTATC ATTTGCTTCA
ATTATAATAG CTAATCAAAC CACCCCCAGC GCGAATTTTG GTATTGTTAC ATATCTGGGA
AATTATTTAG ATGCTGAGTT AGTATGGGGA GGATTTGGGA ATGGTGAAAG CACAACTTTC
TTAAACATGT CTTCTTACTT AGCATTACTC TATATGAAAA GTGGCGAATG GGTTCCATTT
TCACAAGTAT ACAATTACGG AAGTGATACC GCAGAATCCA CTAATAATTT GCAAGTTTTG
ATAGGTAAAA ACGGTGATGC TTACGTTACA ATAGGCAGAC AGAACCCTGG TCTATTGACT
ACAAAATTTA ACCCTTCATA TCCAAGTTTC CTATACTTAA ACATTAGTAG CAAAATACCA
TTTCTACTAA ATAAAAGCCT TTCACATGCA TTCTCCGGCT ACGTTACCAC CCAAATTAAA
TTAGGATTCT TTAAGAACTA TTCAATTAAC TCATCGTCAT TTGCAGTGCT TAATGGAAAC
TATCCCAGCC TAATAGAACC TAACGTTAGT TGGTTTAAGG TTTTGAATAT TATTCCCAAT
TATACATATT ACTATCTGGT GAAAGTAAAC TCACAAATTC CAGTTATTGC CAATGTGAAT
GGTAAACAAA TAACTTTGAA CAGTACAGAT TGGTTTGCTC AAGGCACTCA AATCAGCATA
CTCAATTATA CATATTACAA CGGTAGCAAT GAGAGGTACA TAATATCATC AATTTTACCG
TCATCGTCAT TCAACGTTAG TCTACCTTTA AACATAACCT TAAGCACAAT AAAACAATAT
CGGGTTTTAG TAGACTCCAA TCTACCCGTA TATTTAAATG GTGAAAGAGT GAATGGAAGT
GTATGGATTA ACGCGGGTTC CTCCATTCAA TTAAGTGCTA ACGTTCCCTT TTACGAAAAG
GGCATATTTA CGGGGACTTA TAACGTAACA CCAGGGAGCA TTATAACGGT AAATGGGCCA
ATAGTTGAGA CCTTAATATT ATCCATCAAT ACTGAACTAA TGGGTATAGT GGCAGTAATA
GTAATAGCAG TAGTAGCAAT TGCCATATTG GTATTGAGGC GAAGAAGATG A
 
Protein sequence
MLKHIVLVLL LLLLTPLVAI SFPTGVVAYN GPICTNEVLG YANISSLLAY NTSASQLGVP 
PYGASLQLNV MLEVNTSGGE YYFWLQNVAD FITNESKVFF GDNIWNSTTP FAGINNIVGK
GEIYSTSDFF SHSSYYAYGT YYIKYNFPFS FYLIINESYD TQGVYVSFGY VILQNGNISP
PNPIFYDTVF IPIQNLSFAS IIIANQTTPS ANFGIVTYLG NYLDAELVWG GFGNGESTTF
LNMSSYLALL YMKSGEWVPF SQVYNYGSDT AESTNNLQVL IGKNGDAYVT IGRQNPGLLT
TKFNPSYPSF LYLNISSKIP FLLNKSLSHA FSGYVTTQIK LGFFKNYSIN SSSFAVLNGN
YPSLIEPNVS WFKVLNIIPN YTYYYLVKVN SQIPVIANVN GKQITLNSTD WFAQGTQISI
LNYTYYNGSN ERYIISSILP SSSFNVSLPL NITLSTIKQY RVLVDSNLPV YLNGERVNGS
VWINAGSSIQ LSANVPFYEK GIFTGTYNVT PGSIITVNGP IVETLILSIN TELMGIVAVI
VIAVVAIAIL VLRRRR