Gene Ssol_0753 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0753 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp696223 
End bp698280 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content37% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionACX91007 
Protein GI261601404 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.371532 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTCTC GTAAGGCATT TAAGGGCTTA AGTAGAACCT TTATAGCAAT AATTGTTGTT 
GTAATTGTTG TAATTGGAAT AGCTGTTGGA GTATTATTAG GTAATCACCC ATCTAGTAAT
AATATAAGCA CTACTACAAC TTCTTCTAGT ATTATCTCTA CAACTACCCT ATCATCAACA
TCTTCCTCAA CATCTCCTCC TAATAGCGTT TCAATACCTT CTAGTATTAC CGTGGAAGAA
GCTGCAACAC CAGTAAGTGT AGATCCGGCA AGTAGCTATG ATATAGCAGG TGGGGAAATT
ATTCAGAATG TTTATCAAAC TTTAGTATTT TATAATGGGA CTAATACATC TTCCTTTATT
GGAGTTTTGG CAGAAAATTG GACTGTAGAG AACAATGGTA CTACTTATAT CTTCCATCTA
TGGCCCTTTA TTACTTTCAG TAATGGTAAT CCTCTTAACG CTACAGATGT ATGGTTTTCA
ATATACAGGA CGATGCTTAT TAATTTAGGT ATATCAATTT ATACTAGTCA AGCTTTAGCT
GTTAATAATG GTCTTGGTTT TGTAGGAAAA TTACCCAACG GGAAATACGG TACAATAATG
CTACCTAATG GCATACTACA AGCATTAGAG TATGCTGGAT ATAATTTTTC GTCAAATAAA
ACTATTGCGA TGGAACAAGC TGCATATGAT TTGGCATACA TTTTATCCCA TTTTAATGTA
AGCAATACCA CAATTCAGAA GGTAATGTCA TATCCTCATC AGGCAGTAGT AGTTATCGAT
CCATATACCG TAGAGTTTAA TTTAGACTAC CCATATTCAG CATTTCTAGC TGCGCTTTCC
ACAAGTACTG GGGCTATAGT AAATCCAGTT TTTGTTGATG AGAATGGCGG AGTTCAAATT
GATACTTCTA ATACTTATCT TTCCACTCAT GCTTTAGGCT CTGGACCCTA TATTCTGGAA
ACTCCAATAG GAGGGTCTTA CGTAGTGCTA AATGCTAGTC CTAACTATTG GGCAAGTAAA
GTCCCTACAA AAGATCTAAA TCCGATGTTG GAGACACCTA AGATTAAGAC GATAATTATA
GATTATCAAA CTAACGAGGC AGTAAGAATA TTAGACTTAC AGCAAGAGAA GGCACAAATT
TCGCAGATTG ACGTGATAAA CTTACAGGAG CTAATAGGTA GCTCTGGTGT GCAACAACTT
CAGAATCTGG TTAATGGAAA GACGTTCCCA ACAACCTATA CCAGTGGTAA TGTGACAATT
TATATTTGGG GGCCTTCAGC ACAAATAGAC TTCTTGGCAA TAGATGCATA TCAATACCCA
TTTAACATAA CAGCAGTAAG ATTGGCCATA TCACATGCAA TAAACCCTGT ACAAATTCAA
CAACAAGTCT ATAAGGGTTT TGCAATAAAC TATGTTGGAC CTTTAGATCC ATCATTACCA
TACTATAACT CATCAATAAT AGGTTATACG TATAATCCCT CCCTTTCAAT ACAACTACTA
GAAGAAGCAG GATTCAAATT AACACTACCT AATGGAACTA CAGTAAATCC AAATGGAAAA
CCTTTCCCAA CAATTACCTT AACATATCAA ACTGGTAGTA CAGCTCTACA AGATGAGGCA
TTACTGGTTC AACAACAGCT AGCTCAAATA GGAATAACAG TTCAGCTAAA TCCTGAATCC
GCGGTAACAA TAGTAGAATC GTATCTAAAT CCACCCAATT CATCAGCATA TCCTGCCTTC
CAATTAGCCG GTAACTTCCC TCCAGTGCTC AGCCCCATAG ACCCAGCAAT ATACTTACTG
TCTCAAGCTA GATTACACCA CGGAAATCCA GCTTTCGTAG ATAATAGTAC GATTAATCAG
TTAATCGTAG AGGCTGTAAG AACCAATAAC CCCCAGCAAT TACAGCATAT ATATAATGAA
ATAACTTTAC TAACCTTAGC ACAAGCACAG TATGTATGGT TAGATGACTT TTTAGCCTAT
ACGGTAGCAT CGTCAAGTAT TCATGGATTC TGGTATAGCC CCGGATTAGA TGGGTTATTC
TATGCTGACT TATACTGA
 
Protein sequence
MKSRKAFKGL SRTFIAIIVV VIVVIGIAVG VLLGNHPSSN NISTTTTSSS IISTTTLSST 
SSSTSPPNSV SIPSSITVEE AATPVSVDPA SSYDIAGGEI IQNVYQTLVF YNGTNTSSFI
GVLAENWTVE NNGTTYIFHL WPFITFSNGN PLNATDVWFS IYRTMLINLG ISIYTSQALA
VNNGLGFVGK LPNGKYGTIM LPNGILQALE YAGYNFSSNK TIAMEQAAYD LAYILSHFNV
SNTTIQKVMS YPHQAVVVID PYTVEFNLDY PYSAFLAALS TSTGAIVNPV FVDENGGVQI
DTSNTYLSTH ALGSGPYILE TPIGGSYVVL NASPNYWASK VPTKDLNPML ETPKIKTIII
DYQTNEAVRI LDLQQEKAQI SQIDVINLQE LIGSSGVQQL QNLVNGKTFP TTYTSGNVTI
YIWGPSAQID FLAIDAYQYP FNITAVRLAI SHAINPVQIQ QQVYKGFAIN YVGPLDPSLP
YYNSSIIGYT YNPSLSIQLL EEAGFKLTLP NGTTVNPNGK PFPTITLTYQ TGSTALQDEA
LLVQQQLAQI GITVQLNPES AVTIVESYLN PPNSSAYPAF QLAGNFPPVL SPIDPAIYLL
SQARLHHGNP AFVDNSTINQ LIVEAVRTNN PQQLQHIYNE ITLLTLAQAQ YVWLDDFLAY
TVASSSIHGF WYSPGLDGLF YADLY