Gene Ssol_1976 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1976 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1762766 
End bp1764199 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content37% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionACX92187 
Protein GI261602584 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCGCT CAGATAAATT CTCCAATAAA GAGAAAATGA GACGTGGTTT ATCCACAACC 
ACAATAATAG GAATAGTAGT GGCAATAGTA ATAATCGTAA TAGGAGCCGT TGCTGCGGTA
ACTCTACTTA GTCATAAGCC GTCACAAGTA GTATCTACCA CTTCTCCGTC TACTTCGCAA
TCAGCAACGT CAACATCTCC ATCACAAGTT ATTACCATAA CCTATTTTGA CGATCTATCG
CCATCTGAAG CCAACATAAC ACAGAAAATA ATAATTCCAC AATTCGAGGC AACACATCCT
AATATTAAAA TTAATTATGT TGATGAAAGT GCTGACGATA TAGTAAAAAG TGTTGTAGAG
TTAGTAAAGA GTGGTAACGT TGGCCCAGTT ATCATTGGTG AAGATAATCT TGTTATAGGA
GAATTGCTCA ATGGTAACTA TTTAATGAAC CTAACTCCAT ATGTAAATCA AATCTTACAG
AATGTTAGTC TAATTCCATC GATGCAATCA TTAGTCCAGT ATGAACAAAA AGTATATCAT
GGTACTTACT TCATTCCTCT TAGAGCCAAC ATTCCTCTAG TCTGGTATAA CGCAAGTCTC
TTTAGGCAAT TAGGTTTGGC TTCTCCACCA GAAAACTGGT CACAATTGCT ACAGTACGCA
AAGATAATTT ATGATAAAAC TGGCGTAAAA CCCATAATGT TCCAAGGTCA TGGTGGAGCA
AGTACTTATA CTGAGCTTTA CCAGTGGATG GTCCAGGCTG GTGGAAATCC ATTCTTATTT
AATGATTCTG GCGATGTACT TGCATTTGAA TATTTGTACA ATCTCTCACA ATATTTTAAC
CTAGAGTATA TTCATGGATA TTGGGGAAGC TATAAAGGAT TAATTGATGG TAGTTACTAT
TTGATCGATT ATCAATGGCC CTATATCTAT AGTGTTATGG CCAGTGAAGG TGTTAATATG
AGCAATATAG GATTCTACCC TGGCCCTACC GGCCCTGTTA ACGGTGATCA CTTAGTAGGT
GGTGATGTAT TAGCAATACC TAAGGGAGCA ACACACATTG ATGCGCTAAT AGAATTCGCT
AAGTTCTTAC TATCTCCGCA AGTTCAAAGA GAATTTATCA TTTACCTCTC ATGGCCAGCA
GTAAATCAAC AAGCTTATCA GAATCTCCCA AGTAATATAA GTTCACTATA CAAAGCTGAA
GAAGAAGCTT TACAGAACGC ATTCTTCAGA GAACCAGTAC CTTGGATAAC TGTTTGGGGA
CAGATAGCAG ATAACGTGTT TAATCAAATA ATCGTAAATC ATGCACCATA TTCACAAATA
CCTCAAATAT TAAGTCAAGC AAATAAGGAA ATGTACCAGT ATCTCTTACA AAACTATAAT
GTAACGGTAG CACAAGAGTA TGAGGAAGGA GTGTTTGGAC CACTATACGG GTGA
 
Protein sequence
MSRSDKFSNK EKMRRGLSTT TIIGIVVAIV IIVIGAVAAV TLLSHKPSQV VSTTSPSTSQ 
SATSTSPSQV ITITYFDDLS PSEANITQKI IIPQFEATHP NIKINYVDES ADDIVKSVVE
LVKSGNVGPV IIGEDNLVIG ELLNGNYLMN LTPYVNQILQ NVSLIPSMQS LVQYEQKVYH
GTYFIPLRAN IPLVWYNASL FRQLGLASPP ENWSQLLQYA KIIYDKTGVK PIMFQGHGGA
STYTELYQWM VQAGGNPFLF NDSGDVLAFE YLYNLSQYFN LEYIHGYWGS YKGLIDGSYY
LIDYQWPYIY SVMASEGVNM SNIGFYPGPT GPVNGDHLVG GDVLAIPKGA THIDALIEFA
KFLLSPQVQR EFIIYLSWPA VNQQAYQNLP SNISSLYKAE EEALQNAFFR EPVPWITVWG
QIADNVFNQI IVNHAPYSQI PQILSQANKE MYQYLLQNYN VTVAQEYEEG VFGPLYG