Gene Ssol_0525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0525 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp468379 
End bp469815 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content36% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionACX90804 
Protein GI261601201 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.180437 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCCC TTTCAACTCT TGCAATGGCT GTAATAATAA TAGTTGTAAT AGCTGTAGTG 
GCTGCTGCAG CATATCTGAT TACCTCAAGT AGTCATCATC CCTCTATTTC AACGACAACT
ACACCGATTA TAGCTACCAA CACAACCGCA CCAATTACAT TAACAGTAGT CACATTTAGT
GGGCAGTCTG CAAACTTTAT CCAATACGCT GGTAACTTAT TCCATCAACT ACACCCAAAT
GTTCAAGTTG AAGTTATCCA ATATCCATTT AGCGAGTACA TTCAGAAAGA ACTTACGGCA
CTTGAAGCTC ATTCCTCCCA ATATGATATT ATAGGCTTCA CTTCAACTTC TGCTCTAGAT
GTCGCATCAT ATTTGTTGCC TATAAACGAG TCAGTTTTCA ATTTCTCCGA CATAATATAC
CCTCAAGAAG ATTTTGGAGG ATTGTATTAT AACGTATCCA CTAATAAAAC TGAAGTGATA
GGTATCGCAT ATGAAACTGC AGTTTACTTA ATGGCATATA ATGCTACAAT ATTTAATAAT
CAAACCTTGG CACAAGAATT TGAACAAGAG TATCATATGA ATTTTTCACC AATTACATAT
AAGAACTGGA GTGTAGTTTT AGATGTTGAT CAATTCCTAA CTTCACACCA TATCACAAAA
TACGGTTTCC TAATAGACGA TCATGTCGCA CACGGAATTA TTGACGCATT TCCTGCAGTA
TTTGGCTGGT ATTATTTTAG AAATAATTCA TTAAATATGG GTAATCCAGC AGGTTTACCT
AACTATAACA TAATGTTTGA GGGTAGAATA TTACCAGGTT TTAATTATCC TCTACCATCG
TTTAATTCTT CCTCTGGCGT GCAAGCTCTA ATTACCTATA GGGAATTAGT AAGTTATGAG
CCCAGTCCTT CACAGATTCA AATATCGTAT GATAACCTAC CAGCATTCTT CTCTCAAGGA
GCTGGCGCAT TTCTATTCAC ATCTCAATTA AGTTATATAA ATAACTCTAA AGATGTACTA
CTCGCACCAT TACCTGGGGG ATATGCGGAA ACCGGAACTG ACTTTTTAGG AATTAGTAAG
TACTCATCAC ATCCTCAATT AGCTCTAGAA TTCTTGCAAT TTTTAGTATC CCCTAAGGTG
CAAGAGATTG CATTCCTAAA ATATGGTAAA TTCCCGATCT CTAAACAAGC GTTTCTTTCA
CTAATAAGCA ACTCGTCACT TCCTTCTTAT AAAAGGGAAT GGCTGCAAGA GACTTATTAC
GCAGCGTTAA ATGCCACAGC AAATCCACCA AATATTCCAC AAACATATCC TGCATTAATT
CCAAGCTTTA ATAATGAGGC ATTTCAGTTC TTAACTTCAC CTCAATATAA TGAGACATAT
GCTATGAACG TATTACAACA AGCTGCAAAT GCATGGATTA AGGCACTTTC TTCATAG
 
Protein sequence
MKALSTLAMA VIIIVVIAVV AAAAYLITSS SHHPSISTTT TPIIATNTTA PITLTVVTFS 
GQSANFIQYA GNLFHQLHPN VQVEVIQYPF SEYIQKELTA LEAHSSQYDI IGFTSTSALD
VASYLLPINE SVFNFSDIIY PQEDFGGLYY NVSTNKTEVI GIAYETAVYL MAYNATIFNN
QTLAQEFEQE YHMNFSPITY KNWSVVLDVD QFLTSHHITK YGFLIDDHVA HGIIDAFPAV
FGWYYFRNNS LNMGNPAGLP NYNIMFEGRI LPGFNYPLPS FNSSSGVQAL ITYRELVSYE
PSPSQIQISY DNLPAFFSQG AGAFLFTSQL SYINNSKDVL LAPLPGGYAE TGTDFLGISK
YSSHPQLALE FLQFLVSPKV QEIAFLKYGK FPISKQAFLS LISNSSLPSY KREWLQETYY
AALNATANPP NIPQTYPALI PSFNNEAFQF LTSPQYNETY AMNVLQQAAN AWIKALSS