Gene Ssol_0654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0654 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp604035 
End bp605699 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content39% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionACX90924 
Protein GI261601321 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGGA AATATAGGTA TAGTTTAGCT AAGGGATTAA CATCTACCCA AATAGCAGTA 
ATAGTAGCAG TAATCGTAAT AGTGATAATA ATAGGAGTTA TAGCCGGCTT CGTTTTAACT
AAGGGGCCCT CCACAACCCC CGTAACTACT ACAGTAACTA GCACATTTAC TACAACTACA
ACAATACCCA CTACAACTAC GTCAACCCCT AGCAATACAG TGGTCTTCTA CACATGGTGG
GGTGGAGGTG ATGGAGGACA AGCACTAAGC CAGATAATCC CTGCAGTTAA GCAATACACG
GGCTTACAAA TGCAAACATA TTCTATTCCA GGGGCTGGAG GTACAAATGC AAAATATGCT
ATTTTAGCCC TTATACAAGC TGGTAAACCT CCAGCAGCAT TTCAAGTACA TTACGGACCA
GAAATGATAA GTTATGTTGA GGCTGCACCC AACGGTATAC ATACTTTTGT CAATATGACT
CCTTATTTAG CCCAATGGGG ACTACTTAAT AACGCGGTTT ATGCAGTATT ACAAGCTGGA
GCCTATAATG GCACATTACT ATCCGTCCCA ATTAACGTAC ATAGGGGAGC AGTTCTTTAT
GTGAATACTC AATTATTAAG GGAATATAAT TTACCATTCC CCTATAACTT TAGTACTCTT
GTATATGATA CTGTACAATT AGCTAATCAT GGTGTGAGTC CATGGATTAT ACCCGGTGGA
GACGGTGGAT GGGATCAATT TAACGTATGG GAGGATATAT TCTTATATTT AGCTGGGCCA
CAAATGTACA ATGAACTAAT ATATGGTACA TTGAACTTCA ATAATCCAAT GGTTCAGAAG
ATAATAAATG AAACCAACTA CTTGTTCTTG AACTTCACAA GCTATAACTA TCCCGGTTGG
CAATCTATGT CATGGGAACA AGGATTTGCA CTACTAGCTC AAGGTAAAGT CGCATTTCAA
GCTAATGGGA ACTGGGTAAC TAATTACGCA AGTTATATAA ATATTTCAGT TTATCCTCCG
TTGCCTCAAT ACATAAACAA TTCAAGTGTT TCTGTAGTAG AGACTCCATT CCCAGGCACA
CAGCATTACT ATGCATTAGT GATAGATACA ATTGGTATAC CAGTAGGTCC TCAAGAACAA
CAAGCTTTAC AACTAGCCCA TTTCTGGTCT TCATATCAAG GGCAGGAAGT CTGGACAAAA
TACAAGGCAG TAACCTATTA TAAGAATGGT ACGGATTGGT ATGCTCAACC AGCACAATGG
TATGATTATC AACAATTAAT AAACACTTCA GAGCAAAACT TCGTTTATCA GTTATCAGAT
GGTGGAGTGT TTGATGACGT TTTCGCCCAG ATAGATTCAG GGTTACTAAC TTTACAGCAA
GTTGGTAAGA TTGGATTATC TGCTTGGAAC TCTACATTAG TATCTTCAAT GCAACAAGAA
CAAAGTGAAT GGTTAGCGGC AGCTAAACTA GGATTAGGAT ACTTAGGATT CCCTGGTCAT
CCTTTTGCTG GGTACTACCC ACCATGGGTT ACAAATCCAT CAGCATATGG ATTAAACACC
AATACGCGTC AGACAAGTAA TAGCACAATA CTCTTCTTAC TCCCATTCTT AGCACTATCC
CCTGCAATAG CCAACATTGA CAAGAAATAC TATCTCTTAA AGTAA
 
Protein sequence
MKRKYRYSLA KGLTSTQIAV IVAVIVIVII IGVIAGFVLT KGPSTTPVTT TVTSTFTTTT 
TIPTTTTSTP SNTVVFYTWW GGGDGGQALS QIIPAVKQYT GLQMQTYSIP GAGGTNAKYA
ILALIQAGKP PAAFQVHYGP EMISYVEAAP NGIHTFVNMT PYLAQWGLLN NAVYAVLQAG
AYNGTLLSVP INVHRGAVLY VNTQLLREYN LPFPYNFSTL VYDTVQLANH GVSPWIIPGG
DGGWDQFNVW EDIFLYLAGP QMYNELIYGT LNFNNPMVQK IINETNYLFL NFTSYNYPGW
QSMSWEQGFA LLAQGKVAFQ ANGNWVTNYA SYINISVYPP LPQYINNSSV SVVETPFPGT
QHYYALVIDT IGIPVGPQEQ QALQLAHFWS SYQGQEVWTK YKAVTYYKNG TDWYAQPAQW
YDYQQLINTS EQNFVYQLSD GGVFDDVFAQ IDSGLLTLQQ VGKIGLSAWN STLVSSMQQE
QSEWLAAAKL GLGYLGFPGH PFAGYYPPWV TNPSAYGLNT NTRQTSNSTI LFLLPFLALS
PAIANIDKKY YLLK