Gene Information |
Locus tag | Ssol_1976 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 1762766 |
End bp | 1764199 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | ACX92187 |
Protein GI | 261602584 |
COG category | |
COG ID | |
TIGRFAM ID | |
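The start, end, and length fields above are consistent with one another. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 1762766
end_bp = 1764199

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1434 477
```

Both results match the Gene Length (1434 bp) and Protein Length (477 aa) fields.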
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCGCT CAGATAAATT CTCCAATAAA GAGAAAATGA GACGTGGTTT ATCCACAACC ACAATAATAG GAATAGTAGT GGCAATAGTA ATAATCGTAA TAGGAGCCGT TGCTGCGGTA ACTCTACTTA GTCATAAGCC GTCACAAGTA GTATCTACCA CTTCTCCGTC TACTTCGCAA TCAGCAACGT CAACATCTCC ATCACAAGTT ATTACCATAA CCTATTTTGA CGATCTATCG CCATCTGAAG CCAACATAAC ACAGAAAATA ATAATTCCAC AATTCGAGGC AACACATCCT AATATTAAAA TTAATTATGT TGATGAAAGT GCTGACGATA TAGTAAAAAG TGTTGTAGAG TTAGTAAAGA GTGGTAACGT TGGCCCAGTT ATCATTGGTG AAGATAATCT TGTTATAGGA GAATTGCTCA ATGGTAACTA TTTAATGAAC CTAACTCCAT ATGTAAATCA AATCTTACAG AATGTTAGTC TAATTCCATC GATGCAATCA TTAGTCCAGT ATGAACAAAA AGTATATCAT GGTACTTACT TCATTCCTCT TAGAGCCAAC ATTCCTCTAG TCTGGTATAA CGCAAGTCTC TTTAGGCAAT TAGGTTTGGC TTCTCCACCA GAAAACTGGT CACAATTGCT ACAGTACGCA AAGATAATTT ATGATAAAAC TGGCGTAAAA CCCATAATGT TCCAAGGTCA TGGTGGAGCA AGTACTTATA CTGAGCTTTA CCAGTGGATG GTCCAGGCTG GTGGAAATCC ATTCTTATTT AATGATTCTG GCGATGTACT TGCATTTGAA TATTTGTACA ATCTCTCACA ATATTTTAAC CTAGAGTATA TTCATGGATA TTGGGGAAGC TATAAAGGAT TAATTGATGG TAGTTACTAT TTGATCGATT ATCAATGGCC CTATATCTAT AGTGTTATGG CCAGTGAAGG TGTTAATATG AGCAATATAG GATTCTACCC TGGCCCTACC GGCCCTGTTA ACGGTGATCA CTTAGTAGGT GGTGATGTAT TAGCAATACC TAAGGGAGCA ACACACATTG ATGCGCTAAT AGAATTCGCT AAGTTCTTAC TATCTCCGCA AGTTCAAAGA GAATTTATCA TTTACCTCTC ATGGCCAGCA GTAAATCAAC AAGCTTATCA GAATCTCCCA AGTAATATAA GTTCACTATA CAAAGCTGAA GAAGAAGCTT TACAGAACGC ATTCTTCAGA GAACCAGTAC CTTGGATAAC TGTTTGGGGA CAGATAGCAG ATAACGTGTT TAATCAAATA ATCGTAAATC ATGCACCATA TTCACAAATA CCTCAAATAT TAAGTCAAGC AAATAAGGAA ATGTACCAGT ATCTCTTACA AAACTATAAT GTAACGGTAG CACAAGAGTA TGAGGAAGGA GTGTTTGGAC CACTATACGG GTGA
|
Protein sequence | MSRSDKFSNK EKMRRGLSTT TIIGIVVAIV IIVIGAVAAV TLLSHKPSQV VSTTSPSTSQ SATSTSPSQV ITITYFDDLS PSEANITQKI IIPQFEATHP NIKINYVDES ADDIVKSVVE LVKSGNVGPV IIGEDNLVIG ELLNGNYLMN LTPYVNQILQ NVSLIPSMQS LVQYEQKVYH GTYFIPLRAN IPLVWYNASL FRQLGLASPP ENWSQLLQYA KIIYDKTGVK PIMFQGHGGA STYTELYQWM VQAGGNPFLF NDSGDVLAFE YLYNLSQYFN LEYIHGYWGS YKGLIDGSYY LIDYQWPYIY SVMASEGVNM SNIGFYPGPT GPVNGDHLVG GDVLAIPKGA THIDALIEFA KFLLSPQVQR EFIIYLSWPA VNQQAYQNLP SNISSLYKAE EEALQNAFFR EPVPWITVWG QIADNVFNQI IVNHAPYSQI PQILSQANKE MYQYLLQNYN VTVAQEYEEG VFGPLYG
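The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be removed before computing anything from them. The following is a minimal sketch of that cleanup together with a GC-content calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first two blocks of the gene sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence, as displayed in the record.
prefix = clean_sequence("ATGTCTCGCT CAGATAAATT")
print(len(prefix), gc_percent(prefix))
```

Applying the same two helpers to the full gene sequence should give a length that can be compared with the Gene Length field and a percentage that can be compared with the GC content field.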