Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_0753 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | - |
Start bp | 696223 |
End bp | 698280 |
Gene Length | 2058 bp |
Protein Length | 685 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | ACX91007 |
Protein GI | 261601404 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.371532 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTCTC GTAAGGCATT TAAGGGCTTA AGTAGAACCT TTATAGCAAT AATTGTTGTT GTAATTGTTG TAATTGGAAT AGCTGTTGGA GTATTATTAG GTAATCACCC ATCTAGTAAT AATATAAGCA CTACTACAAC TTCTTCTAGT ATTATCTCTA CAACTACCCT ATCATCAACA TCTTCCTCAA CATCTCCTCC TAATAGCGTT TCAATACCTT CTAGTATTAC CGTGGAAGAA GCTGCAACAC CAGTAAGTGT AGATCCGGCA AGTAGCTATG ATATAGCAGG TGGGGAAATT ATTCAGAATG TTTATCAAAC TTTAGTATTT TATAATGGGA CTAATACATC TTCCTTTATT GGAGTTTTGG CAGAAAATTG GACTGTAGAG AACAATGGTA CTACTTATAT CTTCCATCTA TGGCCCTTTA TTACTTTCAG TAATGGTAAT CCTCTTAACG CTACAGATGT ATGGTTTTCA ATATACAGGA CGATGCTTAT TAATTTAGGT ATATCAATTT ATACTAGTCA AGCTTTAGCT GTTAATAATG GTCTTGGTTT TGTAGGAAAA TTACCCAACG GGAAATACGG TACAATAATG CTACCTAATG GCATACTACA AGCATTAGAG TATGCTGGAT ATAATTTTTC GTCAAATAAA ACTATTGCGA TGGAACAAGC TGCATATGAT TTGGCATACA TTTTATCCCA TTTTAATGTA AGCAATACCA CAATTCAGAA GGTAATGTCA TATCCTCATC AGGCAGTAGT AGTTATCGAT CCATATACCG TAGAGTTTAA TTTAGACTAC CCATATTCAG CATTTCTAGC TGCGCTTTCC ACAAGTACTG GGGCTATAGT AAATCCAGTT TTTGTTGATG AGAATGGCGG AGTTCAAATT GATACTTCTA ATACTTATCT TTCCACTCAT GCTTTAGGCT CTGGACCCTA TATTCTGGAA ACTCCAATAG GAGGGTCTTA CGTAGTGCTA AATGCTAGTC CTAACTATTG GGCAAGTAAA GTCCCTACAA AAGATCTAAA TCCGATGTTG GAGACACCTA AGATTAAGAC GATAATTATA GATTATCAAA CTAACGAGGC AGTAAGAATA TTAGACTTAC AGCAAGAGAA GGCACAAATT TCGCAGATTG ACGTGATAAA CTTACAGGAG CTAATAGGTA GCTCTGGTGT GCAACAACTT CAGAATCTGG TTAATGGAAA GACGTTCCCA ACAACCTATA CCAGTGGTAA TGTGACAATT TATATTTGGG GGCCTTCAGC ACAAATAGAC TTCTTGGCAA TAGATGCATA TCAATACCCA TTTAACATAA CAGCAGTAAG ATTGGCCATA TCACATGCAA TAAACCCTGT ACAAATTCAA CAACAAGTCT ATAAGGGTTT TGCAATAAAC TATGTTGGAC CTTTAGATCC ATCATTACCA TACTATAACT CATCAATAAT AGGTTATACG TATAATCCCT CCCTTTCAAT ACAACTACTA GAAGAAGCAG GATTCAAATT AACACTACCT AATGGAACTA CAGTAAATCC AAATGGAAAA CCTTTCCCAA CAATTACCTT AACATATCAA ACTGGTAGTA CAGCTCTACA AGATGAGGCA TTACTGGTTC AACAACAGCT AGCTCAAATA GGAATAACAG TTCAGCTAAA TCCTGAATCC GCGGTAACAA TAGTAGAATC GTATCTAAAT CCACCCAATT CATCAGCATA TCCTGCCTTC CAATTAGCCG GTAACTTCCC TCCAGTGCTC AGCCCCATAG ACCCAGCAAT ATACTTACTG TCTCAAGCTA GATTACACCA CGGAAATCCA GCTTTCGTAG ATAATAGTAC GATTAATCAG TTAATCGTAG AGGCTGTAAG AACCAATAAC CCCCAGCAAT TACAGCATAT ATATAATGAA ATAACTTTAC TAACCTTAGC ACAAGCACAG TATGTATGGT TAGATGACTT TTTAGCCTAT ACGGTAGCAT CGTCAAGTAT TCATGGATTC TGGTATAGCC CCGGATTAGA TGGGTTATTC TATGCTGACT TATACTGA
|
Protein sequence | MKSRKAFKGL SRTFIAIIVV VIVVIGIAVG VLLGNHPSSN NISTTTTSSS IISTTTLSST SSSTSPPNSV SIPSSITVEE AATPVSVDPA SSYDIAGGEI IQNVYQTLVF YNGTNTSSFI GVLAENWTVE NNGTTYIFHL WPFITFSNGN PLNATDVWFS IYRTMLINLG ISIYTSQALA VNNGLGFVGK LPNGKYGTIM LPNGILQALE YAGYNFSSNK TIAMEQAAYD LAYILSHFNV SNTTIQKVMS YPHQAVVVID PYTVEFNLDY PYSAFLAALS TSTGAIVNPV FVDENGGVQI DTSNTYLSTH ALGSGPYILE TPIGGSYVVL NASPNYWASK VPTKDLNPML ETPKIKTIII DYQTNEAVRI LDLQQEKAQI SQIDVINLQE LIGSSGVQQL QNLVNGKTFP TTYTSGNVTI YIWGPSAQID FLAIDAYQYP FNITAVRLAI SHAINPVQIQ QQVYKGFAIN YVGPLDPSLP YYNSSIIGYT YNPSLSIQLL EEAGFKLTLP NGTTVNPNGK PFPTITLTYQ TGSTALQDEA LLVQQQLAQI GITVQLNPES AVTIVESYLN PPNSSAYPAF QLAGNFPPVL SPIDPAIYLL SQARLHHGNP AFVDNSTINQ LIVEAVRTNN PQQLQHIYNE ITLLTLAQAQ YVWLDDFLAY TVASSSIHGF WYSPGLDGLF YADLY
|
| |