Gene Information |
Locus tag | Ssol_0654 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | - |
Start bp | 604035 |
End bp | 605699 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | ACX90924 |
Protein GI | 261601321 |
COG category | |
COG ID | |
TIGRFAM ID | |
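The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is an untranslated stop codon. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 604035
end_bp = 605699
reported_gene_length = 1665    # bp
reported_protein_length = 554  # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1665 0 554
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under those assumptions the reported Gene Length (1665 bp) and Protein Length (554 aa) agree with the coordinates.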
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGGA AATATAGGTA TAGTTTAGCT AAGGGATTAA CATCTACCCA AATAGCAGTA ATAGTAGCAG TAATCGTAAT AGTGATAATA ATAGGAGTTA TAGCCGGCTT CGTTTTAACT AAGGGGCCCT CCACAACCCC CGTAACTACT ACAGTAACTA GCACATTTAC TACAACTACA ACAATACCCA CTACAACTAC GTCAACCCCT AGCAATACAG TGGTCTTCTA CACATGGTGG GGTGGAGGTG ATGGAGGACA AGCACTAAGC CAGATAATCC CTGCAGTTAA GCAATACACG GGCTTACAAA TGCAAACATA TTCTATTCCA GGGGCTGGAG GTACAAATGC AAAATATGCT ATTTTAGCCC TTATACAAGC TGGTAAACCT CCAGCAGCAT TTCAAGTACA TTACGGACCA GAAATGATAA GTTATGTTGA GGCTGCACCC AACGGTATAC ATACTTTTGT CAATATGACT CCTTATTTAG CCCAATGGGG ACTACTTAAT AACGCGGTTT ATGCAGTATT ACAAGCTGGA GCCTATAATG GCACATTACT ATCCGTCCCA ATTAACGTAC ATAGGGGAGC AGTTCTTTAT GTGAATACTC AATTATTAAG GGAATATAAT TTACCATTCC CCTATAACTT TAGTACTCTT GTATATGATA CTGTACAATT AGCTAATCAT GGTGTGAGTC CATGGATTAT ACCCGGTGGA GACGGTGGAT GGGATCAATT TAACGTATGG GAGGATATAT TCTTATATTT AGCTGGGCCA CAAATGTACA ATGAACTAAT ATATGGTACA TTGAACTTCA ATAATCCAAT GGTTCAGAAG ATAATAAATG AAACCAACTA CTTGTTCTTG AACTTCACAA GCTATAACTA TCCCGGTTGG CAATCTATGT CATGGGAACA AGGATTTGCA CTACTAGCTC AAGGTAAAGT CGCATTTCAA GCTAATGGGA ACTGGGTAAC TAATTACGCA AGTTATATAA ATATTTCAGT TTATCCTCCG TTGCCTCAAT ACATAAACAA TTCAAGTGTT TCTGTAGTAG AGACTCCATT CCCAGGCACA CAGCATTACT ATGCATTAGT GATAGATACA ATTGGTATAC CAGTAGGTCC TCAAGAACAA CAAGCTTTAC AACTAGCCCA TTTCTGGTCT TCATATCAAG GGCAGGAAGT CTGGACAAAA TACAAGGCAG TAACCTATTA TAAGAATGGT ACGGATTGGT ATGCTCAACC AGCACAATGG TATGATTATC AACAATTAAT AAACACTTCA GAGCAAAACT TCGTTTATCA GTTATCAGAT GGTGGAGTGT TTGATGACGT TTTCGCCCAG ATAGATTCAG GGTTACTAAC TTTACAGCAA GTTGGTAAGA TTGGATTATC TGCTTGGAAC TCTACATTAG TATCTTCAAT GCAACAAGAA CAAAGTGAAT GGTTAGCGGC AGCTAAACTA GGATTAGGAT ACTTAGGATT CCCTGGTCAT CCTTTTGCTG GGTACTACCC ACCATGGGTT ACAAATCCAT CAGCATATGG ATTAAACACC AATACGCGTC AGACAAGTAA TAGCACAATA CTCTTCTTAC TCCCATTCTT AGCACTATCC CCTGCAATAG CCAACATTGA CAAGAAATAC TATCTCTTAA AGTAA
|
Protein sequence | MKRKYRYSLA KGLTSTQIAV IVAVIVIVII IGVIAGFVLT KGPSTTPVTT TVTSTFTTTT TIPTTTTSTP SNTVVFYTWW GGGDGGQALS QIIPAVKQYT GLQMQTYSIP GAGGTNAKYA ILALIQAGKP PAAFQVHYGP EMISYVEAAP NGIHTFVNMT PYLAQWGLLN NAVYAVLQAG AYNGTLLSVP INVHRGAVLY VNTQLLREYN LPFPYNFSTL VYDTVQLANH GVSPWIIPGG DGGWDQFNVW EDIFLYLAGP QMYNELIYGT LNFNNPMVQK IINETNYLFL NFTSYNYPGW QSMSWEQGFA LLAQGKVAFQ ANGNWVTNYA SYINISVYPP LPQYINNSSV SVVETPFPGT QHYYALVIDT IGIPVGPQEQ QALQLAHFWS SYQGQEVWTK YKAVTYYKNG TDWYAQPAQW YDYQQLINTS EQNFVYQLSD GGVFDDVFAQ IDSGLLTLQQ VGKIGLSAWN STLVSSMQQE QSEWLAAAKL GLGYLGFPGH PFAGYYPPWV TNPSAYGLNT NTRQTSNSTI LFLLPFLALS PAIANIDKKY YLLK
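The sequences above are grouped into space-separated blocks of ten residues, so the spaces have to be stripped before any analysis. The following is a minimal sketch of how the Gene sequence value might be cleaned and checked. The helper names (`clean_sequence`, `gc_percent`, `looks_like_cds`) are hypothetical and not part of any database tool. The CDS check is deliberately rough: it only accepts an ATG start, although translation table 11 also permits alternative start codons.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace used to group residues into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_cds(dna: str) -> bool:
    """Rough check: length divisible by 3, ATG start, standard stop codon at the end."""
    seq = clean_sequence(dna)
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# Example using only the first two blocks of the Gene sequence field.
first_blocks = "ATGAAAAGGA AATATAGGTA"
print(clean_sequence(first_blocks))  # ATGAAAAGGAAATATAGGTA
print(gc_percent(first_blocks))      # 25.0
```

Passing the full Gene sequence value to `gc_percent` gives a figure that can be compared with the GC content field, and `looks_like_cds` can be used to confirm that the sequence is shown in coding orientation even though the gene lies on the minus strand.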