Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_0482 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 428412 |
End bp | 430493 |
Gene Length | 2082 bp |
Protein Length | 693 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | ACX90763 |
Protein GI | 261601160 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.181954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAAGG AATTAGTATT AGAAGTAGGT GTAATATTCT CGATATCTGT AATGCTCTTT TCAATCAGTG GTATAATGAT AGCCAACTCA GCATCTTCAC CATTTCCGTC AACCCTTTAC TTAGGATGGT ATAATTCTAA CGTCGAAGCT TATTCTTCAT TTGCTACTTA TAATCCCAAT ATATTTGCAG GAGGTTTAGG TGGATCTTTT TACGGATTGC TCTATGCATA CTCTGCAATA ATTAATGTTA GTAATTCTCA AGCGATTCCC GTGATAATAA CTTCATGGAA CTTCTCTCCA GCAAGCTGGC AGCAGATTTG GCAGACAAAA CCGGTTAATG TGACGCTGAT TATACGCAAT GATTCTGGTT GGAGTAATGG GCAACCTCTC ACTGCTTACG ATCTTATGGC CACATGCCTA ATTTTAGATA TGTTTGGTGC TCCGCCATTT CCTAATTATA CTGTTATTAA TAATTACACG CTAGTTGAGA CTTGGCCTCC AGATACGTTA TCACCGATAT TGTTTACAGA CACGCTAATT AATACGGTAG GCTTAGGGGA AGTTGCAATT ATAATACCAT ACCAAGTTTG GAAGCCTATA ATTAGTCAGA TAATGGGCAA TTGGACAACA ATACAAAATA CAAGCAATAT GAAACTTGCG CAGCAGATTA TTAAGAATAT TAGGTCAGAA ATCCTTCACT TTAGACCAAA CCCTGCCACA TATCCTTATA GTGGTCCGTT CTACCTATCA CAGTTATCAT CGAATGAAAT AGTACTAAAT AAGAATCCAT ATTATTATGA CGCTAAATAT ATACCATTCA ATCAAGTAAT AATATATCAA TACGGTCAGG CTGACTTAGC AGCAGCTGCA ATAACGGGTG GTCAAGTTTC TATGGACTGG TCCGGTTTGA CAGGATTATC ACCAACGCAG CTAGAAAGTT TGCCGACCAC ACTTGAGGTA ATTAATATAC CTCAACCATT TGGCTGGGGT ATAGCATTCA ATCTGAAGAA TCCATGGTTA AGAGAGTATC AAGTAAGGGC GGCAATTGCA TATATTCTAA ATAGGACTGC CATAGCTTCA GTGGGAGGTC CTTTAACAGC ACCGGTTACT ATTCCTAATG CAATACCCAA TTTATCGTAC TATTCATTCA TGACATCTTC GCAATACTAC TCATCTTTAA ATCCGTATAA TGTTAATTTA ACAAAGGCAG CACAGCTGCT TAAGAGTGTA GGGTTTTACC AGAAGTCTGG AGTTTGGTAT ACTCCAAATG GTACACCGTT TACTTTAACA ATAGGTGCTG GAAGTCCACC AACGCAAGCA CTAGCAATGA TGGAAGAAGT ACAAAAAGAA TTACAGCAAT TTGGAATTAA CGTACAATTA CACATATATA CTGTAGTTAG TCAGTGGCAT CAAGCATGGC AAAATGGTAC CGGCTATGAT TTATGGTTTG AGAATTGGGG TAGTTCATAC TCGCCGGGGA CTGCACCATG GAGTTTGGTA CTTTCTTATT TTGGAGGATA TCCATGGAAT GTTACACAGT GGAATGAGAA TCTTACTTTA CCTAATGGTA CTATAATTGA CTTCCATAAA TTACTTGAGG AAACCGAGTC TCCTAATACC ACACAACAGT TAATACAAGC TAATCAAGAG TTATCATATT ATATGAACCA ATACTTACTT CCAATCTTAC CTCTAGTTGA GATCGAGAAT TACGTCATTG TAAATCCCTC GCTACTTATT GCAGCTCCAC CTGCTAATTC GTGGATATGG GAAGAGGCAC AATACGGTAT AGGTGGTACT GCTATGGTAC AAGCTCTAAT AGATTATTGG TACGCCCCAT TATATGAGAG CACTATAATT ACTACAACTA CGACTTCTAT TTCTACTACT ATAACTACAA CAACCACTTC CGCAACAACA GCTACCACGT CTGTAACTAC GACTACGTCT GTAACTACGA CTTCTATTTC TACTACTACA GTTACTGTGA CTTCTACTTC AACTATTCCA ATTATAATAG CAATAGTTAT AGTTATTATT GTTATAATTG CTGCTGTGGC AATATTAATG AGAAGGAGAT AA
|
Protein sequence | MRKELVLEVG VIFSISVMLF SISGIMIANS ASSPFPSTLY LGWYNSNVEA YSSFATYNPN IFAGGLGGSF YGLLYAYSAI INVSNSQAIP VIITSWNFSP ASWQQIWQTK PVNVTLIIRN DSGWSNGQPL TAYDLMATCL ILDMFGAPPF PNYTVINNYT LVETWPPDTL SPILFTDTLI NTVGLGEVAI IIPYQVWKPI ISQIMGNWTT IQNTSNMKLA QQIIKNIRSE ILHFRPNPAT YPYSGPFYLS QLSSNEIVLN KNPYYYDAKY IPFNQVIIYQ YGQADLAAAA ITGGQVSMDW SGLTGLSPTQ LESLPTTLEV INIPQPFGWG IAFNLKNPWL REYQVRAAIA YILNRTAIAS VGGPLTAPVT IPNAIPNLSY YSFMTSSQYY SSLNPYNVNL TKAAQLLKSV GFYQKSGVWY TPNGTPFTLT IGAGSPPTQA LAMMEEVQKE LQQFGINVQL HIYTVVSQWH QAWQNGTGYD LWFENWGSSY SPGTAPWSLV LSYFGGYPWN VTQWNENLTL PNGTIIDFHK LLEETESPNT TQQLIQANQE LSYYMNQYLL PILPLVEIEN YVIVNPSLLI AAPPANSWIW EEAQYGIGGT AMVQALIDYW YAPLYESTII TTTTTSISTT ITTTTTSATT ATTSVTTTTS VTTTSISTTT VTVTSTSTIP IIIAIVIVII VIIAAVAILM RRR
|
| |