Gene Information |
Locus tag | Ssol_0554 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 500463 |
End bp | 502199 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | |
Product | glycoside hydrolase 15-related protein |
Protein accession | ACX90830 |
Protein GI | 261601227 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000579959 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACAC TTGGATTCAT CTCAAATCAG ATAACATCAG CCCTAATTGA TTTATCTTCA ATTGTTTGGT TTCCAGTTCC TAAGTTCGAC TCCCCGTCAG TGTTCACTAG GTTGTTAGAT GAAGATGGAG GAGAGTTTTC TATATTACCA GAAGGACAAG AAATAATAGC TGTTAAACAA GAATATGTTT ATCCATTAGT ATTAATGACT GCCATACGTA CTAAACAAGG CGAAATAAGT ATTACTGATC TCATACCATT GGGTGAAACA GTAATTATAA GAAAGGTTGA ATCGGAAATT CCTTTTAAGG TCGTATTTAA ACCTAGATTC TATTATTCTC TATACAAGCC CATAATCGAT GGTAATAGAT TTGTAAATCC AAGAGGAAGG GATTGTATGG CGTTTCTTTA CGACTTTTCG GGCGAGGTCA AAAGATCTGG AAATTATGTT TGGAATTTTA GTAATGGGAA AGGATATTTA ATAGCCAACT ATGCCTCTGA CGTTAAACAT GGGGTTTTCA GTGAAAGGGG TTCAACGTTA AATGCCATAT ATGAAAGATC GTTTGAAAAT ACGATAAACT ATTGGAAAAG TATTGATGTG AAAGACGCTA AATCATTTAA TGACCTTTAT AAGGCATCCA TATATACAAT GCTAGGCTCT ATTTATGCGC CTTCTGGAGG AGTAATTGCA GCTCCTACAA CTTCTTTACC AGAAGTTGAA GGTGGAAAGA GAAATTGGGA TTATAGATTT GCATGGGTAA GAGATTCTTC GATCATAGCT GAAGCCTTGT TAGAAGCTGG ATCCATTGTA GAAGCTAGAA GAATAATAAA CTTCTTACTT TCGCTCATAA ATTTCTCGTC AAAGCCATTT TACTATCCCC TATATACGAT AGAGGGCACA ATTCCTCCCC CTGAGAGGGA ATTACGATGG CTATCTGGAT ACAAGAACTC TAAACCAGTA AGAATAGGAA ACGGAGCTTC TTCTCAGATT CAATTAGATA TTGAAGGATT TTTCATTTCG GCTCTTTATA AATATGTAAA GATGACTAAT GATCAAGTGT TTCTGAAAGA CGTTTTTAGT AAAGTGAAGT ACATTGGGGA TTGGATATCA GAGAATTGGA GCTTAAAAGA TTCTGGTATT TGGGAGGATA GGGGGAGTCC TCAACACTAT ACTCACTCTA AAATTATGAT GTGGATAGCA CTAGATAAAA TAGGGAAACT AGCAAACTTA ATCGGATATG CGGACATTTG GGCTAAAGAG AGGGAAAAGC TTAGAAACTG GATATTCACT AACTGTGTAA AGAACAATTA TTTTATCAGA TATTGTGGGA ATACTGATGA CGTAGATTCA TCATTATTAT CAGCACCATT GTATGGGTTC ATTGAAGTTA GTGATAGTAC ATTTATTAAT ACACTAACGA AAATCGAAAA CGATCTAAAA ACCGACGTAT TTGTGAAAAG ATACAAAACT GATTTCATGG GAGAAGCTAA ACACCCATTT TTGTTGACTA CAGTGTGGCT TGCTAGAGTT TATATGAGAT TAGGAAAAAT AGATAGTGCT ATAGAAATCT TGAATAAGAT CAATAAGGTT TCAAGAGAAC TACATTTAGT AGGTGAACAC GTTGATGTGG AAAAAGGGGA GTTTACGGGT AACTTTCCTC AGATTTTTGT TCATGCGCAA TTGGTAATTG CAATAAAAGA GCTTAACGAC ACGTTAACTG ATAAAAATAT TATATAG
|
Protein sequence | MKTLGFISNQ ITSALIDLSS IVWFPVPKFD SPSVFTRLLD EDGGEFSILP EGQEIIAVKQ EYVYPLVLMT AIRTKQGEIS ITDLIPLGET VIIRKVESEI PFKVVFKPRF YYSLYKPIID GNRFVNPRGR DCMAFLYDFS GEVKRSGNYV WNFSNGKGYL IANYASDVKH GVFSERGSTL NAIYERSFEN TINYWKSIDV KDAKSFNDLY KASIYTMLGS IYAPSGGVIA APTTSLPEVE GGKRNWDYRF AWVRDSSIIA EALLEAGSIV EARRIINFLL SLINFSSKPF YYPLYTIEGT IPPPERELRW LSGYKNSKPV RIGNGASSQI QLDIEGFFIS ALYKYVKMTN DQVFLKDVFS KVKYIGDWIS ENWSLKDSGI WEDRGSPQHY THSKIMMWIA LDKIGKLANL IGYADIWAKE REKLRNWIFT NCVKNNYFIR YCGNTDDVDS SLLSAPLYGF IEVSDSTFIN TLTKIENDLK TDVFVKRYKT DFMGEAKHPF LLTTVWLARV YMRLGKIDSA IEILNKINKV SRELHLVGEH VDVEKGEFTG NFPQIFVHAQ LVIAIKELND TLTDKNII
|
| |
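The fields above are consistent with one another, and this can be checked with a short script. The following is a minimal sketch, not part of the original record: the function names `gene_length` and `translate` are hypothetical, and it assumes that coordinates are 1-based and inclusive, that the gene starts with ATG, and that the sequence is given in the space-separated blocks shown above. Translation table 11 assigns the same amino acids to codons as the standard code (it differs only in the permitted start codons), so the standard codon table is used here.

```python
from itertools import product

# Codon table in TCAG order. Table 11 uses the same codon-to-amino-acid
# assignments as the standard code; only the allowed start codons differ.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def translate(grouped_dna):
    """Translate a plus-strand DNA string, ignoring the spaces between blocks.

    Stops at the first stop codon. Assumes an ATG start, so no special
    handling of alternative start codons is needed.
    """
    dna = "".join(grouped_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    length = gene_length(500463, 502199)
    print(length)            # gene length in bp
    print(length // 3 - 1)   # codons minus the stop codon
    # First three blocks of the gene sequence from the record
    print(translate("ATGAAAACAC TTGGATTCAT CTCAAATCAG"))
```

With the coordinates from the record, this gives 1737 bp and 578 residues, matching the Gene Length and Protein Length fields. The first three blocks of the gene sequence translate to MKTLGFISNQ, the first block of the protein sequence.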