Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_2138 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 1921295 |
End bp | 1924003 |
Gene Length | 2709 bp |
Protein Length | 902 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | glycoside hydrolase family 57 |
Protein accession | ACX92341 |
Protein GI | 261602738 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.205932 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAAA TTGCAATTTT AGCCATGGGT AATTTGCCTA AAACAGCGAA GGCATTTTTA ACTCTATTTT TTCTTCTAAG TTTAATTTCT TGTTCATTCC TCATTCCAAC ATCACAGTCA ATAAGCGTAA ACTTTACTGT GTCAAGCAAT GGTATGGTAT CCATTTATAT TGGAGGTATA CCATGGAATA ACACCGTTAT TCTACACTAC GGAATAGAGA ACGGACCACA ACAAGCGTGG ACTAACATCT CAAACGTTGT AATGAAATGG AATGGGCAGA ATTTCAGCGC AACCATAGGG CCTTTCGCTA ATGGAACCTG GATAGGTTGG GTGTTTTACG ATAATACTAC TGGACAATGG ATAAATTACG ATAATCATCC ATTCTGGAAT TGGAATTTAG AGGTTAATCC TCCAAATGTA GGGCAAACTT ACGCTACAGT GCTTCAAAAC GGTTCAATAT TAATAACTGC AATAGGAAGA GCACCAGACC AGCTCGATAT ACATTATGGG CTAACTACTG GACCCCAAAC GGGATTACCA TGGGGTAATA TAACTGATGA ATTGATGACT TACAATCCAT TATGGGGTAA TTATACGATA ATCATAGGTC CTTTTAAGCC AGGCCAATGG GTACAATGGG TCTATCACGA TCTGACTCTA AATCAATGGT ATCACAACAC ATCTGGTCAA AACTTCGCAA TACAAGATGT TTATTCATTC ATTCAATACA TCAATGCTTC ATATAATAGA TATGTTTACG TTGAGGGTCA GCCAGTAAGT GTTATAATAT ATCTACAGAA CACTATTTCG CAAGCGATTA ATTCCTCAAT TTCGATTCAG ATCGCTGGCA AATTATACAA TATCTCCTCA ACGTTAAAAC CGGGATATAA TCTATTATCC CTAACACTAG ATACCTCCCA AATACCACAA GGTATATACT ATCCACAATT AAGCATTTAC GTGAATAATA CTCTTCAAAG ACAAGCAACT CTACCACAAT TATATGTCCT AAACACTACT GGAAAGAAGC CCTTAAGTTT AGTAATAGTC TGGAATATGC ACCAGCCTCT TTATGTAGCT CCAAATGGTA GTTGGGAGCA ACCTTGGGTA TGGTTACACA CTGGGCAAGA CTTCTACTGG GATGGAAGTT TAGTCGGAGC CTATGAACTT CAAGCGTTAT TGATAAGACA GTTTAACGTT AGCGTAACGA TAGATTTCAC ACCTGTTCTA TTATATCAAT GGGAGACAAT ACTGCACGAG AAGAATTATA GCTTTACGTC TAATTTTGGA ATTATTCCCA ATCATGATAT ATCTGCTGTA AATTATACCA TTAATCTCTA TAGGCAATTA ATTAATGAGG GTAAGGTAGA TGTCCTTACA GTGCCATTTT ATCATCCTTT ACAGCCATTG CTACTTCAAG ATGGTTATTG GAGCGACGTC TTAGCTCAAA TCAGAATGGG TGAGAATTTC ACTCATGAAG TTTTTGGAGT GTGGGCTAAT GGAACGTGGA CTCCGGAAAT GGCATTTGAC ATGGATTTAG TAGGACTTTA TAATGAGAGT AATATATCCT ATACTATACT TGACCAACAA GCCTTCTTAC CTTACGTTAC ATTGGTTAGA GGTTCGTTGA ACCCAGATCA GCCCTTTATC GTAGAGAATA ATCTAGGTCA AACTATTATT GTACTATTTA GAAACACAAC TTTATCTAAC GAATTTGGTT TTAAGTTCTT TAGCCAATCT CCTCAGCTTA CTGCACAAGA ACTTATACAA CAATTGGCTG AAATATATAT GAACAATCCA GGTGGTGTTG TTACAGTTGC TTTAGATGGG GAGAATCCTC TTATTTTCAA TCCAAATACT GGCCCAGCTG ATCTATACGC AATCTATCAA GCTTTATCCG AATATCAAGG GCAATGGCTA ATAACTCAAA CTGCAAGTGA GGCAATAGCT ACTCACAAGC CTTACAGTGT AATAACCAAT TTACTTGTGA ATTCATGGGA TCTTAATCTA AACTATTGGA ATAATGGATA CCTTGGAAAG ACTGAAATAT GGCAAAACGT TTCGCTAGCT AGGGAATATC TAGTAGCTTA CACTGCAGCT GTTGGAGCCA ACATTTCACC ATTAGTCTAT CTACCTTTGA ATGAGACACC TAATTCAACT AATTTATTTG ATACATTATG GAATTACCTA TATGTTGCAG AAGGTAGTGA TTGGACTTGG CAGACTGGTC CTCCAGCCTA TGGTCCATTA TGGTTTAAAG AACAAGCACT GCTCTACACT TCTACGATAA TTTCAGAGGT TAAACAGCAG TTTGATCTAA TAAAACTACA AAGTGTAAAG TTAGATGGAA ATAACTTAAA GCTCGGCATA TATAACGGAA TCAACACTAC GGTACATTTA TTACTGGTAA TTACCAACGG TAAGCAAGAA ATACAATTAC CTATAATATT AAGTCAAGGA CAAAACAACT TTAAGATAAA AATATCTAAT GTGTCCGGTC AACTTCAAAT AGCACTTTAT TCTCCTATTT CTGCTTCACA AGTAGGCTTA ACATTAATAC CCATTAATAG TTATGGATTC CTAGTAGCCC AATATCAAGT TAATCTAAAC AAATCAACAA GTTCCATGGG AACTTACCTG TTGGCGTTAG TTGGTATACT AGTAGTGTCA GCAATAATAG TTATAGTAAT GAAGAGGGGG CACATTTAG
|
Protein sequence | MIKIAILAMG NLPKTAKAFL TLFFLLSLIS CSFLIPTSQS ISVNFTVSSN GMVSIYIGGI PWNNTVILHY GIENGPQQAW TNISNVVMKW NGQNFSATIG PFANGTWIGW VFYDNTTGQW INYDNHPFWN WNLEVNPPNV GQTYATVLQN GSILITAIGR APDQLDIHYG LTTGPQTGLP WGNITDELMT YNPLWGNYTI IIGPFKPGQW VQWVYHDLTL NQWYHNTSGQ NFAIQDVYSF IQYINASYNR YVYVEGQPVS VIIYLQNTIS QAINSSISIQ IAGKLYNISS TLKPGYNLLS LTLDTSQIPQ GIYYPQLSIY VNNTLQRQAT LPQLYVLNTT GKKPLSLVIV WNMHQPLYVA PNGSWEQPWV WLHTGQDFYW DGSLVGAYEL QALLIRQFNV SVTIDFTPVL LYQWETILHE KNYSFTSNFG IIPNHDISAV NYTINLYRQL INEGKVDVLT VPFYHPLQPL LLQDGYWSDV LAQIRMGENF THEVFGVWAN GTWTPEMAFD MDLVGLYNES NISYTILDQQ AFLPYVTLVR GSLNPDQPFI VENNLGQTII VLFRNTTLSN EFGFKFFSQS PQLTAQELIQ QLAEIYMNNP GGVVTVALDG ENPLIFNPNT GPADLYAIYQ ALSEYQGQWL ITQTASEAIA THKPYSVITN LLVNSWDLNL NYWNNGYLGK TEIWQNVSLA REYLVAYTAA VGANISPLVY LPLNETPNST NLFDTLWNYL YVAEGSDWTW QTGPPAYGPL WFKEQALLYT STIISEVKQQ FDLIKLQSVK LDGNNLKLGI YNGINTTVHL LLVITNGKQE IQLPIILSQG QNNFKIKISN VSGQLQIALY SPISASQVGL TLIPINSYGF LVAQYQVNLN KSTSSMGTYL LALVGILVVS AIIVIVMKRG HI
|
| |