Gene Information |
Locus tag | Ssol_2109 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | - |
Start bp | 1886570 |
End bp | 1890382 |
Gene Length | 3813 bp |
Protein Length | 1270 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | Peptidase S53 propeptide |
Protein accession | ACX92315 |
Protein GI | 261602712 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.509088 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATAGGT ATATATTTTT AATGTCAATG CTATTAATTT CCGTTATACC CGTAGTTTTT GCATCATATT CCAATATATA CCAGAATCCA GTAACTTTAA AGGGATTTAG AGAAGTAGGA ACGTTAAATA CAAATCAAGA GGTAGTAGTT ACAATTTTTG TACCACTTAA AAATCTAGAT CTATTATACT ATTACGCCAG TGCAACCTCA AACCCAGCTT CACCGCTATA CCATAAATTC TTAAGCCCTC AAGAAGTGCA ACAACTATTC TTACCAACTG AGGAGTATAA CCAAATTCTA AACTACGTTA AAAATAGCGG ATTTCAAGTA TTATTTACCG CATTAAACTC AGTAATAGTA GTAAAGGGTA CAGTGGGTCA AGTTGAGAAG TATCTAGGTA CTAAATATAC GGTTTACTCT AATGGTTCCA TAACCTATTA CACCAATTAC GGATACCCCA AGATAAACGC CTATATATAC TCCAGTAACG TCTCAATAAT ATTCTTTGCC CATCCATCTA CGTTAATTAC TGAGACCACG CTAAAGAGCT TTCAGCAAGA GATAAATCAA ACATTTCCAC TTGAAGGCTA TTGGCCAACT GTATTACAAA AAGTGTATAA TGTTACCACA GAGGGAGAGA ATACTACAAT AGGAATACTG GACTTTTATG GTGATCCTTA CATTGTACAG CAATTAGCTT ATTTTGATAA GGTTACTGGA TTACCAAATC CTCCCAACTT TACTGTAGTA CCAATTGGAC CCTATAATCC TAACTTAGGT ATTCTAACCG GTTGGGCTGG AGAGATTAGT CTAGATGTTG AAGTAGCGCA CGCAATAGCG CCAAAAGCTA ACATAACGCT ATACATAGCT AATCCAAATA TACCTCTACC CGCTATTCTC GCATACATTA TAGGTCAAAA TCAAGTTGAT ACGTTATCCC AAAGCTTTAG CATACCAGAA AGCTTCTTTT CCTACCTTTT TAACGGTCCA TTATTCTATT CATGTGTAGT ATTAAGTGAT GAATACTATG CACTAGGTTC AGCCGAAGGA ATAACTTTCT TAGCCAGTTC TGGAGATGCA GGAGGCTCTG GATATAGTAA CGGACCAATA GGCACAGTAG GATATCCCTC AACATCACCG TTTGTAACTT CAGTAGGAGG TACAACTGTA TATATACAAT TCCCTAATGG ATCTTATTAT CAGACTGCTT GGTCAAATTA CGGTTTCGTT CCAAATGATG TAAATTATGG TGGTTCAACC GGTGGTATAA GTATAATTGA GCCAAAACCT TGGTATCAGT GGGAATTGCC TACGCCATCT ACTTATCCGA ATGGTAAGCT TATTCCAGAA ATTTCCGCTA ATGCCAACGT ATATCCCGGA GTATACATAG TTTTACCAGG TAACGTAACA GGGATAACTG GAGGTACAAG TGAGTCTTCG CCTTTAACTG CTGGGTTATT AAGTACAATT GAAAGTTATA CTCATCATAG AATTGGTTTG CTTAATCCTA TATTAGCTTA CATGGCTGAG AAATACTACG GTAAAGCAAT AGAACCGATA ACTTTTGGTT ATAATATCCC TTGGGTCGCG TATTATGGTT ATAACTTAGT AACTGGTTAT GGTACAATTA ACGCAGGATA CTTTGAAAAT ATACTATCCA CAATCAATTT ATCAAAAAAG GAGCTAAATG TTATAGTTAG CGTTTACAAT ACCTCAATAC CTACCATACC TCCTCAACAA TTCTACCCTG GACAACGTAT TCTAGTCACT GCTAACATAA CATATCCAAA TGGAAGTCCA 
GTACAAACCG GTGAGTTCAA AGCATTAATA GAGAACTATC TAGGGAACTT AACTACGTTT AACTTGACTT ATAACCCGCT TACTAAGTTA TGGGCTGGTA GTGGCGTTTT ACCCAATAAC GCTAGTGGTG TTTTATTCGT CTACGCTTAT GGAAGTAGTG ATGGAATAAA GGGAATTGGA TACTATGAGA CCTTCTCTGG ATATTATGTA ACATTTAGTT ACACGACGAC TTTTACACCA GTTTATACAG AGCTGGGTAA TGCTGAACTG GGAATTACCT TATCTAACTC ATACTTCCAG GCACCAATTG GAGTGATGAA TATTACCCTT AATATTTACT CCTATAACAT AACAACAAAC GCATACACGT TTGTAACGAC GTTAAGTGTA CCTATTAAGA ATGGAGTAGG AGTTATCGAT TTGCCACCAG ACTTAAGCAT AGGAGATCTA TTGATTATAG CTGAAGGTAA TGCCTATGGA TTTGACGCAT TTACCAATGG AGTATACATG CAAACCTTAT TCATATTGCC ACAAGTGGTA GTTGAACCGG GCAGTGTTTC CCCTGGGCAA CACATTACAA TAGAGGGATC AATTATACCG CCAGTTAACT TACCCAGTAC TACATTCCAA GATGCATTAC AAGGTACTAA CATTACTGCT AAATTGGTAA GTAGTAATGG TGTCGTAATA AATGAGGCTA ATATACCATT ATCACCAAAT GGAATCTACT TCGGATATTT GTACATACCT AAAAATACTC CCTCTGGGCT TTATAATGTT CTACTATTTG CAACCTATTA CTCTTATACT TTGAACACTA CAATTCGAGG ATTCTACTAC GGTCAAATAT ACGTTTCTAA TCAAGCCACG ATCTCAGTGA AGTCAGTTAA CTATGCATTT GAGGGACAAA CTGTTTTCAT TTACGCTAAT ATAACAAATG GTACTAATGA AATTAAGTTC GGAATGTTTA GTGCTACCGT GTACCCCTCG AGCCTCTCAT TTAATTATAC TACGATAAGT TCAATAATAG AGATACCGCT GTGGTATAAT CCTAAGATAG GAGAATGGGA AGGGAATTTC ACACTGCCTT CAGCAATTAG TGCAGGAAAT CTAACTTATT TAGCTGGACA AGGATATTTC GGAGTGCCAT TCAAGGTCTT AATAACCGGA ATTTCAGCCT TAGGTAATCC AACCACTACC AATTCTGGTA ATGCTTATAC AATCAACGTA TTGCCATATA CCTTATTTAC AAATCAAACC TTAGATAAGA CGTTACCATC ATATGCAAGT TTAGTTAACG TGAAGATATT GAATGTAAGT GGCAATCTAT TAAATGACTT CCTTACTAAC GTTATTATCG TTAACAGCAA TGTAAAAATA TTGAATGGGA ACATATCTAA TATAGTAATT AGAAATTCCA CTGTGTTGAT AATGCAGAGT AATGCGAATA ACATTACATT ATACAATTCA ACTCTGTACG CCATAGGTGG AAGTATAAAT GGATTAAACG TAGTTAACTC TAAAGTAGTT CCAATAAACA TTCATATCCA AGGTTTATAC CCTGAATTAC CAAGTATTTC GATAAACTTA CCTTCTAAGA ACGTAACTGG AACAGTTAAT GTTACCGTCA ATGTAATTGG TGAAGATGTA AGTAGGATTA ACGTATACTT GAATGGTAAC TTGATAAATT CATTTACAAC AAATGGGACC CATATAGTAA CTATAAATAC TCAAAATTAT CCAGATGGTG GGTATAATTT AACAGTAACA GCAATTCAAA GTGATGGTTT AAGTAGTAGT AATAGTAGTT 
ATCTGTATTT TGAAAACGGT CTAACTAATC TAAATACTAA GGTGAATGTA ATATCTAACC AATTAACTAA TGTAAGTAAT AGTTTATCAT CTTCTATATC TTCTTTAAGG ACTGCATCAT TAGAATATCA GAGTATATCT TTAGCGATCG GTATTATAGC AATAGTTCTG GCAATATTGG CTCTAGTAAG AAGGAGAAGG TAA
|
Protein sequence | MYRYIFLMSM LLISVIPVVF ASYSNIYQNP VTLKGFREVG TLNTNQEVVV TIFVPLKNLD LLYYYASATS NPASPLYHKF LSPQEVQQLF LPTEEYNQIL NYVKNSGFQV LFTALNSVIV VKGTVGQVEK YLGTKYTVYS NGSITYYTNY GYPKINAYIY SSNVSIIFFA HPSTLITETT LKSFQQEINQ TFPLEGYWPT VLQKVYNVTT EGENTTIGIL DFYGDPYIVQ QLAYFDKVTG LPNPPNFTVV PIGPYNPNLG ILTGWAGEIS LDVEVAHAIA PKANITLYIA NPNIPLPAIL AYIIGQNQVD TLSQSFSIPE SFFSYLFNGP LFYSCVVLSD EYYALGSAEG ITFLASSGDA GGSGYSNGPI GTVGYPSTSP FVTSVGGTTV YIQFPNGSYY QTAWSNYGFV PNDVNYGGST GGISIIEPKP WYQWELPTPS TYPNGKLIPE ISANANVYPG VYIVLPGNVT GITGGTSESS PLTAGLLSTI ESYTHHRIGL LNPILAYMAE KYYGKAIEPI TFGYNIPWVA YYGYNLVTGY GTINAGYFEN ILSTINLSKK ELNVIVSVYN TSIPTIPPQQ FYPGQRILVT ANITYPNGSP VQTGEFKALI ENYLGNLTTF NLTYNPLTKL WAGSGVLPNN ASGVLFVYAY GSSDGIKGIG YYETFSGYYV TFSYTTTFTP VYTELGNAEL GITLSNSYFQ APIGVMNITL NIYSYNITTN AYTFVTTLSV PIKNGVGVID LPPDLSIGDL LIIAEGNAYG FDAFTNGVYM QTLFILPQVV VEPGSVSPGQ HITIEGSIIP PVNLPSTTFQ DALQGTNITA KLVSSNGVVI NEANIPLSPN GIYFGYLYIP KNTPSGLYNV LLFATYYSYT LNTTIRGFYY GQIYVSNQAT ISVKSVNYAF EGQTVFIYAN ITNGTNEIKF GMFSATVYPS SLSFNYTTIS SIIEIPLWYN PKIGEWEGNF TLPSAISAGN LTYLAGQGYF GVPFKVLITG ISALGNPTTT NSGNAYTINV LPYTLFTNQT LDKTLPSYAS LVNVKILNVS GNLLNDFLTN VIIVNSNVKI LNGNISNIVI RNSTVLIMQS NANNITLYNS TLYAIGGSIN GLNVVNSKVV PINIHIQGLY PELPSISINL PSKNVTGTVN VTVNVIGEDV SRINVYLNGN LINSFTTNGT HIVTINTQNY PDGGYNLTVT AIQSDGLSSS NSSYLYFENG LTNLNTKVNV ISNQLTNVSN SLSSSISSLR TASLEYQSIS LAIGIIAIVL AILALVRRRR
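The coordinates and lengths listed under Gene Information can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence above ends in TAA). The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1886570
END_BP = 1890382
GENE_LENGTH_BP = 3813
PROTEIN_LENGTH_AA = 1270


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the span 1886570 to 1890382 covers 3813 bp, which is 1271 codons: 1270 residues plus the stop codon.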