Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_0361 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 321437 |
End bp | 325363 |
Gene Length | 3927 bp |
Protein Length | 1308 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | Peptidase S53 propeptide |
Protein accession | ACX90649 |
Protein GI | 261601046 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0637543 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAGTA GAATAATACA AGTCGTAGTT ATATCCACTT TTTTAGTATT ATCTGTTCTG TTTCCCCTAT TATCTCTAGC ATATTCCACA ACTTCTATAA ATCCCAGCTA TCCACAGTCT AACGTAATCT CAGCTTTACC CTCGAATACT AATATTATTC TTTATTTCTT TATTCCACCA AAGAATCTTA ATGAACTTTA TCTGATAGCA CAAGAAGTAG CTAATCATCA AATTAAGCCA CTAAGTAACG CTCAGCTTGT CTCAATGTTT AGTAATCAAG ACAAGGTTAA CGAGAGTATA AAATATCTTG AGAGTAAAGG TTTTACAATA ATATATAGAA GCCCATTCGA AATTATGGCT GAAGCCCCAG TTTCCTTGGT TTCGTCAGTT TTTGAGACTA GTTTTGTATT AGCCAAATCA ACTAATGGTG AGATATACTA CAAGCCAGCT GGTAACGTTA AAATACCGTC TACTCTTAAT AACTTGCTAA TAGGTGGTTT AACAAACTTC ACTAACGTAT CATTGCCTTT AATACAGTTA GGTAAATTGG AAAATGGTAA TTTAATACCT AATAAACAAG CTTATTCCTC ATTTGTCTAT ACTTTCCAAT TCTCAGCAAC TTGGTATACT CCTAAAGTTA TTGAAGGGGC ATACAATATA ACTCCACTAC TGAACTCTAC TGCCGATAAA AAGGTTACAA TTGCAATAAT AGATGCCTAT GGCGACCCAG AGATATATCA AGATGTAAAT CTGTTTGATG CCAGGTTTGG CTTACCTCCG ATAAACTTGA CAGTATTACC AGTTGGTCCT TATCATCCAG AAAATGGATT GTTTACTGGT TGGTTTGAAG AGGTTGCATT AGATGTTGAG GCAGCTCACG CAGCTGCTCC ATATTCTAAT ATTTTATTAG TTGTAGCCCC ATCAGCGACC TTAGAAGGAT TATTTTCAGC AATTGATGTT GTTGTAAGCG AGGATCTAGC ACAAGTAGTT TCCATGAGCT GGGGTCTTCC CGGGATACTA TTTGGTGCTT CAGGGTTTTA TGCTGTATTC AATGGAATAA TATTTCCTAA TTATCCCTAT TATGATTACT ACTTTGAGCT TGGTTCTGCT GAGGGTATTA CGTTTTTAGC GTCATCTGGG GATTTAGGTG CCTATAATGA TTTACCAACT GTTTATGGCT CAGCTAACTA TCCTGCATCA TCTCCTTTCG TTACAGCTGT AGGTGGTACT TCCCTATTTG CTAACATTAC TAGCGGCTAT ATTTCCACTT ATAATTCTAC AGGCAATTTT GGTGCCGAGA TTGCGTGGAG TGTTAATCCG TTGTATTTTG GTGTTATCCA AGGTGGTGTG AGTTCTGGGG GTGGGTATAG TCAATTGTTC CCAGCTCCTT GGTATCAGCG TTATGTTACT CATTCGAATT ATAGGGCTAT TCCAGACGTT GCAGCAGACG CTAATCCATA TACTGGGTTT ACAATTTATG CTTTGGGCCA AGAGGTGGTA ATTGGTGGAA CAAGTTTATC CGCTCCATTG TGGGCTGGAA TCATTGCTGA TATAGATGGA ATTATTGGCC ATCCGTTAGG TTTGGTAAAT CCGATACTTT ATGAGATTTA TCAAAATACT ACTTTATATC ATCAAGCTTT CCATCAAATA AGTTTGGGAT ATAATGGTTA TTATTATGCT AACTCCTCTT ACAATTTGGT TACTGGCTTA GGAAGTCCTA ATGCTGGAAT GTTAGGAGTC ATCATCAAAC ATTCATTATC CAAGAGTTTA GCGATATCTG TAAGTACTTT CGAGACTGGG GTTTTTCAAC CTTGGTATTT CTATGGTTCT ACCTTTACAA TAGCCGCGTA CATAACTTAT CCCAACAATA CTATTGTCAG CCAAGGTAGT TTTAACGCTT ATATATATAC TAGTGAGGGA TATTTAGCCA CAGTCCCCTT ATCCTTTAAC GGTAGTTATT GGGTTGGCAA CTATACTATA ACTCCTAATA ATCCACCAAA TCTTTGGGAA ATAGTAGTTA ATGGTAGTTC TGATCAGTTT ACCGGAGTTG GGACTGTAGA AGTTGATGTT GGTGAATCAA TCAATATTGT ATCCCCAATT CCTTATCCAT ATAGCTTTCC AATTCCATAC AACTCACCAT TTGGCATTGA AGCTTGGATT TACTACCCTA ATGGTACACC AGTTGTTAAT CAAAGCGTTA CAGCTTATTT AGTTAGTAAT GATGGCAAGT TATTAGCTTC TATACCCTTA ACCATGATGG CTCCAGGATT ATATGAGGGT AGTTACGCTT TACTTCCTCC ATTACCACAA GGTACTTATT TACTAATAGT AAATGACTCA TATGGTAGTG CTTTCTCTTA TGTATATTTT GGTGAGTATA ATTTTGGCGC AATTTTAACC CCAATAAATG ATGGTTTTCC AGCTGCGTCC CCTGGACAAA ATATAACTAT AATTGATGAG GTGCTAACAC CAGAGCTTAC CGGTTTATTT ACTTCTAACG TCACTGCATA TATTTATAAC CAACACGGTA ACCTTATAGA TCAAGTTAAA CTTACTCCAG CTCCAGACGA AATTCAATTC GGTGTTTACT TGCTCTTCTT CCTATATTAT GCTAATTTCA CAATACCTTT CGATGCTTCT CCAGGGTTCT ACAATGTTGT AATACAATCT ATAAGCAATA CTTCTACTGG GTTAGTCAAG GCGGACTTTA TCACATCATT TTACGTTTCT CCAGCTAATT TGACATTAAA TGTAAAGGTG AATAATGTCG TATATGAGGG TGAACTTCTG AAAATTTTCG CTAACATAAC TTATCCCAAT GGTACTCCAG TTAAATATGG AATGTTTACT GCTACGATCT TGCCGACTTC TCTAAATTAC GAGCAATTAA TTATAGGATT TGAGGCTGGA ATACCCTTAC AATACAATTC TACTTTAGGA GAATGGGTAG GTATTTATAG TATTCCATCC ATATTTTATG GTTCAATATT TCAAGGCTCC TCTGTGTATT CGTTAGCTGG ACCTTGGAAT GTTATAGTAT CTGGCGTTTC TTGGAATGGT TATAATCTAT ATTCTACTCC AAGCTCCTTT AACTTCGTTA ATGTTATGCC ATATACTTTC ATTAACAACA TTGTAGTGAG CAGTAAATCC CTAGATTCAC CCCTATTATC CAAAATTAAC TCTACAACAT ATATGTTATC TAATGTCAAA TCAAATAATA TAACAATTAA CGGAATGAAC GTTATTTTAA GTAACGTTAT AGCTAATACG GTAACCGTTA AGAACTCTAA TATAATGATA ACTTCATCCA CAATTAACCA GTTAGTGTTA GACAATTCTT CAGTCTCAAT TATAGGGTCT AAAATAGGAG GGGATAATAT TGCTGTAGTC GCTAATGATT CTAATGTAAC AATAGTCTCG TCAGTAATTC AAGACTCAAA GTACGCGTTT CTACAACCCA ATTCAGTAAT TAGTCTAAGC GGTGTTAATA TGTATAACGT TACTAGTTTG TCTTCAATAC CAGCACCTAG GATTACGTAC CTTTCAACAA CTAATGTTAC CACATCTAAG GAATCAATAA TTGTTAATAT CACCGGTGAA TACCTAAGAC TTTTAGGAGT TTCAATGAAC AACAAACCAG TAGGTTATAG TGTAATTTCA TCTTCACCTT CATCAATAAG TTTAAGTATA CCTTTTAATG CTTCTCAACT TTCTGATGGT CAATACATAT TTACAGTAAG CATATCCGAT GGATTGCCAT ACAATTTAAC ATTTAACCTC TTAAATAATT ATCATCTCAT AATTGTACAA GACCATCTTA AAGCACTACA AGGATCAGTG AATTTATTAA CAGTAATCGC AATAATTTCC TTAATAATAG CAATAATAGC AGTAGCTCTA CTATTCGTAT TTACGAGAAG GAGGTGA
|
Protein sequence | MESRIIQVVV ISTFLVLSVL FPLLSLAYST TSINPSYPQS NVISALPSNT NIILYFFIPP KNLNELYLIA QEVANHQIKP LSNAQLVSMF SNQDKVNESI KYLESKGFTI IYRSPFEIMA EAPVSLVSSV FETSFVLAKS TNGEIYYKPA GNVKIPSTLN NLLIGGLTNF TNVSLPLIQL GKLENGNLIP NKQAYSSFVY TFQFSATWYT PKVIEGAYNI TPLLNSTADK KVTIAIIDAY GDPEIYQDVN LFDARFGLPP INLTVLPVGP YHPENGLFTG WFEEVALDVE AAHAAAPYSN ILLVVAPSAT LEGLFSAIDV VVSEDLAQVV SMSWGLPGIL FGASGFYAVF NGIIFPNYPY YDYYFELGSA EGITFLASSG DLGAYNDLPT VYGSANYPAS SPFVTAVGGT SLFANITSGY ISTYNSTGNF GAEIAWSVNP LYFGVIQGGV SSGGGYSQLF PAPWYQRYVT HSNYRAIPDV AADANPYTGF TIYALGQEVV IGGTSLSAPL WAGIIADIDG IIGHPLGLVN PILYEIYQNT TLYHQAFHQI SLGYNGYYYA NSSYNLVTGL GSPNAGMLGV IIKHSLSKSL AISVSTFETG VFQPWYFYGS TFTIAAYITY PNNTIVSQGS FNAYIYTSEG YLATVPLSFN GSYWVGNYTI TPNNPPNLWE IVVNGSSDQF TGVGTVEVDV GESINIVSPI PYPYSFPIPY NSPFGIEAWI YYPNGTPVVN QSVTAYLVSN DGKLLASIPL TMMAPGLYEG SYALLPPLPQ GTYLLIVNDS YGSAFSYVYF GEYNFGAILT PINDGFPAAS PGQNITIIDE VLTPELTGLF TSNVTAYIYN QHGNLIDQVK LTPAPDEIQF GVYLLFFLYY ANFTIPFDAS PGFYNVVIQS ISNTSTGLVK ADFITSFYVS PANLTLNVKV NNVVYEGELL KIFANITYPN GTPVKYGMFT ATILPTSLNY EQLIIGFEAG IPLQYNSTLG EWVGIYSIPS IFYGSIFQGS SVYSLAGPWN VIVSGVSWNG YNLYSTPSSF NFVNVMPYTF INNIVVSSKS LDSPLLSKIN STTYMLSNVK SNNITINGMN VILSNVIANT VTVKNSNIMI TSSTINQLVL DNSSVSIIGS KIGGDNIAVV ANDSNVTIVS SVIQDSKYAF LQPNSVISLS GVNMYNVTSL SSIPAPRITY LSTTNVTTSK ESIIVNITGE YLRLLGVSMN NKPVGYSVIS SSPSSISLSI PFNASQLSDG QYIFTVSISD GLPYNLTFNL LNNYHLIIVQ DHLKALQGSV NLLTVIAIIS LIIAIIAVAL LFVFTRRR
|
| |