Gene Ssol_0361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0361 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp321437 
End bp325363 
Gene Length3927 bp 
Protein Length1308 aa 
Translation table11 
GC content36% 
IMG OID 
ProductPeptidase S53 propeptide 
Protein accessionACX90649 
Protein GI261601046 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0637543 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAGTA GAATAATACA AGTCGTAGTT ATATCCACTT TTTTAGTATT ATCTGTTCTG 
TTTCCCCTAT TATCTCTAGC ATATTCCACA ACTTCTATAA ATCCCAGCTA TCCACAGTCT
AACGTAATCT CAGCTTTACC CTCGAATACT AATATTATTC TTTATTTCTT TATTCCACCA
AAGAATCTTA ATGAACTTTA TCTGATAGCA CAAGAAGTAG CTAATCATCA AATTAAGCCA
CTAAGTAACG CTCAGCTTGT CTCAATGTTT AGTAATCAAG ACAAGGTTAA CGAGAGTATA
AAATATCTTG AGAGTAAAGG TTTTACAATA ATATATAGAA GCCCATTCGA AATTATGGCT
GAAGCCCCAG TTTCCTTGGT TTCGTCAGTT TTTGAGACTA GTTTTGTATT AGCCAAATCA
ACTAATGGTG AGATATACTA CAAGCCAGCT GGTAACGTTA AAATACCGTC TACTCTTAAT
AACTTGCTAA TAGGTGGTTT AACAAACTTC ACTAACGTAT CATTGCCTTT AATACAGTTA
GGTAAATTGG AAAATGGTAA TTTAATACCT AATAAACAAG CTTATTCCTC ATTTGTCTAT
ACTTTCCAAT TCTCAGCAAC TTGGTATACT CCTAAAGTTA TTGAAGGGGC ATACAATATA
ACTCCACTAC TGAACTCTAC TGCCGATAAA AAGGTTACAA TTGCAATAAT AGATGCCTAT
GGCGACCCAG AGATATATCA AGATGTAAAT CTGTTTGATG CCAGGTTTGG CTTACCTCCG
ATAAACTTGA CAGTATTACC AGTTGGTCCT TATCATCCAG AAAATGGATT GTTTACTGGT
TGGTTTGAAG AGGTTGCATT AGATGTTGAG GCAGCTCACG CAGCTGCTCC ATATTCTAAT
ATTTTATTAG TTGTAGCCCC ATCAGCGACC TTAGAAGGAT TATTTTCAGC AATTGATGTT
GTTGTAAGCG AGGATCTAGC ACAAGTAGTT TCCATGAGCT GGGGTCTTCC CGGGATACTA
TTTGGTGCTT CAGGGTTTTA TGCTGTATTC AATGGAATAA TATTTCCTAA TTATCCCTAT
TATGATTACT ACTTTGAGCT TGGTTCTGCT GAGGGTATTA CGTTTTTAGC GTCATCTGGG
GATTTAGGTG CCTATAATGA TTTACCAACT GTTTATGGCT CAGCTAACTA TCCTGCATCA
TCTCCTTTCG TTACAGCTGT AGGTGGTACT TCCCTATTTG CTAACATTAC TAGCGGCTAT
ATTTCCACTT ATAATTCTAC AGGCAATTTT GGTGCCGAGA TTGCGTGGAG TGTTAATCCG
TTGTATTTTG GTGTTATCCA AGGTGGTGTG AGTTCTGGGG GTGGGTATAG TCAATTGTTC
CCAGCTCCTT GGTATCAGCG TTATGTTACT CATTCGAATT ATAGGGCTAT TCCAGACGTT
GCAGCAGACG CTAATCCATA TACTGGGTTT ACAATTTATG CTTTGGGCCA AGAGGTGGTA
ATTGGTGGAA CAAGTTTATC CGCTCCATTG TGGGCTGGAA TCATTGCTGA TATAGATGGA
ATTATTGGCC ATCCGTTAGG TTTGGTAAAT CCGATACTTT ATGAGATTTA TCAAAATACT
ACTTTATATC ATCAAGCTTT CCATCAAATA AGTTTGGGAT ATAATGGTTA TTATTATGCT
AACTCCTCTT ACAATTTGGT TACTGGCTTA GGAAGTCCTA ATGCTGGAAT GTTAGGAGTC
ATCATCAAAC ATTCATTATC CAAGAGTTTA GCGATATCTG TAAGTACTTT CGAGACTGGG
GTTTTTCAAC CTTGGTATTT CTATGGTTCT ACCTTTACAA TAGCCGCGTA CATAACTTAT
CCCAACAATA CTATTGTCAG CCAAGGTAGT TTTAACGCTT ATATATATAC TAGTGAGGGA
TATTTAGCCA CAGTCCCCTT ATCCTTTAAC GGTAGTTATT GGGTTGGCAA CTATACTATA
ACTCCTAATA ATCCACCAAA TCTTTGGGAA ATAGTAGTTA ATGGTAGTTC TGATCAGTTT
ACCGGAGTTG GGACTGTAGA AGTTGATGTT GGTGAATCAA TCAATATTGT ATCCCCAATT
CCTTATCCAT ATAGCTTTCC AATTCCATAC AACTCACCAT TTGGCATTGA AGCTTGGATT
TACTACCCTA ATGGTACACC AGTTGTTAAT CAAAGCGTTA CAGCTTATTT AGTTAGTAAT
GATGGCAAGT TATTAGCTTC TATACCCTTA ACCATGATGG CTCCAGGATT ATATGAGGGT
AGTTACGCTT TACTTCCTCC ATTACCACAA GGTACTTATT TACTAATAGT AAATGACTCA
TATGGTAGTG CTTTCTCTTA TGTATATTTT GGTGAGTATA ATTTTGGCGC AATTTTAACC
CCAATAAATG ATGGTTTTCC AGCTGCGTCC CCTGGACAAA ATATAACTAT AATTGATGAG
GTGCTAACAC CAGAGCTTAC CGGTTTATTT ACTTCTAACG TCACTGCATA TATTTATAAC
CAACACGGTA ACCTTATAGA TCAAGTTAAA CTTACTCCAG CTCCAGACGA AATTCAATTC
GGTGTTTACT TGCTCTTCTT CCTATATTAT GCTAATTTCA CAATACCTTT CGATGCTTCT
CCAGGGTTCT ACAATGTTGT AATACAATCT ATAAGCAATA CTTCTACTGG GTTAGTCAAG
GCGGACTTTA TCACATCATT TTACGTTTCT CCAGCTAATT TGACATTAAA TGTAAAGGTG
AATAATGTCG TATATGAGGG TGAACTTCTG AAAATTTTCG CTAACATAAC TTATCCCAAT
GGTACTCCAG TTAAATATGG AATGTTTACT GCTACGATCT TGCCGACTTC TCTAAATTAC
GAGCAATTAA TTATAGGATT TGAGGCTGGA ATACCCTTAC AATACAATTC TACTTTAGGA
GAATGGGTAG GTATTTATAG TATTCCATCC ATATTTTATG GTTCAATATT TCAAGGCTCC
TCTGTGTATT CGTTAGCTGG ACCTTGGAAT GTTATAGTAT CTGGCGTTTC TTGGAATGGT
TATAATCTAT ATTCTACTCC AAGCTCCTTT AACTTCGTTA ATGTTATGCC ATATACTTTC
ATTAACAACA TTGTAGTGAG CAGTAAATCC CTAGATTCAC CCCTATTATC CAAAATTAAC
TCTACAACAT ATATGTTATC TAATGTCAAA TCAAATAATA TAACAATTAA CGGAATGAAC
GTTATTTTAA GTAACGTTAT AGCTAATACG GTAACCGTTA AGAACTCTAA TATAATGATA
ACTTCATCCA CAATTAACCA GTTAGTGTTA GACAATTCTT CAGTCTCAAT TATAGGGTCT
AAAATAGGAG GGGATAATAT TGCTGTAGTC GCTAATGATT CTAATGTAAC AATAGTCTCG
TCAGTAATTC AAGACTCAAA GTACGCGTTT CTACAACCCA ATTCAGTAAT TAGTCTAAGC
GGTGTTAATA TGTATAACGT TACTAGTTTG TCTTCAATAC CAGCACCTAG GATTACGTAC
CTTTCAACAA CTAATGTTAC CACATCTAAG GAATCAATAA TTGTTAATAT CACCGGTGAA
TACCTAAGAC TTTTAGGAGT TTCAATGAAC AACAAACCAG TAGGTTATAG TGTAATTTCA
TCTTCACCTT CATCAATAAG TTTAAGTATA CCTTTTAATG CTTCTCAACT TTCTGATGGT
CAATACATAT TTACAGTAAG CATATCCGAT GGATTGCCAT ACAATTTAAC ATTTAACCTC
TTAAATAATT ATCATCTCAT AATTGTACAA GACCATCTTA AAGCACTACA AGGATCAGTG
AATTTATTAA CAGTAATCGC AATAATTTCC TTAATAATAG CAATAATAGC AGTAGCTCTA
CTATTCGTAT TTACGAGAAG GAGGTGA
 
Protein sequence
MESRIIQVVV ISTFLVLSVL FPLLSLAYST TSINPSYPQS NVISALPSNT NIILYFFIPP 
KNLNELYLIA QEVANHQIKP LSNAQLVSMF SNQDKVNESI KYLESKGFTI IYRSPFEIMA
EAPVSLVSSV FETSFVLAKS TNGEIYYKPA GNVKIPSTLN NLLIGGLTNF TNVSLPLIQL
GKLENGNLIP NKQAYSSFVY TFQFSATWYT PKVIEGAYNI TPLLNSTADK KVTIAIIDAY
GDPEIYQDVN LFDARFGLPP INLTVLPVGP YHPENGLFTG WFEEVALDVE AAHAAAPYSN
ILLVVAPSAT LEGLFSAIDV VVSEDLAQVV SMSWGLPGIL FGASGFYAVF NGIIFPNYPY
YDYYFELGSA EGITFLASSG DLGAYNDLPT VYGSANYPAS SPFVTAVGGT SLFANITSGY
ISTYNSTGNF GAEIAWSVNP LYFGVIQGGV SSGGGYSQLF PAPWYQRYVT HSNYRAIPDV
AADANPYTGF TIYALGQEVV IGGTSLSAPL WAGIIADIDG IIGHPLGLVN PILYEIYQNT
TLYHQAFHQI SLGYNGYYYA NSSYNLVTGL GSPNAGMLGV IIKHSLSKSL AISVSTFETG
VFQPWYFYGS TFTIAAYITY PNNTIVSQGS FNAYIYTSEG YLATVPLSFN GSYWVGNYTI
TPNNPPNLWE IVVNGSSDQF TGVGTVEVDV GESINIVSPI PYPYSFPIPY NSPFGIEAWI
YYPNGTPVVN QSVTAYLVSN DGKLLASIPL TMMAPGLYEG SYALLPPLPQ GTYLLIVNDS
YGSAFSYVYF GEYNFGAILT PINDGFPAAS PGQNITIIDE VLTPELTGLF TSNVTAYIYN
QHGNLIDQVK LTPAPDEIQF GVYLLFFLYY ANFTIPFDAS PGFYNVVIQS ISNTSTGLVK
ADFITSFYVS PANLTLNVKV NNVVYEGELL KIFANITYPN GTPVKYGMFT ATILPTSLNY
EQLIIGFEAG IPLQYNSTLG EWVGIYSIPS IFYGSIFQGS SVYSLAGPWN VIVSGVSWNG
YNLYSTPSSF NFVNVMPYTF INNIVVSSKS LDSPLLSKIN STTYMLSNVK SNNITINGMN
VILSNVIANT VTVKNSNIMI TSSTINQLVL DNSSVSIIGS KIGGDNIAVV ANDSNVTIVS
SVIQDSKYAF LQPNSVISLS GVNMYNVTSL SSIPAPRITY LSTTNVTTSK ESIIVNITGE
YLRLLGVSMN NKPVGYSVIS SSPSSISLSI PFNASQLSDG QYIFTVSISD GLPYNLTFNL
LNNYHLIIVQ DHLKALQGSV NLLTVIAIIS LIIAIIAVAL LFVFTRRR