Gene Ssol_2109 details

Gene Information

Locus tag: Ssol_2109
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 1886570
End bp: 1890382
Gene Length: 3813 bp
Protein Length: 1270 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: Peptidase S53 propeptide
Protein accession: ACX92315
Protein GI: 261602712
COG category:
COG ID:
TIGRFAM ID:
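
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the stated gene length is consistent with that reading) and that the CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1886570
END_BP = 1890382
GENE_LENGTH_BP = 3813
PROTEIN_LENGTH_AA = 1270


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the table: the span covers 3813 bp, which corresponds to 1271 codons, one of which is the stop.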


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.509088
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATAGGT ATATATTTTT AATGTCAATG CTATTAATTT CCGTTATACC CGTAGTTTTT 
GCATCATATT CCAATATATA CCAGAATCCA GTAACTTTAA AGGGATTTAG AGAAGTAGGA
ACGTTAAATA CAAATCAAGA GGTAGTAGTT ACAATTTTTG TACCACTTAA AAATCTAGAT
CTATTATACT ATTACGCCAG TGCAACCTCA AACCCAGCTT CACCGCTATA CCATAAATTC
TTAAGCCCTC AAGAAGTGCA ACAACTATTC TTACCAACTG AGGAGTATAA CCAAATTCTA
AACTACGTTA AAAATAGCGG ATTTCAAGTA TTATTTACCG CATTAAACTC AGTAATAGTA
GTAAAGGGTA CAGTGGGTCA AGTTGAGAAG TATCTAGGTA CTAAATATAC GGTTTACTCT
AATGGTTCCA TAACCTATTA CACCAATTAC GGATACCCCA AGATAAACGC CTATATATAC
TCCAGTAACG TCTCAATAAT ATTCTTTGCC CATCCATCTA CGTTAATTAC TGAGACCACG
CTAAAGAGCT TTCAGCAAGA GATAAATCAA ACATTTCCAC TTGAAGGCTA TTGGCCAACT
GTATTACAAA AAGTGTATAA TGTTACCACA GAGGGAGAGA ATACTACAAT AGGAATACTG
GACTTTTATG GTGATCCTTA CATTGTACAG CAATTAGCTT ATTTTGATAA GGTTACTGGA
TTACCAAATC CTCCCAACTT TACTGTAGTA CCAATTGGAC CCTATAATCC TAACTTAGGT
ATTCTAACCG GTTGGGCTGG AGAGATTAGT CTAGATGTTG AAGTAGCGCA CGCAATAGCG
CCAAAAGCTA ACATAACGCT ATACATAGCT AATCCAAATA TACCTCTACC CGCTATTCTC
GCATACATTA TAGGTCAAAA TCAAGTTGAT ACGTTATCCC AAAGCTTTAG CATACCAGAA
AGCTTCTTTT CCTACCTTTT TAACGGTCCA TTATTCTATT CATGTGTAGT ATTAAGTGAT
GAATACTATG CACTAGGTTC AGCCGAAGGA ATAACTTTCT TAGCCAGTTC TGGAGATGCA
GGAGGCTCTG GATATAGTAA CGGACCAATA GGCACAGTAG GATATCCCTC AACATCACCG
TTTGTAACTT CAGTAGGAGG TACAACTGTA TATATACAAT TCCCTAATGG ATCTTATTAT
CAGACTGCTT GGTCAAATTA CGGTTTCGTT CCAAATGATG TAAATTATGG TGGTTCAACC
GGTGGTATAA GTATAATTGA GCCAAAACCT TGGTATCAGT GGGAATTGCC TACGCCATCT
ACTTATCCGA ATGGTAAGCT TATTCCAGAA ATTTCCGCTA ATGCCAACGT ATATCCCGGA
GTATACATAG TTTTACCAGG TAACGTAACA GGGATAACTG GAGGTACAAG TGAGTCTTCG
CCTTTAACTG CTGGGTTATT AAGTACAATT GAAAGTTATA CTCATCATAG AATTGGTTTG
CTTAATCCTA TATTAGCTTA CATGGCTGAG AAATACTACG GTAAAGCAAT AGAACCGATA
ACTTTTGGTT ATAATATCCC TTGGGTCGCG TATTATGGTT ATAACTTAGT AACTGGTTAT
GGTACAATTA ACGCAGGATA CTTTGAAAAT ATACTATCCA CAATCAATTT ATCAAAAAAG
GAGCTAAATG TTATAGTTAG CGTTTACAAT ACCTCAATAC CTACCATACC TCCTCAACAA
TTCTACCCTG GACAACGTAT TCTAGTCACT GCTAACATAA CATATCCAAA TGGAAGTCCA
GTACAAACCG GTGAGTTCAA AGCATTAATA GAGAACTATC TAGGGAACTT AACTACGTTT
AACTTGACTT ATAACCCGCT TACTAAGTTA TGGGCTGGTA GTGGCGTTTT ACCCAATAAC
GCTAGTGGTG TTTTATTCGT CTACGCTTAT GGAAGTAGTG ATGGAATAAA GGGAATTGGA
TACTATGAGA CCTTCTCTGG ATATTATGTA ACATTTAGTT ACACGACGAC TTTTACACCA
GTTTATACAG AGCTGGGTAA TGCTGAACTG GGAATTACCT TATCTAACTC ATACTTCCAG
GCACCAATTG GAGTGATGAA TATTACCCTT AATATTTACT CCTATAACAT AACAACAAAC
GCATACACGT TTGTAACGAC GTTAAGTGTA CCTATTAAGA ATGGAGTAGG AGTTATCGAT
TTGCCACCAG ACTTAAGCAT AGGAGATCTA TTGATTATAG CTGAAGGTAA TGCCTATGGA
TTTGACGCAT TTACCAATGG AGTATACATG CAAACCTTAT TCATATTGCC ACAAGTGGTA
GTTGAACCGG GCAGTGTTTC CCCTGGGCAA CACATTACAA TAGAGGGATC AATTATACCG
CCAGTTAACT TACCCAGTAC TACATTCCAA GATGCATTAC AAGGTACTAA CATTACTGCT
AAATTGGTAA GTAGTAATGG TGTCGTAATA AATGAGGCTA ATATACCATT ATCACCAAAT
GGAATCTACT TCGGATATTT GTACATACCT AAAAATACTC CCTCTGGGCT TTATAATGTT
CTACTATTTG CAACCTATTA CTCTTATACT TTGAACACTA CAATTCGAGG ATTCTACTAC
GGTCAAATAT ACGTTTCTAA TCAAGCCACG ATCTCAGTGA AGTCAGTTAA CTATGCATTT
GAGGGACAAA CTGTTTTCAT TTACGCTAAT ATAACAAATG GTACTAATGA AATTAAGTTC
GGAATGTTTA GTGCTACCGT GTACCCCTCG AGCCTCTCAT TTAATTATAC TACGATAAGT
TCAATAATAG AGATACCGCT GTGGTATAAT CCTAAGATAG GAGAATGGGA AGGGAATTTC
ACACTGCCTT CAGCAATTAG TGCAGGAAAT CTAACTTATT TAGCTGGACA AGGATATTTC
GGAGTGCCAT TCAAGGTCTT AATAACCGGA ATTTCAGCCT TAGGTAATCC AACCACTACC
AATTCTGGTA ATGCTTATAC AATCAACGTA TTGCCATATA CCTTATTTAC AAATCAAACC
TTAGATAAGA CGTTACCATC ATATGCAAGT TTAGTTAACG TGAAGATATT GAATGTAAGT
GGCAATCTAT TAAATGACTT CCTTACTAAC GTTATTATCG TTAACAGCAA TGTAAAAATA
TTGAATGGGA ACATATCTAA TATAGTAATT AGAAATTCCA CTGTGTTGAT AATGCAGAGT
AATGCGAATA ACATTACATT ATACAATTCA ACTCTGTACG CCATAGGTGG AAGTATAAAT
GGATTAAACG TAGTTAACTC TAAAGTAGTT CCAATAAACA TTCATATCCA AGGTTTATAC
CCTGAATTAC CAAGTATTTC GATAAACTTA CCTTCTAAGA ACGTAACTGG AACAGTTAAT
GTTACCGTCA ATGTAATTGG TGAAGATGTA AGTAGGATTA ACGTATACTT GAATGGTAAC
TTGATAAATT CATTTACAAC AAATGGGACC CATATAGTAA CTATAAATAC TCAAAATTAT
CCAGATGGTG GGTATAATTT AACAGTAACA GCAATTCAAA GTGATGGTTT AAGTAGTAGT
AATAGTAGTT ATCTGTATTT TGAAAACGGT CTAACTAATC TAAATACTAA GGTGAATGTA
ATATCTAACC AATTAACTAA TGTAAGTAAT AGTTTATCAT CTTCTATATC TTCTTTAAGG
ACTGCATCAT TAGAATATCA GAGTATATCT TTAGCGATCG GTATTATAGC AATAGTTCTG
GCAATATTGG CTCTAGTAAG AAGGAGAAGG TAA
 
Protein sequence
MYRYIFLMSM LLISVIPVVF ASYSNIYQNP VTLKGFREVG TLNTNQEVVV TIFVPLKNLD 
LLYYYASATS NPASPLYHKF LSPQEVQQLF LPTEEYNQIL NYVKNSGFQV LFTALNSVIV
VKGTVGQVEK YLGTKYTVYS NGSITYYTNY GYPKINAYIY SSNVSIIFFA HPSTLITETT
LKSFQQEINQ TFPLEGYWPT VLQKVYNVTT EGENTTIGIL DFYGDPYIVQ QLAYFDKVTG
LPNPPNFTVV PIGPYNPNLG ILTGWAGEIS LDVEVAHAIA PKANITLYIA NPNIPLPAIL
AYIIGQNQVD TLSQSFSIPE SFFSYLFNGP LFYSCVVLSD EYYALGSAEG ITFLASSGDA
GGSGYSNGPI GTVGYPSTSP FVTSVGGTTV YIQFPNGSYY QTAWSNYGFV PNDVNYGGST
GGISIIEPKP WYQWELPTPS TYPNGKLIPE ISANANVYPG VYIVLPGNVT GITGGTSESS
PLTAGLLSTI ESYTHHRIGL LNPILAYMAE KYYGKAIEPI TFGYNIPWVA YYGYNLVTGY
GTINAGYFEN ILSTINLSKK ELNVIVSVYN TSIPTIPPQQ FYPGQRILVT ANITYPNGSP
VQTGEFKALI ENYLGNLTTF NLTYNPLTKL WAGSGVLPNN ASGVLFVYAY GSSDGIKGIG
YYETFSGYYV TFSYTTTFTP VYTELGNAEL GITLSNSYFQ APIGVMNITL NIYSYNITTN
AYTFVTTLSV PIKNGVGVID LPPDLSIGDL LIIAEGNAYG FDAFTNGVYM QTLFILPQVV
VEPGSVSPGQ HITIEGSIIP PVNLPSTTFQ DALQGTNITA KLVSSNGVVI NEANIPLSPN
GIYFGYLYIP KNTPSGLYNV LLFATYYSYT LNTTIRGFYY GQIYVSNQAT ISVKSVNYAF
EGQTVFIYAN ITNGTNEIKF GMFSATVYPS SLSFNYTTIS SIIEIPLWYN PKIGEWEGNF
TLPSAISAGN LTYLAGQGYF GVPFKVLITG ISALGNPTTT NSGNAYTINV LPYTLFTNQT
LDKTLPSYAS LVNVKILNVS GNLLNDFLTN VIIVNSNVKI LNGNISNIVI RNSTVLIMQS
NANNITLYNS TLYAIGGSIN GLNVVNSKVV PINIHIQGLY PELPSISINL PSKNVTGTVN
VTVNVIGEDV SRINVYLNGN LINSFTTNGT HIVTINTQNY PDGGYNLTVT AIQSDGLSSS
NSSYLYFENG LTNLNTKVNV ISNQLTNVSN SLSSSISSLR TASLEYQSIS LAIGIIAIVL
AILALVRRRR
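
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text might be cleaned, checked for GC content, and translated follows. It is illustrative only: the helper names are hypothetical, and the codon table is built from the standard codon assignments, which translation table 11 shares. The alternative start codons that table 11 allows are not handled here; that does not matter for this gene, which begins with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order (also used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence, as displayed in the record.
    first_blocks = "ATGTATAGGT ATATATTTTT AATGTCAATG"
    dna = clean_sequence(first_blocks)
    print(translate(dna))
    print(gc_fraction(dna))
```

Applied to the first three blocks of the gene sequence, `translate` yields the same ten residues that open the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` would be one way to compare against the GC content reported in the table.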