Gene Ssol_2289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2289 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp2085163 
End bp2086773 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content35% 
IMG OID 
Productpeptidase S9 prolyl oligopeptidase active site domain protein 
Protein accessionACX92468 
Protein GI261602865 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATATG AAGATTTATT TAAGATAAAA CCTGTACTCG ATTACGACGT ATATAACGGA 
AAAATAGCAA CAATTATAAG AGATGAAAAA CCCGTCCTTT ACGTCAACAA GGAAAAAGTC
CAACTCGAAG GATATGCCAA AGAAGTAGAC TGGATAAATA GAAACAAAAT GTTTATCACA
GTAGACCCTA ACGGTAGTGA AATTAGGAAA ATTTACCTTT ACGATAACGG GAAGATAGAG
AAAATTATTG ATAATGAGTT TGATAATTTA TCTCCTCGCG AGACAAACGA AGGGATTCTA
GTAATCTCAA ACTATGATAA AAAGACATTA CATCTTTACC TCTACAATGA AGGAAAATAT
ATTAAGCTAA GCAAGGGTGA AGGACCGGTA AACAATTACT GTTTTAATGG CAAATACATA
GTTTACTCGA CTGGGATTTA TGATAATAAC ATTCACGTGA TGGATCTCAG TGGTAATGAA
ATTAATGTAA TTAATATCCC AAACTCAGAG CAAGAGCTTG CAAATGAGAA TTGTTTTACT
TCTCCTTCAT CATTCATCTT TCTCTCAAAT CACGAGGATT TATCTAAGGT TTATGAGTTT
AATATTTTAA AGGGAGAGAT CAGAAAAATA AGGGAAAGCG ATTACGAAAT CTTTGAGGCT
ATTCCATATA AGGGTTCTAT CGCTTATGTT GAGGATAGAC ACGGCAACTT TGTCTTAATT
CACGAAAAAG AGATAGTTAA TGAAGGCTTT ACCTACTCCT TAAAGGTTGA TGGAGATTAT
ATTTATTTTG TAAACTCTAA ACATGATAGA TCAGCAGACC TATACAGATA TGGGAAAAAG
GTAGAGAGGT TAACTGACTC AATGAACGAT GCTAAAGGGA ATTTCATAAA ACCTAAGGTT
GTCTCTTACG ACTCCAATGG GTTGAGGATT TACGCCTTAC TCTATGAAAA AGGTGGTGAG
GATAAGGGTA TAGTTTATAT TCACGGAGGT CCAGATTGGG AATGCGTAAA CTCATTCAAC
CCAGAAATTC AGTTCTTTAT GGAGAGAGGA TTTAAGGTTA TTTGTCCCAA TTACAGAGGA
TCTATAGGTT ATGGAAGGAG GTTTAACCAT TTGAACGATA AAGACCCAGG AGGAGGTGAG
TTGTTAGATG TTATAAATTC AGTGAAGGTC TTAGGAGTTA AAAAGATTGC AATAACTGGT
GCAAGTTATG GTGGCTATTT GACCATGATG GCTACTACTA AGTTCTCGGA CCTTTGGTGT
TCGGCTGTGG CTGTAGTACC TTTTGTTAAT TGGTTTACCG AAAAGAAGCT TGAAAGGGAA
ATACTTCAAC TATATGACGA AATAAAGGTT GGTAATGATG AAAATTTATT GAGGGATAGA
TCACCTATAT TCTTTATTGA TAGGATAAAA ACTTCATTGC TTCTCTTAGC TGGTGAAAAT
GACCCAAGAT GTCCAGCTGA GGAAACTTTG CAAGTAGTTG AAGAACTTAG AAAGTTGGGT
AGAGAAGTGA AATATAAGAT ATACAAAGAT GAGGGACACG GATTTGCAAA AATAGAAAAC
TATGTTGACT CGATAAAAGA GGCTGTGGAG TTTATTACTA GTCACTGCTG A
 
Protein sequence
MKYEDLFKIK PVLDYDVYNG KIATIIRDEK PVLYVNKEKV QLEGYAKEVD WINRNKMFIT 
VDPNGSEIRK IYLYDNGKIE KIIDNEFDNL SPRETNEGIL VISNYDKKTL HLYLYNEGKY
IKLSKGEGPV NNYCFNGKYI VYSTGIYDNN IHVMDLSGNE INVINIPNSE QELANENCFT
SPSSFIFLSN HEDLSKVYEF NILKGEIRKI RESDYEIFEA IPYKGSIAYV EDRHGNFVLI
HEKEIVNEGF TYSLKVDGDY IYFVNSKHDR SADLYRYGKK VERLTDSMND AKGNFIKPKV
VSYDSNGLRI YALLYEKGGE DKGIVYIHGG PDWECVNSFN PEIQFFMERG FKVICPNYRG
SIGYGRRFNH LNDKDPGGGE LLDVINSVKV LGVKKIAITG ASYGGYLTMM ATTKFSDLWC
SAVAVVPFVN WFTEKKLERE ILQLYDEIKV GNDENLLRDR SPIFFIDRIK TSLLLLAGEN
DPRCPAEETL QVVEELRKLG REVKYKIYKD EGHGFAKIEN YVDSIKEAVE FITSHC