Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_2811 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | - |
Start bp | 2575799 |
End bp | 2578825 |
Gene Length | 3027 bp |
Protein Length | 1008 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | peptidase S41 |
Protein accession | ACX92892 |
Protein GI | 261603289 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.16663 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCAT ACTATATGTA CCCTGATATT AGGGGAGATT TAATATCATT TACTTCAGAT GATGACGTTT GGCTTCTTTC TTTAAAAGAT ATGAAACCGC TTAGGATAAC AAGTGGTTTA GGAGTTTCCA CTAGGCCTAA AATAAGTCCA AGTGGTAGAA AAGTGGCGTT TTCTGTTATT TGGCTTAAGA GCGGTAAGCA AGGTGGAGAT ATCTACGTTG TTGAAGACGG GCAAGCTAGG AGGGTTACCT ATTTTGGTAG TAGGAATAGT AGGGTTGCTG GTTGGATTTC TGAGGACGAG ATTATTGTAA TAACTGACTT TCACACTCCT TTTATTCAAT GGACTGAGGC GTATAAGGTA AATGTAAATA ACGGAAAGAC AGAGAAATTG CCTTTCGGCA TGTTATCCAA TATTGTAATA AAGGATGATA TAATAGTAAT TGCAAGGGGT TATCAAGACT TACCAAACTG GAAAGGGTAT AAGGGTGGAA CTAAGGGTGA ATTATGGATT TCCAGCGATG GTGGTAAAAC CTTTGAGAAG TTTGTTAGTT TAGATGGTAA CGTTAGCTGG CCTATGATAG TTAGAGAAAG GGTTTACTTT TTATCTGATC ACGAGGGAGT TGGTAATCTT TATTCAGTTG ATTTAAAAGG TAAGGATTTA AGGAGACATA CTAATTTCAC TGATTATTAT TGTAGGAATG CCAGTAGTGA TGGTAAGAGA ATTGTTTTTC AAAACGCTGG AGACATATAT TTGTACGATC CAGAAAAGGA CAGCTTAACT AAACTGGATA TTAACTTACC TACCGATAGG AAGAAGAAGC AACCAAAATT CGTTAATGTA ATGGAGTACA TGAACGAAGC TGTTGTAAAT GGTAACTATA TAGCATTAGT AAGTAGGGGC AAGGTATTTT TAATGAGACC ATGGGATGGT CCTTCAGTTC AATTGGGTAA GAAACAAGGT GTAAAATATA GGCAGATTCA AGTTTTGCCT AATGGTGACG TGATAGGAGT AAACGATGAG GACAAATTGG TAATCTTAGG TAAGGACGGT AGTGAGAAGG TTATAAACAA GGATTTTAGT AGAATAGAGA GAGTTAAGGT TTCTCCAGAT GGTAAGAAAG TATTATTATC TAACAATAAA CTTGAATTAT GGGTTTACGA GATTGATAAT GATAACGCTA GATTAATAGA TAAGAGCGAG TACGACTTAA TTTTAGAGTT TGATTGGCAT CCAAATGCTG AGTGGTTTGC TTACGCTTTT CCAGAAGGCT ATTATACTCA ATCAATAAAG CTTGCCCACA TTGATGGGAA GGTTGTTAGG ATAACGACTC CCTATGGATA TGACTTTTCA CCATCATTTG ACCCAGATGG TAGATATTTA TACTTCTTGG CTGCTAGGCA TTTGGACCCA ACTAACGATA AGGTAATATT TAATTTAAGT TTCCAGAGGG TTGTTAAGCC ATACCTAGTA GTTTTAGGAA ATTATTATTC CCCATTTAAC CAACCATTAG ATGAGGCTAA TAGCAACGAC AAAAACGTCA TAATTGAGGG AATCGAAGAT AGGGTAGTTC CATTCCCGAT TGAAGAGGAA AATTACGTGC AAATAGCTGG AGCTAAGAAC AACAAGATCT TCCTATTTTC CTATCCAATA AGGGGGCTTA GATCACAAAC TGGAGATGTG TTTGGTAGGT TAGAGGTTTA TGATCTAGAG AATAAGGCGA AGGAGTTATA TGCAGATAAC GTTTCAAGCT TCTCTTTGTC TAGCGATAAA AGTAAAATAC TTTTAATACT TAAGGATAGT CTAAGGCTAT TTGATGTTAA TGTAAAACCA GATTTTAACT CAACTGGAAG AAAAGGTGGG GTAATAGATT TATCTAGAGT TAAGGTTTAT GTTGAGCCGG AGAAGGAATG GAGGCAAATG CTTAGGGAAA CGTGGAAGTT GATGAAGCAG AATTATTGGA ATGAGGAGAG ATTAAAGAAT TGGGACTCTA TCTTACCCAA ATACGAGAGA CTTTTAGATA GGATAAGTAC TAGATTTGAG CTTTCTGATG TAATTCAAGA GATGCAAGGC GAGACTAGGA CTTCTCATTC CTACGAAACA GCTTACGATT ACGATACTCC GGAGCCGTTG TCAGTTGGTG GTTTAGGTGC TGAGTTTGAG TATGATGAGA GCAATAAATG TTACAAAATT ACAAAGATTT ATGTTGGGGA TTCTACCAAT GAGAATGAGA GAAGTCCATT ACGGGATCCT GGTGTTCAAT TGAATGTTGG AGATTGTATA AAAAATATTG ACGGGGAAGA TGCAAATGGT AACATTTACT CTCATCTAAT AAATAAGGAT CAAGTTATTC TTGACGTAAT AACTGCTGAC GGTAAGAATA AACGCGTCAC GGTTAAAGTA TTAAAAGATG AAAGGTTCTT AATATATAGG TATTGGGTTG AGAAGAATAG GGAATATGTT CACGAGAAGA GCAAGGGTAG ATTAGGATAT ATTCACATAC CAGATATGAT GTATCAAGGA TTCGCTGAGT TTTACAGACT TTTCATGTCT GAATTCCACA GAGAAGGGCT AGTAGTTGAC GTTAGGTTTA ATAGGGGTGG CTTTGTCTCA GGTTTACTCT TAGAGAAGCT ACTCTTGAAA AGAGTTGGCT ATGATTATCC TAGAAATGGA AAACCAATAC CTATGCCTTA TTTCTCTTCT CCTAAGGTTT TAGTGGGAAT AACTAATGAG CATGCTGGGT CCGATGGCGA TATCTTTTCA TTCTTGTTCA AGAAGTACAA GCTAGGAGTA CTTATTGGTA GAAGAACGTG GGGAGGTGTT GTCGGTATAA GACCTAGATA TAGATTAGTG GATAAAACTT ATATTAGTCA ACCAGAGTTT GCTGTTAACT TCGAGGATGT AGGTTTTGGT ATTGAGAATT ACGGAGTAGA CCCAGATATA GTTGTTGAGA TTAAGCCAGA TGATTATGTA AATAATAGGG ATACTCAATT AGATACGGCA ATAGAGTTGG CATTAAAACA ACTTTAA
|
Protein sequence | MKAYYMYPDI RGDLISFTSD DDVWLLSLKD MKPLRITSGL GVSTRPKISP SGRKVAFSVI WLKSGKQGGD IYVVEDGQAR RVTYFGSRNS RVAGWISEDE IIVITDFHTP FIQWTEAYKV NVNNGKTEKL PFGMLSNIVI KDDIIVIARG YQDLPNWKGY KGGTKGELWI SSDGGKTFEK FVSLDGNVSW PMIVRERVYF LSDHEGVGNL YSVDLKGKDL RRHTNFTDYY CRNASSDGKR IVFQNAGDIY LYDPEKDSLT KLDINLPTDR KKKQPKFVNV MEYMNEAVVN GNYIALVSRG KVFLMRPWDG PSVQLGKKQG VKYRQIQVLP NGDVIGVNDE DKLVILGKDG SEKVINKDFS RIERVKVSPD GKKVLLSNNK LELWVYEIDN DNARLIDKSE YDLILEFDWH PNAEWFAYAF PEGYYTQSIK LAHIDGKVVR ITTPYGYDFS PSFDPDGRYL YFLAARHLDP TNDKVIFNLS FQRVVKPYLV VLGNYYSPFN QPLDEANSND KNVIIEGIED RVVPFPIEEE NYVQIAGAKN NKIFLFSYPI RGLRSQTGDV FGRLEVYDLE NKAKELYADN VSSFSLSSDK SKILLILKDS LRLFDVNVKP DFNSTGRKGG VIDLSRVKVY VEPEKEWRQM LRETWKLMKQ NYWNEERLKN WDSILPKYER LLDRISTRFE LSDVIQEMQG ETRTSHSYET AYDYDTPEPL SVGGLGAEFE YDESNKCYKI TKIYVGDSTN ENERSPLRDP GVQLNVGDCI KNIDGEDANG NIYSHLINKD QVILDVITAD GKNKRVTVKV LKDERFLIYR YWVEKNREYV HEKSKGRLGY IHIPDMMYQG FAEFYRLFMS EFHREGLVVD VRFNRGGFVS GLLLEKLLLK RVGYDYPRNG KPIPMPYFSS PKVLVGITNE HAGSDGDIFS FLFKKYKLGV LIGRRTWGGV VGIRPRYRLV DKTYISQPEF AVNFEDVGFG IENYGVDPDI VVEIKPDDYV NNRDTQLDTA IELALKQL
|
| |