Gene Information |
Locus tag | Ssol_2911 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | - |
Start bp | 2665629 |
End bp | 2668862 |
Gene Length | 3234 bp |
Protein Length | 1077 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | Peptidase S53 propeptide |
Protein accession | ACX92985 |
Protein GI | 261603382 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
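The coordinate and length fields in the record above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers introduced here for illustration; they are not part of the database.

```python
# Values copied from the Gene Information table for Ssol_2911.
START_BP = 2665629
END_BP = 2668862
GENE_LENGTH_BP = 3234
PROTEIN_LENGTH_AA = 1077


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count, assuming one terminal stop codon is excluded."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = coding_codons(computed_length)

# Report whether the computed values agree with the listed ones.
print("gene length matches:", computed_length == GENE_LENGTH_BP)
print("protein length matches:", computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions, both checks agree with the listed Gene Length and Protein Length.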
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATGGA GTATATTCTT GCTGATTTTA GCTTTAAGTG ACATTGTATT ACCATTGACT ATTACGAATA TAAACAACCA GTCAATCACA ACATTATCTC CAAACTATTA CTTAACGGTG GCTATAGTTT TTCCACCGAG TAATCTAACT CTTTTACAAC AATATGTTCA AGAACATGTA ATATTAAATC AGACTCAAGT GGAAAAGCTT TTCATTCCAA CAGAGGAAAT ATCAAAAACG TTATCTCAGT TAAGGCAATC AAATATATCC GCAACCAGTT ATATGAATGT AATATTGGCT AGTGGAACTG TTTCACAGCT AGAGAAAGCA TTAAATGGGA AGTTCTACGT TTATGAACTC AACGGAAAGA GGTTCTTTGA ATTTTTTGGT TCTCCAGTAA TTCCTAACGC CATCGTAATA GGTACTAATA TTACTTCACT AATTCTAAAT AAGCCCACAA CACTCTATAA CGTTACACAA GCTGTCGCAT ATAATGCATT AAAGCCTAGT CAATTACTCT ATGCTTATAA TATTAGCTGG TTACACGCCC ATAACATAAC TGGAAAAGGC ACCGCTATAG GCATATTGGA TTTCTACGGT AATCCCTACA TACAGCAACA ATTACAAGAA TTTGACAAAC AGTATAATAT TCCAAATCCT CCATTCTTTA AAATTGTCCC CATAGGAGCT TACAATCCTA ATAATGGTAT TTCAACAGGT TGGGCAATGG AAATCTCTTT AGATGTTGAG TATGCTCATG TTATAGCTCC TGATGCTGGA ATTGTTTTAT ACGTAGCCAA TCCAAATATC CCTCTTCCTG CAATTATAGC ATATATAGTT CAACAAGATG AAGTAAACGT AGTATCGCAA AGTTTTGGTA TCCCCGAACT ATATGTAGAT TTAGGTCTAA TTCCTCGTTC ATATGTCAAT TCGCTAATGT ATGAATATTG GCTAGGTGAG GTTGAGGGGA TTAGTTTCGC TGCTGCTTCT GGTGACGCAG GCGGAAATGG ATACAATTAC TTCTTAGCTC CTCAAGGTTC CGTAATATTT CCAGCTTCTA TACCTTACGT CTTAGCAGTA GGCGGTTCAT CTGTCTACAT AGGTGGGAAT AAGACTATGG AAACAGCGTG GAGCGGTGAA AGCGTGCTAG GAGCGTCTAC CGGTGGATAT AGTACATTAT TTCCCGCTCC TTGGTATCAA GATAGTAATG GTTTTAGAGT GGTCCCAGAT GTTGTAGCTG ACGCTAATCC ATATACTGGT GCTTTTATCT TGTATTACTA CAATCAAACT TACTTAGTGG GCGGCACATC ATTAGCTACA CCAATAGTTA GTGGGATTAT TGACCTAATG ACTCAAAGCT ACGGTAAGCT AGGATTTGTA AATCCTTTTC TTTATGAGCT AAGAAACACA AGTGCATTAT CTCCCATAGG TTTCGGATAT AATACACCTT ATTACGTTAA TTCGTCAGAA TTGAATCCTG TAACTGGTTT AGGATCAATA AATGCCGGAT ATTTATATCA ACTATTACCT AAAGTAATAC ACTCTTCAAG TATTTCTGTG GGGGTTAATA ATATTACATA TTTAGATGGG CAAGTCGTAA AAGTAGTCGC TAATATTACC GGAATAAGAC CTTCAAGTGT AATTGGAATC GTGTATAATG GTAGCTCTGT TGTTCAGCAG TTCTCATTGT CGTTTAATGG AACTTATTGG GTTGGTGAAT TTGTTGCAGA AGGAAGCGGT ATAGAGGAAG TAATAGTAAA AGCGGGTAAT TTAGAAGGTT CTACTTACGT TACTATTGGT 
TATCAAGCTC AATTCATTTT TCCGCCTATT GCATTATTCC CAGAGCCTGA ACCCGTTCCT ATAGTTGTTC AACTTATTTA TCCCAATGGT TCTCTAGTGA GGAATCCTTC AAATCTTACA GCTTTGATAT ATAAATATGA TCAGATGAAT AATAAAATGT CAATTATTTC TAGCGTTCAA TTGCAAAGAA CCTCATTAAT AAACCTTTCC ATTTTAGGTA TCCAAATTGA ATCTAGCTAC CTCACCGGGG TCTATCAGTT GCCAAGCAAT ATCATAAGTG GAGTCTATTT TATTAAAATA CCGAACGTGT TTGGCTTTGA CGAATTCGTT AGTGGCATAT ATATTCTTGA TGCGGTCTAT CCTCCAGTCT TTACAAACCC AGTGGTACTA TCCCCTGGTC AAAACGTTAC AATACTCGCT GAAGCCTTAG CAATAGGATC TCCTAACGTT ACTGTAACTT TCTATAATAT CTCGGGAAAT AAGGTTTACT CTATACCCGT TAATGCAATT ACTTATCAGA ATACTTTATT ATATATAACT CAAATTACCT TGCCAAAATT GAAGCCAGGA TACTACTATG TTGTTACTAA GGCAATTTAC AACGCTTCTA ACTTTACTGC AGAGGGCGTA GGTTTAACTC AAATTTACGT GTCTCCATAC TCACTTAACG TTAAGGTTAG AATAATACCT AATAATAGTA TTGTGTATCA AAATCAACAA ATATACGTTA TAGCAAATAT TACTTATCCT AATGGCACTG AGGTTAAATA CGGTTCATTT TCCGCTATTA TTGTTCCCTC CTATCTCTCC AGCCAGTTCG ATAACTTACA GTTGCAATAC AGTGTTCCCT TAACGTACAT CAACGGAAGT TGGATTGGTC AATTGGAAAT TCCAAGCGGA TCTTCTACTA ATTCCTTAGG CTATTCTACC TATGGTATAT CTGGCTATTG GGATGTTTAT GTGGAGGGTA TATCTGCGGA TGGAATTCCT ACGAATTTCC CTGCCACTCT TGACGTTAAT ACTTTATCCA TAAATCCTAT ATCACCTTCT AGTCAATTTG TTGTCTTACC TTATGTATAT GTGAGCGTAT TCAATGGCAC TATAGCCTTT AATGAGTTCA TAGATAAAGC TATAGTAGTT GGGCATAATG CTACTTTTAT TAATAGTATA ATACGTAACT TGATTGTTGA AAACGGTACA GTTACCTTAA TCAATTCTAA GGTGCAAAAT GTAAGTCTAG TCAATTCAGA AATAATAAAA ATAAATAGTA CTGTAGGCAA TAATGTAAAC TACATCACAA CGATTGGTAA TAATCATGCT AAGTCTAGTT ACCCGAGTTT AGACAGTGGT TCAATTTTGA CTATAGGAAT CGTACTAGAT ATTATAACTA TTATTGCATT GATCTTGATA AAGAGAAGAA AGAAGTTTAT TTAA
|
Protein sequence | MTWSIFLLIL ALSDIVLPLT ITNINNQSIT TLSPNYYLTV AIVFPPSNLT LLQQYVQEHV ILNQTQVEKL FIPTEEISKT LSQLRQSNIS ATSYMNVILA SGTVSQLEKA LNGKFYVYEL NGKRFFEFFG SPVIPNAIVI GTNITSLILN KPTTLYNVTQ AVAYNALKPS QLLYAYNISW LHAHNITGKG TAIGILDFYG NPYIQQQLQE FDKQYNIPNP PFFKIVPIGA YNPNNGISTG WAMEISLDVE YAHVIAPDAG IVLYVANPNI PLPAIIAYIV QQDEVNVVSQ SFGIPELYVD LGLIPRSYVN SLMYEYWLGE VEGISFAAAS GDAGGNGYNY FLAPQGSVIF PASIPYVLAV GGSSVYIGGN KTMETAWSGE SVLGASTGGY STLFPAPWYQ DSNGFRVVPD VVADANPYTG AFILYYYNQT YLVGGTSLAT PIVSGIIDLM TQSYGKLGFV NPFLYELRNT SALSPIGFGY NTPYYVNSSE LNPVTGLGSI NAGYLYQLLP KVIHSSSISV GVNNITYLDG QVVKVVANIT GIRPSSVIGI VYNGSSVVQQ FSLSFNGTYW VGEFVAEGSG IEEVIVKAGN LEGSTYVTIG YQAQFIFPPI ALFPEPEPVP IVVQLIYPNG SLVRNPSNLT ALIYKYDQMN NKMSIISSVQ LQRTSLINLS ILGIQIESSY LTGVYQLPSN IISGVYFIKI PNVFGFDEFV SGIYILDAVY PPVFTNPVVL SPGQNVTILA EALAIGSPNV TVTFYNISGN KVYSIPVNAI TYQNTLLYIT QITLPKLKPG YYYVVTKAIY NASNFTAEGV GLTQIYVSPY SLNVKVRIIP NNSIVYQNQQ IYVIANITYP NGTEVKYGSF SAIIVPSYLS SQFDNLQLQY SVPLTYINGS WIGQLEIPSG SSTNSLGYST YGISGYWDVY VEGISADGIP TNFPATLDVN TLSINPISPS SQFVVLPYVY VSVFNGTIAF NEFIDKAIVV GHNATFINSI IRNLIVENGT VTLINSKVQN VSLVNSEIIK INSTVGNNVN YITTIGNNHA KSSYPSLDSG SILTIGIVLD IITIIALILI KRRKKFI
|
| |