Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0705 |
Symbol | |
ID | 5105311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 641934 |
End bp | 645152 |
Gene Length | 3219 bp |
Protein Length | 1072 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640506609 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001190804 |
Protein GI | 146303488 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.276754 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAAGG AACTTCTGCT CATAGCACTT ATCTTAAGCC AAGGAATACC GCTGTTTCAC ATGGGACAGG ATCAGGTATC AACCCTTCCT CCCTCTCAAC TGGTTACCGT GTCGATAGTC GAAAAACCGC AGAACTTGGC GTTACTCCAG TTATACGTAC AGGAACACAA GGTACTCACC AAGGATCAGG TTGAGTCACT CTTTGTCCCA ACCGAGAAAA TACAACAGCT AGTAAACTAC CTACACGGGT ACGGGATTGC CACCAGCGTA TCACTGAACG TTATCACCGC AACGGGCACA GTTTCTCAGT TCGAGAAGGC GTTAGGTGGC TCCTTTTACG TTGAAAAGTT TCACAACCTA ACCTTTTATC AGTACGTTGA CGTAAGCTCG CCCCTAGTTT CCAACGCATT GGTGTTTTCA ACCAACGTTA CTACCTCGCT CCTTCAGAGG CCAAGCACTC TCATCAATGT GACCCAGGCT GTCGCGTTCT CCCAGGTAAC TCCATCACAA CTTAGGTATG CCTACAATGT AACCCCCCTC CTTCATAAGG GAATAAACGG GACTAACGTA ACCATTGGGA TAGTGGACTT CTATGGGGAT CCCTACATTC AGCAACAACT GCAGGAGTTT GACTCGAACT ATAACATAAG TAACCCCCCG TTCTTTAAGG TCGAGAGCAT AGGGGCCTAT AACCCAAATG ACGGGATCTC CAGCGGATGG GCACTTGAGA TTTCACTTGA CGTGGAGTAC GCCCACGTGC TAGCTCCAGG CGCTGGAATA ATACTGTACG TAGCTAACCC CAACGCCTCC CTTCCGCAGG TCATTGCTTA CATAGACCAA CAGGATCAGG TCTCAGTGGT TTCACAGAGC TTTGGCATCC CTGAGTTATA TGTGGCCCTA GGATTGATAC CCCTGTCCAT GGTACAATCC CTCACCTACG AGTATTGGTT AGGCGAGGTT GAGGGAATCA CGTTCGTGGC CAGTAGCGGT GACGCGGGAG GGAACGGGTA TAATTTCTAC CTTTCTCCCC TTGGTAACCT GGTGCTCCCC GCCTCTGACC CCTATGTGCT CGCTGTTGGA GGCACATCGG TGTACTACTC CAATGGAAGC GTCAAGCAGA CTGCATGGAG TGGCGAAAGC TTGTTCGGGG CGTCCACTGG TGGATACAGC GTGATATTCC CCAGTCCGTG GTATCAGGGC TCTCACGGTT TTAGGATGGT ACCAGACGTG GCTGCTGACG CCAATCCGTA TACGGGAGTT CCAGTCACAT ACTATTACAA TATCTCTGAA CTGGTAGGGG GGACCTCAGT TGCCTCACCT CTAGTAGCTG GAATTCTAGC TCTAGCTGTT CAAGTTCACG GCAAACTTGG CTTCATAAAC CCACTGATCT ACTCACTGAA CGGTACTAAG GCCATTTCTC CAGTTGAGTA CGGATATAAC ACCCCATACA TCGTGAATGG AACTCCAAAC CCTGTTACTG GGCTAGGTTA CATAAACGCA GGTTACTTCG TGTCCATGAT AGAGCCAGGG AATGGGATAT CTGTGGCTGT CCAGAACACC ACCTACCTAG ATGGTCAGGA GGTTAATGTT GTGGTGAAGG CACCCCCATC ACCTCAACCG GTGGCTCAAA TCTACAATGG ATCTGCAATC GTGGGACAGG TTCCCCTCAC GTACAACGGG ACTTACTGGG TTGGTCACTT CCAGGCTACA GGATCGGGAG TCGAGGAAGT AATAGTGACA CAGGGCAACC TGGTGTCGGG TTCCTATTTC ACCGTAGGTC TACAGGCTCA GTACCTCCTT CCACAGGTAG CACTCTACCC GAGCCCAGGA AACTTGCCGG TCCTGGTTCA CTTGACATAT CCCAACGGGA CTACTGCACA TCCCAACTCC CAATTTTCTG CAAACCTATA CAGCTATAAC CCTGAGACGG GACAGGAGAG ACTGATCTCC ACGATACAGC TTAGCCATCC TCTTGTCCTC AACTTCAGCC AGTACGGGAT TACCATAACT AACGCATCCG ATTACGTGTT TGGAACCTTC CCGTTGAATT CCTCCATGGT GAGCGGAATA TACCTCGTTA AGGTCCCAGG TACTTATGGG TTCGACGAGA TAGTTGCAGG AATTTATGTT GTCCCCTACA TAGTTCCTGG GATAGCCACA GAACCGCTAG TGGTTACCCC AGGCGAGAAC TTTACACTTG CTGTGTTCGC GGAGACCCTG GGTTCGCCCA ATATAACGGT GAGCTTCGTT AAGGACGGAG TGTCCTACTT TAACACAACC GTAAACTCCG TGGAGACATC GCTGGGACAG TTCTACGTTC AGGAGATCTC CCTGCCCAAG GGAATACCCG CCGGGTACTA CGACGTCGTG GCGTATGCCA GTTACAATTT CAGCAATTAC ACGGCGTCTG GGATTGGCTT GACGCAAATT TACGTTTCCC CTTCCCCTGT GCAGGTTACG CTGAGCGGTC TACAGGAGAC CATGCTCCAG AACTCCACGC TGGTTATAAA CGCCTCCGTA ACCTATCCCA ACGGAACTCC TGTGAAGTAC GGCACCTTCA CAGCCATCGT TATTCCCTCG TATATGCAGG GCTCCTTCGA TACACTTGCG ATCTCCAACG CAGTTCCCCT GGAATACAGG AACGGGTCGT GGATAGGTTA CTTCAACCTT CCCTCAGGTG GGGGTTCCAA CGGATTGGGT CTATCACCCT ATGGACTGGC AGGTTCCTGG CAGATATATA TCGACGGGGT AACCTATGAC GGGCATCCCA CTGCCCTTAA GTCTTCGCTT GACTACAGCA CGCTTACGGT AACGCCTCAA CCCTACACTA ACTTCGTTCT CCTACCCTAC GTTCTAACTC CGACCTTTAA TGGGACCTCC GGGTACAACC TTTACATACT TAACGCCACC ATAATAAACC ATAACGCAAC CCTGGTAAAC TCGGTTATCT ACAATCTAAC TGCCGTGAAT GCAACAGTAA GCTTGATAAA TAGCCAGGTC TTCCATTACA CGCTAACCAA CAGCACCCTC CTCAATAACA CCGCAATAAC TCCTGTTCAA ATCCTGACCA CCAGTGTGGG TCACCACAGT GTCGTAATTC CCTCCACAGT TACGAAGAAC GTTACATCAA CATCTTCTCA AAGCGGAACA GCTATGGTAG CGTTGGTAAT CCTCGCGTTT GGCCTCGTCC TAGCTGTGTA TGTATGGAGA AGAAAATAG
|
Protein sequence | MIKELLLIAL ILSQGIPLFH MGQDQVSTLP PSQLVTVSIV EKPQNLALLQ LYVQEHKVLT KDQVESLFVP TEKIQQLVNY LHGYGIATSV SLNVITATGT VSQFEKALGG SFYVEKFHNL TFYQYVDVSS PLVSNALVFS TNVTTSLLQR PSTLINVTQA VAFSQVTPSQ LRYAYNVTPL LHKGINGTNV TIGIVDFYGD PYIQQQLQEF DSNYNISNPP FFKVESIGAY NPNDGISSGW ALEISLDVEY AHVLAPGAGI ILYVANPNAS LPQVIAYIDQ QDQVSVVSQS FGIPELYVAL GLIPLSMVQS LTYEYWLGEV EGITFVASSG DAGGNGYNFY LSPLGNLVLP ASDPYVLAVG GTSVYYSNGS VKQTAWSGES LFGASTGGYS VIFPSPWYQG SHGFRMVPDV AADANPYTGV PVTYYYNISE LVGGTSVASP LVAGILALAV QVHGKLGFIN PLIYSLNGTK AISPVEYGYN TPYIVNGTPN PVTGLGYINA GYFVSMIEPG NGISVAVQNT TYLDGQEVNV VVKAPPSPQP VAQIYNGSAI VGQVPLTYNG TYWVGHFQAT GSGVEEVIVT QGNLVSGSYF TVGLQAQYLL PQVALYPSPG NLPVLVHLTY PNGTTAHPNS QFSANLYSYN PETGQERLIS TIQLSHPLVL NFSQYGITIT NASDYVFGTF PLNSSMVSGI YLVKVPGTYG FDEIVAGIYV VPYIVPGIAT EPLVVTPGEN FTLAVFAETL GSPNITVSFV KDGVSYFNTT VNSVETSLGQ FYVQEISLPK GIPAGYYDVV AYASYNFSNY TASGIGLTQI YVSPSPVQVT LSGLQETMLQ NSTLVINASV TYPNGTPVKY GTFTAIVIPS YMQGSFDTLA ISNAVPLEYR NGSWIGYFNL PSGGGSNGLG LSPYGLAGSW QIYIDGVTYD GHPTALKSSL DYSTLTVTPQ PYTNFVLLPY VLTPTFNGTS GYNLYILNAT IINHNATLVN SVIYNLTAVN ATVSLINSQV FHYTLTNSTL LNNTAITPVQ ILTTSVGHHS VVIPSTVTKN VTSTSSQSGT AMVALVILAF GLVLAVYVWR RK
|
| |