Gene Msed_0705 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0705 
Symbol 
ID5105311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp641934 
End bp645152 
Gene Length3219 bp 
Protein Length1072 aa 
Translation table11 
GC content50% 
IMG OID640506609 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001190804 
Protein GI146303488 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4934] Predicted protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.276754 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAAGG AACTTCTGCT CATAGCACTT ATCTTAAGCC AAGGAATACC GCTGTTTCAC 
ATGGGACAGG ATCAGGTATC AACCCTTCCT CCCTCTCAAC TGGTTACCGT GTCGATAGTC
GAAAAACCGC AGAACTTGGC GTTACTCCAG TTATACGTAC AGGAACACAA GGTACTCACC
AAGGATCAGG TTGAGTCACT CTTTGTCCCA ACCGAGAAAA TACAACAGCT AGTAAACTAC
CTACACGGGT ACGGGATTGC CACCAGCGTA TCACTGAACG TTATCACCGC AACGGGCACA
GTTTCTCAGT TCGAGAAGGC GTTAGGTGGC TCCTTTTACG TTGAAAAGTT TCACAACCTA
ACCTTTTATC AGTACGTTGA CGTAAGCTCG CCCCTAGTTT CCAACGCATT GGTGTTTTCA
ACCAACGTTA CTACCTCGCT CCTTCAGAGG CCAAGCACTC TCATCAATGT GACCCAGGCT
GTCGCGTTCT CCCAGGTAAC TCCATCACAA CTTAGGTATG CCTACAATGT AACCCCCCTC
CTTCATAAGG GAATAAACGG GACTAACGTA ACCATTGGGA TAGTGGACTT CTATGGGGAT
CCCTACATTC AGCAACAACT GCAGGAGTTT GACTCGAACT ATAACATAAG TAACCCCCCG
TTCTTTAAGG TCGAGAGCAT AGGGGCCTAT AACCCAAATG ACGGGATCTC CAGCGGATGG
GCACTTGAGA TTTCACTTGA CGTGGAGTAC GCCCACGTGC TAGCTCCAGG CGCTGGAATA
ATACTGTACG TAGCTAACCC CAACGCCTCC CTTCCGCAGG TCATTGCTTA CATAGACCAA
CAGGATCAGG TCTCAGTGGT TTCACAGAGC TTTGGCATCC CTGAGTTATA TGTGGCCCTA
GGATTGATAC CCCTGTCCAT GGTACAATCC CTCACCTACG AGTATTGGTT AGGCGAGGTT
GAGGGAATCA CGTTCGTGGC CAGTAGCGGT GACGCGGGAG GGAACGGGTA TAATTTCTAC
CTTTCTCCCC TTGGTAACCT GGTGCTCCCC GCCTCTGACC CCTATGTGCT CGCTGTTGGA
GGCACATCGG TGTACTACTC CAATGGAAGC GTCAAGCAGA CTGCATGGAG TGGCGAAAGC
TTGTTCGGGG CGTCCACTGG TGGATACAGC GTGATATTCC CCAGTCCGTG GTATCAGGGC
TCTCACGGTT TTAGGATGGT ACCAGACGTG GCTGCTGACG CCAATCCGTA TACGGGAGTT
CCAGTCACAT ACTATTACAA TATCTCTGAA CTGGTAGGGG GGACCTCAGT TGCCTCACCT
CTAGTAGCTG GAATTCTAGC TCTAGCTGTT CAAGTTCACG GCAAACTTGG CTTCATAAAC
CCACTGATCT ACTCACTGAA CGGTACTAAG GCCATTTCTC CAGTTGAGTA CGGATATAAC
ACCCCATACA TCGTGAATGG AACTCCAAAC CCTGTTACTG GGCTAGGTTA CATAAACGCA
GGTTACTTCG TGTCCATGAT AGAGCCAGGG AATGGGATAT CTGTGGCTGT CCAGAACACC
ACCTACCTAG ATGGTCAGGA GGTTAATGTT GTGGTGAAGG CACCCCCATC ACCTCAACCG
GTGGCTCAAA TCTACAATGG ATCTGCAATC GTGGGACAGG TTCCCCTCAC GTACAACGGG
ACTTACTGGG TTGGTCACTT CCAGGCTACA GGATCGGGAG TCGAGGAAGT AATAGTGACA
CAGGGCAACC TGGTGTCGGG TTCCTATTTC ACCGTAGGTC TACAGGCTCA GTACCTCCTT
CCACAGGTAG CACTCTACCC GAGCCCAGGA AACTTGCCGG TCCTGGTTCA CTTGACATAT
CCCAACGGGA CTACTGCACA TCCCAACTCC CAATTTTCTG CAAACCTATA CAGCTATAAC
CCTGAGACGG GACAGGAGAG ACTGATCTCC ACGATACAGC TTAGCCATCC TCTTGTCCTC
AACTTCAGCC AGTACGGGAT TACCATAACT AACGCATCCG ATTACGTGTT TGGAACCTTC
CCGTTGAATT CCTCCATGGT GAGCGGAATA TACCTCGTTA AGGTCCCAGG TACTTATGGG
TTCGACGAGA TAGTTGCAGG AATTTATGTT GTCCCCTACA TAGTTCCTGG GATAGCCACA
GAACCGCTAG TGGTTACCCC AGGCGAGAAC TTTACACTTG CTGTGTTCGC GGAGACCCTG
GGTTCGCCCA ATATAACGGT GAGCTTCGTT AAGGACGGAG TGTCCTACTT TAACACAACC
GTAAACTCCG TGGAGACATC GCTGGGACAG TTCTACGTTC AGGAGATCTC CCTGCCCAAG
GGAATACCCG CCGGGTACTA CGACGTCGTG GCGTATGCCA GTTACAATTT CAGCAATTAC
ACGGCGTCTG GGATTGGCTT GACGCAAATT TACGTTTCCC CTTCCCCTGT GCAGGTTACG
CTGAGCGGTC TACAGGAGAC CATGCTCCAG AACTCCACGC TGGTTATAAA CGCCTCCGTA
ACCTATCCCA ACGGAACTCC TGTGAAGTAC GGCACCTTCA CAGCCATCGT TATTCCCTCG
TATATGCAGG GCTCCTTCGA TACACTTGCG ATCTCCAACG CAGTTCCCCT GGAATACAGG
AACGGGTCGT GGATAGGTTA CTTCAACCTT CCCTCAGGTG GGGGTTCCAA CGGATTGGGT
CTATCACCCT ATGGACTGGC AGGTTCCTGG CAGATATATA TCGACGGGGT AACCTATGAC
GGGCATCCCA CTGCCCTTAA GTCTTCGCTT GACTACAGCA CGCTTACGGT AACGCCTCAA
CCCTACACTA ACTTCGTTCT CCTACCCTAC GTTCTAACTC CGACCTTTAA TGGGACCTCC
GGGTACAACC TTTACATACT TAACGCCACC ATAATAAACC ATAACGCAAC CCTGGTAAAC
TCGGTTATCT ACAATCTAAC TGCCGTGAAT GCAACAGTAA GCTTGATAAA TAGCCAGGTC
TTCCATTACA CGCTAACCAA CAGCACCCTC CTCAATAACA CCGCAATAAC TCCTGTTCAA
ATCCTGACCA CCAGTGTGGG TCACCACAGT GTCGTAATTC CCTCCACAGT TACGAAGAAC
GTTACATCAA CATCTTCTCA AAGCGGAACA GCTATGGTAG CGTTGGTAAT CCTCGCGTTT
GGCCTCGTCC TAGCTGTGTA TGTATGGAGA AGAAAATAG
 
Protein sequence
MIKELLLIAL ILSQGIPLFH MGQDQVSTLP PSQLVTVSIV EKPQNLALLQ LYVQEHKVLT 
KDQVESLFVP TEKIQQLVNY LHGYGIATSV SLNVITATGT VSQFEKALGG SFYVEKFHNL
TFYQYVDVSS PLVSNALVFS TNVTTSLLQR PSTLINVTQA VAFSQVTPSQ LRYAYNVTPL
LHKGINGTNV TIGIVDFYGD PYIQQQLQEF DSNYNISNPP FFKVESIGAY NPNDGISSGW
ALEISLDVEY AHVLAPGAGI ILYVANPNAS LPQVIAYIDQ QDQVSVVSQS FGIPELYVAL
GLIPLSMVQS LTYEYWLGEV EGITFVASSG DAGGNGYNFY LSPLGNLVLP ASDPYVLAVG
GTSVYYSNGS VKQTAWSGES LFGASTGGYS VIFPSPWYQG SHGFRMVPDV AADANPYTGV
PVTYYYNISE LVGGTSVASP LVAGILALAV QVHGKLGFIN PLIYSLNGTK AISPVEYGYN
TPYIVNGTPN PVTGLGYINA GYFVSMIEPG NGISVAVQNT TYLDGQEVNV VVKAPPSPQP
VAQIYNGSAI VGQVPLTYNG TYWVGHFQAT GSGVEEVIVT QGNLVSGSYF TVGLQAQYLL
PQVALYPSPG NLPVLVHLTY PNGTTAHPNS QFSANLYSYN PETGQERLIS TIQLSHPLVL
NFSQYGITIT NASDYVFGTF PLNSSMVSGI YLVKVPGTYG FDEIVAGIYV VPYIVPGIAT
EPLVVTPGEN FTLAVFAETL GSPNITVSFV KDGVSYFNTT VNSVETSLGQ FYVQEISLPK
GIPAGYYDVV AYASYNFSNY TASGIGLTQI YVSPSPVQVT LSGLQETMLQ NSTLVINASV
TYPNGTPVKY GTFTAIVIPS YMQGSFDTLA ISNAVPLEYR NGSWIGYFNL PSGGGSNGLG
LSPYGLAGSW QIYIDGVTYD GHPTALKSSL DYSTLTVTPQ PYTNFVLLPY VLTPTFNGTS
GYNLYILNAT IINHNATLVN SVIYNLTAVN ATVSLINSQV FHYTLTNSTL LNNTAITPVQ
ILTTSVGHHS VVIPSTVTKN VTSTSSQSGT AMVALVILAF GLVLAVYVWR RK