Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0851 |
Symbol | |
ID | 5105211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 782950 |
End bp | 786753 |
Gene Length | 3804 bp |
Protein Length | 1267 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640506756 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001190949 |
Protein GI | 146303633 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.444099 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAAAA CTTTTCTAGC TCAATTGACC CTATACCTCT TTATTTTTTC AGTTGTAACC CCGGCTTTAA TCAGTGTTGC GAACACTGGT ACCCTTGTCC ATGAAACACT ATCAACTAGA TTACCGAGTT TCAAGGAGAC CAACCTTTCT GGTAATACTC CCGTGATCGC CTCAATATAC ATCCCCTTGA GGAACTTAAA CCTTCTCTAT TACTACGCGG AGCAGGTATC AAACCCCGGG TCGCCCCTTT ACCACAAGTT CCTATCGCCC TCCCAGGTGA AGAATCTATT CTATCCTAGC GCGGAGTTTT CCAGTGTCAT GAATTACCTA GCCAGTCACC ATGTTCACGT AATGTTTACG GCCGCAGATT CTGTTATAGT GGTTCAGGGA ACCGCATCAC AACTCTCACA GGTGTTGGGT ATCCATTATG TTCTCATGAG CAACGGAACA ACTTACTACT ACACGGCAAT GGGAACCCCA AAGGTGCCTG GAATAGTTAT ATCAAGTAAT GTCTCAGCCC TTTTCTTCTC TCATCCATCA ACTCTTTTCA CTCAGAGCGA CGTTAACAAG CTTAGGGAGA CAGTTAATCA ACCAAATTTA ACTGCGCCCA TAGAGGCGTA TAAACTAACT GAGTTGAGAG GAGTCTACAA TGTTTCCTCC CTGATCAGCA AGGGAGTGAA TGGTACCAAC TACACTGTTG GAATCTTAGA TTTTTACGGT GATCCATACA TCCAGCAACA GTTAGCATAT TTCGACAAGA TATACAACGT TCCAGCTCCT CCTAACTTCT CGATTATACC CATTGGTCCC TACGATCCCA ACCTAGGCAT TACCCAGGGG TGGGCGGGTG AGATAAGCTT AGATGTGGAG TCGGCCCATG CCATGGCACC AGGTGCAAAT ATTGTGCTTT ACATAGCTAA TGGGAATTTG CCCCTTTCCT CCGTCATTGC CGAGATAGTG TCCCAGGACA AAGTGGATAC GCTTTCCCAG AGCTTTTCGA TACCTGACGA GTTCATACCT GAGTTTTCGG GTTCGCTGTT TTATCAGTGC GTCGTTCTCA CTGATCAATA CTATGCCATG GGTAATGCCG AGGGAATAAC GTTCCTAGCT TCCAGCGGAG ACGGTGGGGG TAGCGGATAC AGTGCAGGTC CCCTAGGTTC CGTGGGCTAT CCAGCGAGCT CGCCCTTTGT CACGGCCATG GGTGGTACCA CAACCTACAT AACCTTTGGA GGATTCTCCT TCAATGTTAC AGCATGGTCC AATTATGGTT TTGTGCCTCC AGGCGTGAAT TACGGAGGTA GTACTGGAGG GATTAGTCAG GTCGAACCAA AGCCCTACTA TCAGTGGGGG CTTACTACCC CGAAGGGTTT TCCCAACGGT AAGGAAATTC CCGATATCTC AGCCAACGCC GATGTTTACC CTGGAATCTA CATTGTGTGT CCAGGGAACG TAACTGCAAT CTCTGGAGGC ACGAGCGAGG CCTCACCTTT AACTGCCGGG CTTCTCACCT TGGTGATGCA GTATACCCAC TCCAAGCTCG GGAACATAAA CCCTGATCTT TACTATCTCG CAAAAGCAGA TTATCAGAAG GCCTTCTACC CTATTACCTT CGGTTATAAC ATACCGTGGG TTGCGTCCTA CGGCTACAAC CTAGTCACGG GATGGGGACA ACTCAACGTT GGAGAGTTCG CCTCCCTAGT GAAGGATGTA CCCAGTTCCC TATCCATCAT GGTTAACGTG TCCAATACCA CTATCCTTCC AGGACAGACC CTAGGTGTAA AGGCTAACGT CACCCTCAAT GGGCAGGCGG TGAATAGCGG TAGCTTTTAC GTTACGTTGG AAACGGTTAG TGGGAACGTC ACTACGGTGA AGCTAAACGA CGAGGGATCA GGTATCTGGT CAGCTAGTCT AGGTGTACCT GAGAACGATA CCGGTATCAC TTTTGTGACT GTGTGGGGAG AGTCCAATGG GACTAGCGGA TACGGAATTG TGGAGACATA TTCCGGTTAC TTCGTTCAGT TCCTATTCCC AGCTCCCTAT CAGGTATGCT GGACTGGCTC GGGCATAAAC GTTGTGGTCA ACGTTACAAG TCCCTCTGGA GGGATTGCCC CCAATACCAC AGTCCTGAAT CTAAACGTGT ATTCGTATAA CGTAACCAAT AACTCCTTTA CCCTCGTCAA TCAAACCTCC CTAACCTTCA ACCCCGTGTT TAATGCGTGG AGTGGTATGA TACAGGGCAA TTTCCCCGCT GGTCCTCTTC TTCTACAGGT GGTTAATGCC TACGGCTACG ACGCAATCTT TAACGGGATA GGCCTCAACT CTTTCTTCAT CCTTTCCCCA ACCATAGCCG AGCCCGGCAC CGTCTATCCA GGTCAGGACA TCATTATCGA GGGAGGGCTG ACTCCACCCA CTAACCTAGT GTCGTCAGCT CCTAGGACGG CAAGCGACCT GATGACCGGT TCCAACGTCA CCGCGGAACT TCTATATAAC GGTCAAGTGA TATCTAAGAC TCAGGTCCTC TACACGGGGA CGACCTATCT AGGTTACCTG AGAGTTCCAG AAAACGCCAG CCCCGGACTC TACACTATCC TCCTCTTCGC GTCCTACGAC TCGTATACCC TGAACGAGAC AATTCCTGGG TTTTACTACG GTCAGATCTA CGTTGGATCC AGGGTTGTGA CGCCACTAAA CTTCTCTAAT ACCTACGTGA TCCAGGGATC TACACTCTAC ATTTACAGCA ATATAACAGC TAATGGGAAG GTTGTCAAGT ATGGAATGTT CTCCGCCACC GTGTTCCCTA CACTCCTTAC CCAAGAGTAC TCGTTGATCT CGACTGTGCT AGAGGTTCCA CTGTGGTATA ACTCCACTAG CGGGTTGTGG ACCGGAAACG TTACCCTGCC CTCAACGTTA TCCCTAGGTA ACTTGACCTA TCTAGGTAAT TCCTACTTTG CTGAGCCCTT TAAGGTTCTA GTCACTGGAG TTTCAGCCTA CGGGGGTGTC ACATCTACCA ACGTCTCCAA TTCGAGAGAA TTCTTCGTGG AACCCTACAC CCTGATCACT AACGATCCGT CATATACAGC TGTCCAGACT TATGATTCAG CGTTCCAGAA CGACACGCTT ACCGTTAATG GGAACCTGGT GAACGATCTC TTCCTGGGGA ATAATACCAT AATTAATAGT CACGTAACAA TTACCCTATC CAACGTTACT GGAACGCTTA TACTGAGGAA CACTCAGGCC ACGCTGGTAA ATGTGTTGGC CAATAAACTA GTCCTGATTA ACAGTACTGT GAGGCTAATC GATTCCCAGG TCCAAGACCT AGTGGCCCTC TCCTCCGTGG TATCACCTAT CCAAACTAGA ATAACCAGCA TAACTCCGGG ACCTCCCATA ATACAGCTGG GAATTGCCCC CTATCAGAAC ATCACTGGAA ACGTGACAAT TTCAATCACG GTACAGGGAC AGGACGTGGA ACAGGTCCTG GTGTATCTAG ACGGACAGTT GCTAGCAAGT TTTCAAGGTA ATGGAACTCA CGAAGTTAAC CTTGACACCC TCAAATACGC TGACGGGACG CACGAGATTA GCGTTACGGC TAATCAGGCT GACGGTCTTA ACTCGACGGT AACCACGAAC GTGGTATTTG AGAACCAGCT ACAATCCGTG TCGCAGAAGG TTTCAAATCT CAACGAGACA CTAACTCAGG GAATTTCGAC GGCTCACAGT ACGGCAAATG TAGGGGAGAT AGTGGCAATA GTTGCCCTAA TTCTGGCGAT CGTGGGTATA GCCCTGGACT TTAGAAGGAG ATAG
|
Protein sequence | MGKTFLAQLT LYLFIFSVVT PALISVANTG TLVHETLSTR LPSFKETNLS GNTPVIASIY IPLRNLNLLY YYAEQVSNPG SPLYHKFLSP SQVKNLFYPS AEFSSVMNYL ASHHVHVMFT AADSVIVVQG TASQLSQVLG IHYVLMSNGT TYYYTAMGTP KVPGIVISSN VSALFFSHPS TLFTQSDVNK LRETVNQPNL TAPIEAYKLT ELRGVYNVSS LISKGVNGTN YTVGILDFYG DPYIQQQLAY FDKIYNVPAP PNFSIIPIGP YDPNLGITQG WAGEISLDVE SAHAMAPGAN IVLYIANGNL PLSSVIAEIV SQDKVDTLSQ SFSIPDEFIP EFSGSLFYQC VVLTDQYYAM GNAEGITFLA SSGDGGGSGY SAGPLGSVGY PASSPFVTAM GGTTTYITFG GFSFNVTAWS NYGFVPPGVN YGGSTGGISQ VEPKPYYQWG LTTPKGFPNG KEIPDISANA DVYPGIYIVC PGNVTAISGG TSEASPLTAG LLTLVMQYTH SKLGNINPDL YYLAKADYQK AFYPITFGYN IPWVASYGYN LVTGWGQLNV GEFASLVKDV PSSLSIMVNV SNTTILPGQT LGVKANVTLN GQAVNSGSFY VTLETVSGNV TTVKLNDEGS GIWSASLGVP ENDTGITFVT VWGESNGTSG YGIVETYSGY FVQFLFPAPY QVCWTGSGIN VVVNVTSPSG GIAPNTTVLN LNVYSYNVTN NSFTLVNQTS LTFNPVFNAW SGMIQGNFPA GPLLLQVVNA YGYDAIFNGI GLNSFFILSP TIAEPGTVYP GQDIIIEGGL TPPTNLVSSA PRTASDLMTG SNVTAELLYN GQVISKTQVL YTGTTYLGYL RVPENASPGL YTILLFASYD SYTLNETIPG FYYGQIYVGS RVVTPLNFSN TYVIQGSTLY IYSNITANGK VVKYGMFSAT VFPTLLTQEY SLISTVLEVP LWYNSTSGLW TGNVTLPSTL SLGNLTYLGN SYFAEPFKVL VTGVSAYGGV TSTNVSNSRE FFVEPYTLIT NDPSYTAVQT YDSAFQNDTL TVNGNLVNDL FLGNNTIINS HVTITLSNVT GTLILRNTQA TLVNVLANKL VLINSTVRLI DSQVQDLVAL SSVVSPIQTR ITSITPGPPI IQLGIAPYQN ITGNVTISIT VQGQDVEQVL VYLDGQLLAS FQGNGTHEVN LDTLKYADGT HEISVTANQA DGLNSTVTTN VVFENQLQSV SQKVSNLNET LTQGISTAHS TANVGEIVAI VALILAIVGI ALDFRRR
|
| |