Gene Information |
Locus tag | Rmar_0744 |
Symbol | |
ID | 8567382 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | - |
Start bp | 865171 |
End bp | 868449 |
Gene Length | 3279 bp |
Protein Length | 1092 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_003290030 |
Protein GI | 268316311 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0783948 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGTCC AGCCCGGCGT TGTCGTTGTC AAGTTTGAAG CGCCGATCAC GCTGCAGGCC GGGAAAACCG GCCGTCCGAT GCTGGACCGC ACGCTGGCCC GTTTCGAGCC CGTTGTGCTG GAGCCGGCCT TTCCTTTTCT GGAGCAGGCG GCCCGGAAAC GTCCGCATCC CGCGCTGGAC CGTCTGCGCA CCATCTATCT ATTGCGCTAC AACCGTCCGA TCTCGCCCTG GCGCGTGGCG GCCGAGTTGA GCCGCCTGCC CGGCGTCGTG TACGCCGAGC CGCTGCCGAT CCGGCAGATC GTGGAGGTCC CCAACGACTC GCTTTACCCA CAGATGACCC ACCTGCCACG CATTCAGGCG CCGGAAGCCT GGGACGTGGT CAAGGGCGAA CAGGGCGACG TGGTCATCGC CATCGTGGAC GGCGGCACCG ACTGGCGTCA TCCCGATCTG ATCGACAACG TGTGGACCAA CCCGGGTGAG ATCCCTGACA ACGGCATCGA CGACGACGGC AACGGCTTCG TCGACGACGT ACACGGCTGG AATTTTGCCA ACGATACGCC CGATCCTTCC GGACTTTCGG CCACGCCGCT CAACGCGGCG CACGGCACCC AGGTAGCCGG CGTGGCCGCC GCCGTCACGA ACAACAATCG GGGCGTGGCG GGCAGTAGCT GGAACGCCCG CTTTATGCCG ATCAATGCGA GCTGCGCTGA CACGGATCGC AGCATCTGCT ACGGCTATCA GGGAATCGTG TACGCTGCCC TGAACGGCGC GCAGGTGATC AATGCGAGCT GGGGCGGTCC CGGTCTTTCC AGACTGGAAG CCGACGTGGT CGAATTCGCC ACCGATCTGG GCAGCCTCAT CGTGGCGGCC GCCGGCAACG ACAGCGGCGA CAACGACCGC GTGCCGTTCG GTCCGGCCAG CCATCCGCGC GTGCTCTCGG TGGGCGCCAC CAACAAAGAC AACGACGGCA AGGCCAGCTT TTCGAACTAC GGTCGCAGCG TGAACGTCTT CGCGCCGGGC GTCAACCTAA ACAGCACGCT GCCCAACGGC CGCTACACGG GATCGGCCAG CGGCACCTCG TTCGCCAGTC CCCTGACGGC TGGCATCGCC GCCCTGGTGC GGACGCGCTT TCCCGAATAC ACGCCCGATC AGGCCCGCGA ACAGATCCGC CTGACCGCCG ACCCGATCGA CGCCGTCAAT CCGGGCTTTT CCGGACGCCT GGGCCGCGGC CGCATCAACG CCTTCCGCGC CGTCACGGAA ACCGGCTTTC CGGCCATCCG CCTGGTCGAT CTCGATGTGA CCGATAGCGA CGCCGACGGC TACCTCGAAA GCGGCGAGAC CGTCCAGCTC ACCGCCCGCT TCACGAATCA TCTGGCCCCG GCCACCGGGG TGCAGTTTCA GTTGAGCGCC GACGCCGACT ACCTCACCAT CCTGCAGGGC GCAGCTCAGG TGTCGCAGCT CGATCCGGGC GATACCGTGC TGGTGACGTT TTCCTTCAGC ATTGCCTCCG ATGCCCCGCA AAACCGCACG GCGATTTTTC TGGCCGACAT CCAGGCCGAT GGCGGCTACG CCGACCGCGA TCTGTTCCGC CTGGTGATCA ATCCTGAGCA AACCGCCACG CTGGCAACCG GACGCATCCA GACGTCGATC ACCACCACCG GCAACCTCGG CTGGACGGGC TTTGCCGGAG AGTCGAGCGG CGTGGGCTTC GCGCTGGACG GGCACAATTT GCTTTTTGAA GGGGGCCTAT TGGCCGGAAT CTCACCGCAG TTCGTCTCTG ATGCCGTCCG AGGCGAAGAC 
GGCGAGACCC AGCACCGCGA TTTTCAGCCG GTCGAGGGCA GTAGCCTCGA AGTCATCGCA CCGGGACGCT TTACGGCCCA GCAGGGCACC ATCGAACTGA CCGACCGGGC AGCGCCCTTC CCGCTGCACA TCAACGTACT GCAGGAAACC TACGCCGATA CGGTTCCGGG GCGCCAGCTC TTTGTGATCG TCCACTACAC CATTGAGAAT ACGCGCACCA TAACGCTTTC TCCGCTGTAT GCCGGAATTT TTCTGGACTG GGACCTGAAT CCGGACGCCC AGGACTATGC CCGCTATGAC CCGACGCGTC GCCTGGGCAT CGTGCAGGAT AAAAGCACCA ACCCCGACAC GCTGGCGGCC ATTCGCCTGC TGACGCCGGC CCCCTTCTCC TACCGGGCGA TCGACAATCC GACGGAACTC TACGACGGCT TCACGCAGAG CGAAAAGTGG AGCGCACTCT CGGGCGGCCT GCAGCGGACG CATCTCAGCA ATACCGACGT GTCGCAGCTC ATGGCGGCGG GCCCCTTCCG ACTCGATCCG GGCTGCCGCA TTCCGGTAGC CTTTGCCATT CTGGCGGCCG CCGATGCCGA CACGCTCGTG CAGGCCGCCG ACGAAGCGCA GCGGTTCTGG GACGAGGTCA TCCGGCCTTC CATTCCCAAC GAGCCGCCGG CCTTCGTGTC CGTGCCCGAT ACGCTGGTCG TCCGTGAAGG CGAGGCGCTC AACTGGCAAT TTACGGCCAC CGATCCGGAC GCCTGCGCCT CGCTCAGCTT CCGGGTACTG GAAGGACCGG ACGGGTTCTC GGTGGATCCC TTGACCGGCC AGGTCCGGTT CGTGCCCGGC TTCAATCAGG CCGGCATTTA CACGGTACGT CTGCTGGTCA CAGATGGTCT GGCCACCGAC ACGGCCCGAA CCGTGCTCGT CGTGCAGGAT ACCAACAGCC CGCCAACCTT TGTGGCTGTC CTCACCGACA CGGTGCTCGT GGTAGGGCGG ACGTTCCGCT ATCAATTCCG CGCCGAGGAT CCCGAGGGCG ATCCGCTGAC CTACACGCTG GTTGAAGCGC CGGCCGGCGC CACCATCGAT CCTCAGAGCG GTCAGTTTAC GTTCACCCCG CAGGAAGTCG GCCAGTACAC GGTAGTCGTA GCCGTCAGCG ACGGCACGTT CACGATCGAA ACGCCGCGTA TTCACCTGGA GGTGATTCCG GCCGAGGCCG GCGTGCAGGT CTACCTGCCT TCCGGCGGTG GCAACGTCAT TCAGATCGTG TACGACGTGC CCGATCCCGA ACCCGTGCGC CTGATGATCT ACGACCTGCT GGGGCGGCGG GTGCGCCGGC TGGTGGACGG CGTGCCGGGC ACCGGCCGCC ATACCATCAC CTGGGACGGC CACAGCGATG CGGGGATCGA GGTGGCCTCG GGCCTGTACT TCGTCCGCCT GGAGATCGGC GGCAAAGCGG AGACCCGCCC GCTCGTTTAC GTGCGCTGA
|
Protein sequence | MPVQPGVVVV KFEAPITLQA GKTGRPMLDR TLARFEPVVL EPAFPFLEQA ARKRPHPALD RLRTIYLLRY NRPISPWRVA AELSRLPGVV YAEPLPIRQI VEVPNDSLYP QMTHLPRIQA PEAWDVVKGE QGDVVIAIVD GGTDWRHPDL IDNVWTNPGE IPDNGIDDDG NGFVDDVHGW NFANDTPDPS GLSATPLNAA HGTQVAGVAA AVTNNNRGVA GSSWNARFMP INASCADTDR SICYGYQGIV YAALNGAQVI NASWGGPGLS RLEADVVEFA TDLGSLIVAA AGNDSGDNDR VPFGPASHPR VLSVGATNKD NDGKASFSNY GRSVNVFAPG VNLNSTLPNG RYTGSASGTS FASPLTAGIA ALVRTRFPEY TPDQAREQIR LTADPIDAVN PGFSGRLGRG RINAFRAVTE TGFPAIRLVD LDVTDSDADG YLESGETVQL TARFTNHLAP ATGVQFQLSA DADYLTILQG AAQVSQLDPG DTVLVTFSFS IASDAPQNRT AIFLADIQAD GGYADRDLFR LVINPEQTAT LATGRIQTSI TTTGNLGWTG FAGESSGVGF ALDGHNLLFE GGLLAGISPQ FVSDAVRGED GETQHRDFQP VEGSSLEVIA PGRFTAQQGT IELTDRAAPF PLHINVLQET YADTVPGRQL FVIVHYTIEN TRTITLSPLY AGIFLDWDLN PDAQDYARYD PTRRLGIVQD KSTNPDTLAA IRLLTPAPFS YRAIDNPTEL YDGFTQSEKW SALSGGLQRT HLSNTDVSQL MAAGPFRLDP GCRIPVAFAI LAAADADTLV QAADEAQRFW DEVIRPSIPN EPPAFVSVPD TLVVREGEAL NWQFTATDPD ACASLSFRVL EGPDGFSVDP LTGQVRFVPG FNQAGIYTVR LLVTDGLATD TARTVLVVQD TNSPPTFVAV LTDTVLVVGR TFRYQFRAED PEGDPLTYTL VEAPAGATID PQSGQFTFTP QEVGQYTVVV AVSDGTFTIE TPRIHLEVIP AEAGVQVYLP SGGGNVIQIV YDVPDPEPVR LMIYDLLGRR VRRLVDGVPG TGRHTITWDG HSDAGIEVAS GLYFVRLEIG GKAETRPLVY VR
|
| |
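The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon; both assumptions are consistent with the values shown but are not stated by the record. The function names are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code, so a standard codon table is built here. The gene sequence is used as printed (it already begins with ATG), so no reverse complement is applied even though the strand is "-".

```python
# Values copied from the record above.
START_BP = 865171
END_BP = 868449
GENE_LENGTH_BP = 3279
PROTEIN_LENGTH_AA = 1092

# Codon table in NCBI base order TCAG. Table 11 shares these
# amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    dna = "".join(dna.split()).upper()
    residues = []
    # Any trailing partial codon is ignored.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    # First two 10-base groups of the gene sequence field.
    print(translate("ATGCCTGTCC AGCCCGGCGT"))
```

Under these assumptions, End bp - Start bp + 1 reproduces the listed Gene Length, and dropping one stop codon from the codon count reproduces the listed Protein Length. Passing the full Gene sequence field to translate() would allow a comparison with the Protein sequence field.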