Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_2302 |
Symbol | |
ID | 4285727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 2508273 |
End bp | 2511152 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638141804 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_757532 |
Protein GI | 114570852 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.676415 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCAAT ATTTCAAATC CGGAGCTGCA GCGATTGCAC TCATGTCCGT GACCGGGGCC ATGACTTCTG CATACGCGCA AAATTCCGAC GAGAACGCGG GTGACGGGTA CAATTTCCTT CACCCGTATC GCGGAGACAT CAATCCGTTT TTCGGTGATA TCAATCCCTT CCGCGGCGAC ATCAACCCGT TCCGCGGCGA CATCAATCCA TTCTACGGCG ACATCTCGCC CTTCTGGGGT GACATCAATC CCTTCTGGGG TGACATCAAT CCCTTTGGCG GGGACATCAA TCCCTTCTTC AGTGACGACA TCAATCCGTT CTGGGGCGAC ATCAATCCGT TCGGCGACGA CATCAATCCC TTCTGGAGCG ACATCAACCC GTTCTGGCGG GATGTCGGCC CGGTCTGGGG TGATCTCAAT ACAGCGTGGA ATGAGGCGGC GGCCGGCGGG GGCGACTTCA CGATGATCGC CAATGACATG GCGTCAGTCA TTGCACGCGC CGAAAGCGTG TTTGGCGGCG CCATCCAAGC GCGCACCGGC CTGAGCATGG ACGAGGCCTT CCTTGCCGCC CTGCTGGCCC GCTTCGGTAT TGATCTGGAC AACCCGGACA GCCTGGCCTC GGTCAGTGCC GGCCAGCGGT CGGAATTCTT CCTGACCTTC TATGACAGCC TGATGGGCTT TTCCGGCGTC GATCATGTCG ATCACTGGAT GCCGGCCATC AATTGGTCGC CGGCCCTGGC GGCCAGCTAT GACTGGGGCC GCTCGCCGCT GGTCGGCGTG CTCGATTTCA GCGTCAACAC GGTCGAGGGC AGCAGCTTCC GCGGACAGCG CGGAGAGCGC GATTATCTCA ACGTCAATCA CGGCAATGCT GTCGCCAGCC TGATCGGCGC GCCGGTAGAT GGTGTCGGTG TCATGGGGGT CGCCCCGCAT GCGGCGATGC GTTTCTACAA TCCGTTTGAT GCATCCCATA CGGCCAGCTG GGACGATGTT GCCGCCGGGG TGGAATCCCT GGCCAATGGC GGCACCTCTG TCATCAACAT GTCGCTGGGC GTCCCCGGCT GGACCCTGCA CCAGGACTGG GCCGAGGTCT TCCGCCAGGA CCGGGTGGCG CGCCATGCCG ATGATCTGAC CTTCGTGGTC GCCGCCGGTA ATGACGGTGT CACGCAGACG GTTGATCTGG ACTGGACCGG TGTCTCGGTG CTCGACAATC TGCTGCTCGT TGGATCGGTC GACCCGAACG GCAATATCTC GTCCTTCTCC AATACTCCGG GCGAAGCCTG CTTGCTGACC AATGGCGTTT GCGAGACCGG TGCCCGCCTG ATGGACCGTT TCCTGGTTGC ACCGGGTGAG CTCATCCTCG TCGATGATGG CGAAGGCGGT GTGACCCGTG TGTCGGGGAC CTCCTTTGCG GCGCCGCTGG TGTCCGGTGC GGCGGCCCTG GTCAAGGGTT GGTGGTTTTG GCTCGATGGC AGTGAAGTGG CGGATGTCCT GTTGTTGTCG GCCCGTGATC TCGGCGAGCC GGGTGTGGAC GCGGTCTATG GCCACGGCAT GCTCGATGTG GCCGGTGCCA TGTCACCGCT GGATCCGGCC AATCTCTATG GCCTGGACAA ACGCAATGAT CCGGTCGAAG CGGCGGAATT GATCATCACG GGTGGTCGCC TGTCCCTGCG CCATTCCAAC AAGCATTACG TCACGGTGTT TGAGGATGTC GGCAACAGCT TCCGCGACTT CACGATCGCG GTGGACGATC TGATAGTCGG GAGTTCCCTT TCGGAATCGG TCGCCAATGC CTATGCCGAG CAATATATCT ACGAGCGGAC CAGCGCTTCG CTGACGGGCT CACACTTCTC GGATGTCTCC AGCGCGTCGC AAATCCTGTC CCAGCGCGGC AATCTGATGG TGACGGCTAC GGCCTCCACC CTGGATCCGT CCAATGTCGG CTATGCCCGT GATCTTGGCT TCCATGCCGG TGTCGAGCTG ACCGATACGG AAAGCGGCCG GTCCATGAAA TTCGGTGTCG GTGAGGGTGC CTTGGCCTTG TCCAGCCAGA CCGGTTTCAA TCTCTTCTCG GACCATCGTC CGGAAAGCGG TGGGGTGAAC CCGGTTCTCG GCTTCGCATC CGGCGGTGCC TATGCAGCCA GCACGTTCAA TATGGACGGC GATCTGCAGG TCTCCTTCGG TATCACGACG ACCCATGAAC AGGCGATCTA TGTGATGCCG GGTACTGGTG AGGAACAGGC CCTGTTTGAC GGTGTTGCGC CTTACCAGGC CTTCGCGGCC AATCTCGACC TCAGCTATCC GCTGGGTGAC CAGGTCACGA TCAATGGTTC ACTGACCCAG CTGCACGAGG CGACCGGTCT GCTTGGTGCC CAGGGTGGCA GTATTCTGGC CCTCGAGGGC GGTGCGGACA CGACCGCGCT GACCGTTGGT CTCGACGCCC GGCCATCCTC GCGCATCTCT CTGAGCGCCT CGATGACCAT GGCCCAGACA CGGACCACCG CCTTTGATGG TGGCCTGCTG GACATAGCCG AGCGTATCGA CAGTACGGCG GCACAGGTCT CGGTCCGTTA CGAGGCCCTG TTCTCGAACA ATGACGGTGT CCGCTTCAGC ATGGTGCAGC CGCTCCATAT CGAAAGCGGA GCCCTGAGCT ATACCGGCAT GGCGGTCACC GATCGCGAGA CCGGTGAGCT GGGTGTCCAG AGCGACACCT GGGAGCTGGG TGGCCGACGC CCGGTCTATG CCGAGGTCAT CTATGCAACC GAGCTGGGGT CATCGAACCG CCGCCTGAGC GTCTTCAGTC GCCAGCAATT GTCCGGCGAC GAGCAGGTTT CGGAGTTCTC GGCCGCGACC AGCGGTATGC GTTTCGAAAT GCGTTTCTGA
|
Protein sequence | MRQYFKSGAA AIALMSVTGA MTSAYAQNSD ENAGDGYNFL HPYRGDINPF FGDINPFRGD INPFRGDINP FYGDISPFWG DINPFWGDIN PFGGDINPFF SDDINPFWGD INPFGDDINP FWSDINPFWR DVGPVWGDLN TAWNEAAAGG GDFTMIANDM ASVIARAESV FGGAIQARTG LSMDEAFLAA LLARFGIDLD NPDSLASVSA GQRSEFFLTF YDSLMGFSGV DHVDHWMPAI NWSPALAASY DWGRSPLVGV LDFSVNTVEG SSFRGQRGER DYLNVNHGNA VASLIGAPVD GVGVMGVAPH AAMRFYNPFD ASHTASWDDV AAGVESLANG GTSVINMSLG VPGWTLHQDW AEVFRQDRVA RHADDLTFVV AAGNDGVTQT VDLDWTGVSV LDNLLLVGSV DPNGNISSFS NTPGEACLLT NGVCETGARL MDRFLVAPGE LILVDDGEGG VTRVSGTSFA APLVSGAAAL VKGWWFWLDG SEVADVLLLS ARDLGEPGVD AVYGHGMLDV AGAMSPLDPA NLYGLDKRND PVEAAELIIT GGRLSLRHSN KHYVTVFEDV GNSFRDFTIA VDDLIVGSSL SESVANAYAE QYIYERTSAS LTGSHFSDVS SASQILSQRG NLMVTATAST LDPSNVGYAR DLGFHAGVEL TDTESGRSMK FGVGEGALAL SSQTGFNLFS DHRPESGGVN PVLGFASGGA YAASTFNMDG DLQVSFGITT THEQAIYVMP GTGEEQALFD GVAPYQAFAA NLDLSYPLGD QVTINGSLTQ LHEATGLLGA QGGSILALEG GADTTALTVG LDARPSSRIS LSASMTMAQT RTTAFDGGLL DIAERIDSTA AQVSVRYEAL FSNNDGVRFS MVQPLHIESG ALSYTGMAVT DRETGELGVQ SDTWELGGRR PVYAEVIYAT ELGSSNRRLS VFSRQQLSGD EQVSEFSAAT SGMRFEMRF
|
| |