Gene Information |
Locus tag | Mmar10_0942 |
Symbol | |
ID | 4284867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 1037524 |
End bp | 1039044 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638140410 |
Product | protease Do |
Protein accession | YP_756173 |
Protein GI | 114569493 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
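The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1037524
end_bp = 1039044

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1521, matching "Gene Length"
print(protein_length)  # 506, matching "Protein Length"
```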
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0681927 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.773505 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTTCGG TTGATCGCAT ACGCCGCCTC GGCGGAGTTT CGCTTCTGGT CCTGTCCGCG CTGGCAGCAG GAAGTCTCCT GGATCGCTCG ATGGGCGAGG CCTACGCCGT CCAGTCGTCC GAGGCGCCGC CCCTGGCGGC TGCCGTCCCG GCCGGCGCAC CGCTGTCCTT TGCCGACCTG ATCGAAACCG TCAGCCCGTC CGTGGTCACG GTTCAGGTCA GCGGACTGGT CGAGAGCTCG CCCTTTGGCG GCGGCAATGG ACCCGACCTC GACAATCTCC CCCCGCAAAT GCGCGAATGG ATGGAGCGCC AGTTCGGCGG CCAGCGCCAG GCGCCGCAGC CGCAACCGCG CCAGTCGCTG GGATCCGGCT TCTTCATCTC GGCTGACGGA TATCTGGTGA CCAATCACCA TGTGGTGGCC AATGCCGACG AGATCACCAT CGGAACGGCC GAGGGCGAGG AGTTTCCTGC CCGCGTCATT GGTACCGATC CGCAGACCGA CCTGGCGCTG CTCAAGGTCG ATGGCGAGAC TGATTTCCCG TTTGTGCGGC TGGAAGAGAA CCCGAACTAC CGGGTTGGCG ACTGGGTCGT CGCGGTCGGC AATCCCTTCG GTCTCGGCGG TACGGCAACA GCCGGTATCA TCTCGGCCAT CGGTCGTCCG ATCGGCAATT CCACCTATAA TGACTTCATC CAGACCGACG CCTCGATCAA TCGCGGCAAT TCCGGCGGCC CGACCTTTGA CCTCAACGGC AATGTGATCG GCGTGAACTC GCAAATCTTC TCGCCGTCTG GCGGCAATGT CGGCATCGGC TTTGCCATTC CCTCCGACGT CGCGGCCCGC ATCGTCGGCG ATCTGCGCGA TGATGGCCGG GTGGCGCGCG GCTGGCTGGG TGTCTCGATC CAGAATGTCA CCGAGGACAT TGCCGAAGCG CTGGGCCTTG AGGGCACGAC CGGCGCCATC ATCAGCTCGA TCGTCGAGGG CGGCCCCGCC GACCGCGCCG GTTTCGAGCG CGAGGATGTG GTGCTGGAAA TCGATGGCGA GGCCGTTGAC GGTTCGCGCG ACCTGACCCG CCGCGTCGGC AATATCCAGG CCGGCGGCGA TGTCCGCTTC CTGGTGCTGC GTGACGGCCG CGAGCGGACC ATCCGTGCCA CGCTGGGTGA TCGCCCGGGC GAGGAACAAC TGGCCAGCAT GAGCAGTGTT GATGCGGCTC CGGCACGGAC TTCCGTGTTC GGCATGTCGA TGGCGCCGCT CGCTGAGGAG GACCGTGAGG TCCGCGGCCT TGGCGCTGAA GTCAGCGGCG TGGTGGTCGA CGAGATCGAG CCGGGCAGCG AGGCCGCCCG CAAGGGTGTC CAGGTCGGCG ACATCATCCT GGAAGCGGGT GGCAATTCTG TTGCCGACGC CGAGGCCCTC CGTGCCATCG CCGATGAAGC CCGTGAAGAC GGTCGCAGCG CCATCCTGCT GCTGGTCGAG GGACGTGGCG GTCAGCGCTA TGTCGCCCTG CAACTCGGCT CTGCCGACTA G
|
Protein sequence | MISVDRIRRL GGVSLLVLSA LAAGSLLDRS MGEAYAVQSS EAPPLAAAVP AGAPLSFADL IETVSPSVVT VQVSGLVESS PFGGGNGPDL DNLPPQMREW MERQFGGQRQ APQPQPRQSL GSGFFISADG YLVTNHHVVA NADEITIGTA EGEEFPARVI GTDPQTDLAL LKVDGETDFP FVRLEENPNY RVGDWVVAVG NPFGLGGTAT AGIISAIGRP IGNSTYNDFI QTDASINRGN SGGPTFDLNG NVIGVNSQIF SPSGGNVGIG FAIPSDVAAR IVGDLRDDGR VARGWLGVSI QNVTEDIAEA LGLEGTTGAI ISSIVEGGPA DRAGFEREDV VLEIDGEAVD GSRDLTRRVG NIQAGGDVRF LVLRDGRERT IRATLGDRPG EEQLASMSSV DAAPARTSVF GMSMAPLAEE DREVRGLGAE VSGVVVDEIE PGSEAARKGV QVGDIILEAG GNSVADAEAL RAIADEARED GRSAILLLVE GRGGQRYVAL QLGSAD
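The sequences above are printed in space-separated blocks of ten characters. The following is a rough sketch of how one might strip that formatting, compute GC content, and translate the gene sequence. It assumes the amino acid assignments of translation table 11 match the standard code, which is the case for NCBI table 11 (it differs from the standard table only in its permitted start codons). The function names are hypothetical and chosen for illustration.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Note: an alternative start codon (such as GTG or TTG) would be read
    literally here rather than as methionine; this gene starts with ATG.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
fragment = "ATGATTTCGG TTGATCGCAT ACGCCGCCTC"
print(translate(fragment))  # MISVDRIRRL, the first block of the protein sequence
```

Running `gc_percent` on the full gene sequence should give a value close to the 66% reported in the record, and `translate` on the full sequence should reproduce the 506-residue protein.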