Gene Mmar10_0942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_0942 
Symbol 
ID4284867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1037524 
End bp1039044 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content66% 
IMG OID638140410 
Productprotease Do 
Protein accessionYP_756173 
Protein GI114569493 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0681927 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.773505 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTCGG TTGATCGCAT ACGCCGCCTC GGCGGAGTTT CGCTTCTGGT CCTGTCCGCG 
CTGGCAGCAG GAAGTCTCCT GGATCGCTCG ATGGGCGAGG CCTACGCCGT CCAGTCGTCC
GAGGCGCCGC CCCTGGCGGC TGCCGTCCCG GCCGGCGCAC CGCTGTCCTT TGCCGACCTG
ATCGAAACCG TCAGCCCGTC CGTGGTCACG GTTCAGGTCA GCGGACTGGT CGAGAGCTCG
CCCTTTGGCG GCGGCAATGG ACCCGACCTC GACAATCTCC CCCCGCAAAT GCGCGAATGG
ATGGAGCGCC AGTTCGGCGG CCAGCGCCAG GCGCCGCAGC CGCAACCGCG CCAGTCGCTG
GGATCCGGCT TCTTCATCTC GGCTGACGGA TATCTGGTGA CCAATCACCA TGTGGTGGCC
AATGCCGACG AGATCACCAT CGGAACGGCC GAGGGCGAGG AGTTTCCTGC CCGCGTCATT
GGTACCGATC CGCAGACCGA CCTGGCGCTG CTCAAGGTCG ATGGCGAGAC TGATTTCCCG
TTTGTGCGGC TGGAAGAGAA CCCGAACTAC CGGGTTGGCG ACTGGGTCGT CGCGGTCGGC
AATCCCTTCG GTCTCGGCGG TACGGCAACA GCCGGTATCA TCTCGGCCAT CGGTCGTCCG
ATCGGCAATT CCACCTATAA TGACTTCATC CAGACCGACG CCTCGATCAA TCGCGGCAAT
TCCGGCGGCC CGACCTTTGA CCTCAACGGC AATGTGATCG GCGTGAACTC GCAAATCTTC
TCGCCGTCTG GCGGCAATGT CGGCATCGGC TTTGCCATTC CCTCCGACGT CGCGGCCCGC
ATCGTCGGCG ATCTGCGCGA TGATGGCCGG GTGGCGCGCG GCTGGCTGGG TGTCTCGATC
CAGAATGTCA CCGAGGACAT TGCCGAAGCG CTGGGCCTTG AGGGCACGAC CGGCGCCATC
ATCAGCTCGA TCGTCGAGGG CGGCCCCGCC GACCGCGCCG GTTTCGAGCG CGAGGATGTG
GTGCTGGAAA TCGATGGCGA GGCCGTTGAC GGTTCGCGCG ACCTGACCCG CCGCGTCGGC
AATATCCAGG CCGGCGGCGA TGTCCGCTTC CTGGTGCTGC GTGACGGCCG CGAGCGGACC
ATCCGTGCCA CGCTGGGTGA TCGCCCGGGC GAGGAACAAC TGGCCAGCAT GAGCAGTGTT
GATGCGGCTC CGGCACGGAC TTCCGTGTTC GGCATGTCGA TGGCGCCGCT CGCTGAGGAG
GACCGTGAGG TCCGCGGCCT TGGCGCTGAA GTCAGCGGCG TGGTGGTCGA CGAGATCGAG
CCGGGCAGCG AGGCCGCCCG CAAGGGTGTC CAGGTCGGCG ACATCATCCT GGAAGCGGGT
GGCAATTCTG TTGCCGACGC CGAGGCCCTC CGTGCCATCG CCGATGAAGC CCGTGAAGAC
GGTCGCAGCG CCATCCTGCT GCTGGTCGAG GGACGTGGCG GTCAGCGCTA TGTCGCCCTG
CAACTCGGCT CTGCCGACTA G
 
Protein sequence
MISVDRIRRL GGVSLLVLSA LAAGSLLDRS MGEAYAVQSS EAPPLAAAVP AGAPLSFADL 
IETVSPSVVT VQVSGLVESS PFGGGNGPDL DNLPPQMREW MERQFGGQRQ APQPQPRQSL
GSGFFISADG YLVTNHHVVA NADEITIGTA EGEEFPARVI GTDPQTDLAL LKVDGETDFP
FVRLEENPNY RVGDWVVAVG NPFGLGGTAT AGIISAIGRP IGNSTYNDFI QTDASINRGN
SGGPTFDLNG NVIGVNSQIF SPSGGNVGIG FAIPSDVAAR IVGDLRDDGR VARGWLGVSI
QNVTEDIAEA LGLEGTTGAI ISSIVEGGPA DRAGFEREDV VLEIDGEAVD GSRDLTRRVG
NIQAGGDVRF LVLRDGRERT IRATLGDRPG EEQLASMSSV DAAPARTSVF GMSMAPLAEE
DREVRGLGAE VSGVVVDEIE PGSEAARKGV QVGDIILEAG GNSVADAEAL RAIADEARED
GRSAILLLVE GRGGQRYVAL QLGSAD