Gene Mmar10_0018 details

Gene Information

Locus tag: Mmar10_0018
Symbol:
ID: 4283972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 21124
End bp: 23328
Gene Length: 2205 bp
Protein Length: 734 aa
Translation table: 11
GC content: 62%
IMG OID: 638139478
Product: prolyl oligopeptidase
Protein accession: YP_755252
Protein GI: 114568572
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1505] Serine proteases of the peptidase family S9A
TIGRFAM ID:
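
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the gene is a stop codon that is not counted in the protein length. Neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 21124
end_bp = 23328

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2205, matching "Gene Length"
print(protein_length)  # 734, matching "Protein Length"
```

Under these assumptions the computed values agree with the listed gene length (2205 bp) and protein length (734 aa).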


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTTT ATCAGACGTT TTGGGCGATC TCTGCCAGCG CCGCCATCCT CGCGGCCTGC 
ACACAGCCTG CCAGCAATGA GGGCGACCAA TTGGCCGACA CGACACCTGA GGCGCCCGCG
GAACGCCAGA TCGTCGAAAT CGATCCCAAT AATGATCCGC GGGTCTGGCT GGAAGAGGTC
GAGGGCGAGC AGGCCATTGA ATGGGTCGAG GGTCAGAACG AGCGCACATT CGCCCGTCTG
CAAGGCGATG AGCGCTATCA GGGCCTGTAC GATCAGGCGC TGGCCATTGC CCAGTCCGAG
GACCGCATTC CCTACGGCTC CTATTCCGGC GGCTATATCT GGAATTTCTG GCAGGACGCC
GAACACACGC ACGGCCTGTG GCGCCGGACC TCGCTGGAAA GCTATCTGAC CGACGCGCCG
GAATGGGACG TGGTGCTCGA CCTCGATGCC CTGTCCGAAG CTGAAGACCG CAACTGGGTC
TGGCGCGGCT CAAATTGTCT GGCGCCGGCC TATGAGCGCT GCATTCTGAC ACTGTCGGAT
GGCGGCTCCG ACGCGGCGGT CCGCCGGGAG TTTTCCATCA CCGACCGAGC CTTTGTCGAC
GGTGGTTTCG AGACGCCGGA AGCCAAGGGC GGGGTCAGCT GGATCGACGA GAATACGCTG
ATGGTCGGCC TGGCGACCTC GCCGGAAGAC TCCACCAGCT CGGGCTATCC CTCGGTTGCC
TACCGCTGGG AACGCGGCAC CGATCTCGCC GATGCCACTG AAGTGGTGCG CGGCGACCAG
GACGATGTGG GTCTCTTTGC CTTTCGCGCC GAGGACCATG ACGGCACCGT CTACATGATG
GCGTCCGAAG CCAATACTTT CTACGACACG AGCTGGTGGT ACCTGCCGGC CGACGCCGGT
CCGGTGCAGC TGCCACTGCC CAGCAAGTCG TCGATCCAGG ATCTCTACCA GGGCGAGCTG
GTCTTCACGA TCGAGGAAAA CTGGACACCT GTGGAGGGCG GAGAAACCTT CCCGCAGGGC
GCGCTCCTGT CCTTCAACAT GGCTGAATTT GCCGCCACCG GTGAGCTGCC GGACGTCCGC
ACGGTCTTCG TGCCGGGCCC GCGCCAATCG CTGGGTGGCA TGGGTTCGAC GGCCTCGGCC
TTCCTGGTCG CGATCGACGA GAATGTCGTC GGCGGTCTCG AAGCCTTCCA CTTTGCCGAC
GGACAATGGT CGTCGGAGAC CGTTCCGGTT CCCGCCAACA TGACGATCAG CCTGCGCGGC
ACTGACAATC ATCACGATGT TGCCTTCATG AATGCCGAAG GCTTTTTGAC GCCGGACAGC
TATTTCATGG TCGATGCGGC CGAGTTGACG GTCGAGGAAA TCAAGTCCAT CCCGGCCCGC
TTTGACGCCG AAGGCCTGGT GGTCGAACAA CTCGAAGCCG CCTCTCCGGA CGGCACGATG
GTGCCCTATT TCGTGGTCCG CCGCGAAGAC ACGGTTATGG ACGGCACAAC GCCGACCCTG
CTTTACGCCT ATGGCGGATT CCAGGTCTCC ATCCGCCCGA GCTATTCCGG TTCACGCGGC
CAGCTGTGGC TGGAAAATGG CGGCGCCTAC GTGGTCGCCA ATATCCGCGG TGGGGGCGAG
TTCGGACCGG CCTGGCACCA GGCGGGCCTG AAAATGGATC GCCAGCGCAT CTATGACGAT
CTGATTGCGG TCGCCGAGGA CCTGTCGACC CGTGGCATCA CCAGCCCGCG TCATCTCGGC
GTCTATGGCG GTTCCAATGG CGGTCTGCTG ACCGGTGTCA TGTACACCCA GCGTCCGGAC
CTGTGGAACG CCGTTGTCTC GGCCGTGCCG CTGCTGGACA TGCTGCGTTA TCACACGCTG
CTGGCCGGCG CGTCCTGGAT GGGTGAATAT GGTAATCCGG AAGATCCGGA TGAGGGCGGT
TTCCTGCGCT CGATCTCGCC CTATCACAAT GTCGATGCGA ACGGCGATTA TCCGGAAATC
TACCTCTACA CCTCGACCAA GGATGACCGC GTCCATCCGG GCCATGCCCG CAAGATGGCG
CATCTTCTGG AAGAGCTGGG CCATGACTAT CTCTACTACG AGAACATGGC AGGCGGTCAC
GCCGCCGCCG CCAATCTCGA GGAGCGGGCC CGCAGTGAAG CACTGCTCTA CACCTTCCTG
ATGCAAAAGC TGATGGATGA TACCGATCCG CTCGACGCCG AATAG
 
Protein sequence
MKFYQTFWAI SASAAILAAC TQPASNEGDQ LADTTPEAPA ERQIVEIDPN NDPRVWLEEV 
EGEQAIEWVE GQNERTFARL QGDERYQGLY DQALAIAQSE DRIPYGSYSG GYIWNFWQDA
EHTHGLWRRT SLESYLTDAP EWDVVLDLDA LSEAEDRNWV WRGSNCLAPA YERCILTLSD
GGSDAAVRRE FSITDRAFVD GGFETPEAKG GVSWIDENTL MVGLATSPED STSSGYPSVA
YRWERGTDLA DATEVVRGDQ DDVGLFAFRA EDHDGTVYMM ASEANTFYDT SWWYLPADAG
PVQLPLPSKS SIQDLYQGEL VFTIEENWTP VEGGETFPQG ALLSFNMAEF AATGELPDVR
TVFVPGPRQS LGGMGSTASA FLVAIDENVV GGLEAFHFAD GQWSSETVPV PANMTISLRG
TDNHHDVAFM NAEGFLTPDS YFMVDAAELT VEEIKSIPAR FDAEGLVVEQ LEAASPDGTM
VPYFVVRRED TVMDGTTPTL LYAYGGFQVS IRPSYSGSRG QLWLENGGAY VVANIRGGGE
FGPAWHQAGL KMDRQRIYDD LIAVAEDLST RGITSPRHLG VYGGSNGGLL TGVMYTQRPD
LWNAVVSAVP LLDMLRYHTL LAGASWMGEY GNPEDPDEGG FLRSISPYHN VDANGDYPEI
YLYTSTKDDR VHPGHARKMA HLLEELGHDY LYYENMAGGH AAAANLEERA RSEALLYTFL
MQKLMDDTDP LDAE
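
Both sequences are printed in space-separated blocks of ten characters, so the blocks have to be joined before the sequence is used. The following is a minimal sketch of how the first and last lines of the gene sequence could be joined and translated to compare against the protein sequence. The helper names join_blocks and translate are hypothetical and not part of any database tool. The codon table is built from the standard amino acid assignments, which translation table 11 shares (table 11 differs only in its permitted start codons). The last line is treated as in frame because the 36 full lines before it hold 2160 bases, a multiple of three.

```python
from itertools import product

# Amino acids in the conventional order for codons enumerated over T, C, A, G.
# Translation table 11 uses these same assignments; only start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text):
    """Remove the spaces and line breaks between ten-character blocks."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the listing above.
first_line = "ATGAAATTTT ATCAGACGTT TTGGGCGATC TCTGCCAGCG CCGCCATCCT CGCGGCCTGC"
last_line = "ATGCAAAAGC TGATGGATGA TACCGATCCG CTCGACGCCG AATAG"

first_peptide = translate(join_blocks(first_line))
last_peptide = translate(join_blocks(last_line))

print(first_peptide)  # MKFYQTFWAISASAAILAAC
print(last_peptide)   # MQKLMDDTDPLDAE
```

The two peptides correspond to the first twenty and the last fourteen residues of the protein sequence listed above. The same functions could be applied to the full gene sequence once all of its lines are joined.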