Gene Mmar10_0282 details


Gene Information

Locus tag: Mmar10_0282
Symbol:
ID: 4284618
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 334060
End bp: 335220
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 65%
IMG OID: 638139745
Product: peptidase M19, renal dipeptidase
Protein accession: YP_755513
Protein GI: 114568833
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog
TIGRFAM ID:
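
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon is a stop codon excluded from the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 334060
end_bp = 335220

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1161 0 386
```

Under these assumptions the result agrees with the listed gene length (1161 bp) and protein length (386 aa).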


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00197146
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.781669
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTATCT TCCTTGGCAT CTTGCTGGCG GTCATCCTGG TGGCCGGCGG CTTTGTCTTC 
TTCATTCTCC CGGCGCGGAT TGATGCCGAC ATGAATGTGG TCCGGCCGCA CGCGGCCTAT
ACGCCGAGCG CCGAGGCCGA GGCGCTGCAC GCGACCCTGC CGGTCGCCGA CCTTCATTCC
GACATGCTGT TGTGGATGCG TGACCCGACT CGCTGGAATG ACCGCGGTCA TACCGACCTG
CCGCGCCTGC GCGCCGGCGG TGTGGCGCTG CAGGTCTTTG CCAGCGTCAC CAAGACCCCG
TCCGGCCAGA ACTATGACAG CAACACGGCC GACAGTGACG ACATCACCGC CCTTGCCATT
GTCCAGCGCT GGCCGATCCG GACCTGGAGT TCGATCCTGG AACGCGCGGT CTATCATGCC
GACCGGCTGA ACCGTCTTGC CGAGCGGGAT GACAGTTTCA CGGTCGTCAG GACGCGCGCC
GACCTGGAAG CGGTGCTGGC TGCCCGGGAG ACCGACCCGA CGGCGCTGGC CGGCCTGCTG
GAGACCGAAG GCGCCCATCC GCTCGAGGGT GATATCGCCA ATATCGATGT GCTCTGGGAT
GCCGGCTATC GCATGTTCGG CCTGCAGCAT TTCTTCGACA ATGAGCTCGG CGGCTCGCTG
CACGGGGTCT CGAATGCCGG GCTGACGGAT TTCGGCCGCG ACGTCATCCG CCGCATGGAT
GAGCGCGGCC TGATCATCGA TGTCGCCCAC TCCTCGCCGC AAGTCGTCGA GGAGGTGCTG
GCCATGACGA ATAGCCCGCT CGTGGTCTCC CATACCGGCG TGCACGGCCA TTGCGAGGTC
AAGCGCAATA TCCCTGACGC CCTGATGCAG CGCATCGCCG CCGGCGGCGG GCTGATCGGC
ATCGGTTTCT GGGCCGATGT GACCTGCGAC GACAGCCCCG AAGGCGTCGC CGCCACCCTG
CTGGCCGCCA TCGACCTGGT CGGCATCGAC CATGTCGCCC TCGGCTCGGA CTATGACGGC
ACCGTCACCA CCACTTTCGA CGCCTCGGAA TATGTCGTCC TGACTGACCG CTTGCTGGAT
GCCGGCCTGA GCGCCGATGA GGTTGGGCAG GTGATGGGTG GCAATACGAT CCGTTTCTTC
CTGGAGAATC TGCCGCAATA G
 
Protein sequence
MRIFLGILLA VILVAGGFVF FILPARIDAD MNVVRPHAAY TPSAEAEALH ATLPVADLHS 
DMLLWMRDPT RWNDRGHTDL PRLRAGGVAL QVFASVTKTP SGQNYDSNTA DSDDITALAI
VQRWPIRTWS SILERAVYHA DRLNRLAERD DSFTVVRTRA DLEAVLAARE TDPTALAGLL
ETEGAHPLEG DIANIDVLWD AGYRMFGLQH FFDNELGGSL HGVSNAGLTD FGRDVIRRMD
ERGLIIDVAH SSPQVVEEVL AMTNSPLVVS HTGVHGHCEV KRNIPDALMQ RIAAGGGLIG
IGFWADVTCD DSPEGVAATL LAAIDLVGID HVALGSDYDG TVTTTFDASE YVVLTDRLLD
AGLSADEVGQ VMGGNTIRFF LENLPQ
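
The gene and protein sequences above are printed in space-separated blocks of 10 characters. As a rough illustration of how the two relate, the sketch below strips that grouping, translates DNA codon by codon, and provides a GC-fraction helper. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code; the function names (`clean`, `gc_fraction`, `translate`) are hypothetical helpers, not part of any database API. Only the first and last lines of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); translation
# table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Remove the spaces and line breaks that group the sequence in blocks of 10."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above (both start in frame).
first_line = clean("ATGCGTATCT TCCTTGGCAT CTTGCTGGCG GTCATCCTGG TGGCCGGCGG CTTTGTCTTC")
last_line = clean("CTGGAGAATC TGCCGCAATA G")

print(translate(first_line))  # MRIFLGILLAVILVAGGFVF
print(translate(last_line))   # LENLPQ
```

The two translated fragments match the first 20 and last 6 residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the reported GC content.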