Gene Mmar10_1122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1122 
Symbol 
ID4285140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1228832 
End bp1229851 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content65% 
IMG OID638140600 
Productproline iminopeptidase 
Protein accessionYP_756353 
Protein GI114569673 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.29642 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0858208 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCT TTCACCCGGC CTGCGCGCCG CACCAGAGCG GCCATCTGGA CGTTGGTGAC 
GGGCATGCGA TCTATTGGGA AAGCGCCGGG AACCCGGACG GCATCCCGCT GCTGGTGCTG
CATGGCGGCC CGGGTTCGGG GATATCCGAC AAGTTCCGGA AGTTGTTCGA TCCGCAGCGA
TTCCGCATCA TCCTGCTCGA CCAGCGCGGC GCCGGCCGCT CGACGCCGCA CCTGTCGCTG
CAGGCTAATA CGACCGCCCA TCTGGTCGCT GACCTGGAAG CCCTCCGCGG GCATCTGGCC
ATCAAGCGCT GGATGGTGTT CGGCCCGTCC TGGGGCTCGA CGCTGGCCCT GGCCTATGCC
CAGACGCATC CACACGTCGT CAGTGGCCTC ATCGTCGGCG CCATCTTCAC GGCCCGCGCT
TTCGAGCTGG ACTGGTGGCA CAGCCCCGAC GGCGCGCCGA CCATCTTTCC GGACGCCTTC
GCAACCTTCA TCGCTCCGGT ACCGCAGGCA GAGCGGACCT CACCCGAAAC GATCATGCGC
TGGTATCTTG CGGAGATGCA GGACGAGATC GCGCGCGGGC TTCCGGATCT GACTGAGCTG
GCCGACATCT CGACCCCGCT CGATACGCTG CGCCGCTCTG CGGTCTATCG GTGGACTGAG
TATGAGGACC GCCTCTCCTA TCTCGACAAT CCGCCAGAAG CTGTGCGCGC GGGACTGGCG
GCTCGCGGTG CCGGCTTTAT TGCCGCGCAT TCGCTGATCG AGGTCCATTA TTTCAGCCAG
GGATGCTTCC TCGAGGAGGG TGAATTGCTG GCCAAGGCCG ACCGCCTGGC AGACATTCCG
ATGGGGATCC TGCACGCCCG CTATGACATG GTGTGCCCCG CCCGCACCGC CTTCGATCTC
GCCGCAGCCT GCCCGCATGC CGATTTCCGG CTGGTCGCCG TGGGCGGTCA TGGCATGACC
GATGCCAGCC AGGCTGAGCT GAATGTCCTT GTCGACGACG TGGTCTCCCG TATCACCTGA
 
Protein sequence
MSAFHPACAP HQSGHLDVGD GHAIYWESAG NPDGIPLLVL HGGPGSGISD KFRKLFDPQR 
FRIILLDQRG AGRSTPHLSL QANTTAHLVA DLEALRGHLA IKRWMVFGPS WGSTLALAYA
QTHPHVVSGL IVGAIFTARA FELDWWHSPD GAPTIFPDAF ATFIAPVPQA ERTSPETIMR
WYLAEMQDEI ARGLPDLTEL ADISTPLDTL RRSAVYRWTE YEDRLSYLDN PPEAVRAGLA
ARGAGFIAAH SLIEVHYFSQ GCFLEEGELL AKADRLADIP MGILHARYDM VCPARTAFDL
AAACPHADFR LVAVGGHGMT DASQAELNVL VDDVVSRIT