Gene Mmar10_1604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1604 
Symbol 
ID4283926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1757227 
End bp1758435 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content66% 
IMG OID638141091 
Productimidazolonepropionase 
Protein accessionYP_756834 
Protein GI114570154 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.413765 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATTCG ATCGCCTGCT GACCGAAGCC CGACTGGCGA CCATGGTCGC TGGAGATGAC 
GGCTATGGCG TCATCGAAAA GGCGGCGCTG GGTATCAAGG ACGGGCGGAT CGCCTGGATC
GGCCCGATGT CCGAGATCCC GGGCGAGGCC CGGGAGACCG AACGCCTGGC CAGCCGTTGG
GTGACACCCG CGCTGATCGA CTGTCACACC CACCTCGTCT TTGCCGGTGA CCGCTCGGAC
GAGTTCGAGC GTCGTCTGGG CGGGGAGAGT TATGAAAGCA TTTCGCGCTC CGGCGGCGGC
ATCGCGCGGT CTGTCGAGGC TACCCGTGCG GCGGGCGCGG CCGAGCTGGC GGCGGGGGCG
CTGACTCGGA TTGATGCATT GGCACAGGAA GGTGTCGGCA CGGTCGAGAT CAAGTCCGGC
TACGGGCTGA CGCGTGAGAG CGAGCGCACC ATGTTGCGGG CCGCGCGCGG CGTTGAACGG
GCCTCGGGCA TGCGGGTCTC GGCGACCCTG CTCGCAGCCC ATGCCGTGCC GCCGGAGTAC
AAGGGTGAGA GTGGTCGCTA TATCGACGAG ATTTGCATCC CGCTGATCCG CGAAGCCGCC
CGCGAAGGTC TCGCCGATGC TGTGGACGCC TATTGCGAAG GCATCGGCTT CTCACCTGAA
GAAACCCGAC GCCTCTTCAT TGCCGCCAAG GCTGCCGGCT TGCCAGTCAA GCTCCATGCC
GACCAATTGT CTGATACTGG CGGCGCCAGG CTGGTTGCCG AATTCGGTGG CCTGTCGGCC
GACCATATCG AATACACGAA TGCTGAGGGC ATTGCGGCGA TGGCCAAGGC CGGCACGGTC
GGTGTGTTGC TGCCGGGCGC CTTCTACGCC TTGAACGAGA CGAAAAAACC GCCCGTCGAG
GCAATGCGGG CCGCGGGCGT CGACATGGCG GTAGCGACCG ACGCCAATCC CGGCACCTCG
CCGCTGGTAT CCTTGCTGAC GGCGGCCAAC ATGGCCTGCA TCCTGTTCGG TCTGACCTTG
CCGGAAGCCT TTGCCGGGAT GACCCGCAAT GCCGCGCGGG CGCTCGGGCT ACACGGCGAG
ATCGGCACGC TGGAGGTTGG CAAGGCCGCC GACCTCGCCA TCTGGGACGT CGAACGTCCC
GCAGAAATCA TTCAATGGAT CGGACGCCGG CCACTGCATG GCCGCATCCT CGCAGGAGAG
TGGCAGTGA
 
Protein sequence
MQFDRLLTEA RLATMVAGDD GYGVIEKAAL GIKDGRIAWI GPMSEIPGEA RETERLASRW 
VTPALIDCHT HLVFAGDRSD EFERRLGGES YESISRSGGG IARSVEATRA AGAAELAAGA
LTRIDALAQE GVGTVEIKSG YGLTRESERT MLRAARGVER ASGMRVSATL LAAHAVPPEY
KGESGRYIDE ICIPLIREAA REGLADAVDA YCEGIGFSPE ETRRLFIAAK AAGLPVKLHA
DQLSDTGGAR LVAEFGGLSA DHIEYTNAEG IAAMAKAGTV GVLLPGAFYA LNETKKPPVE
AMRAAGVDMA VATDANPGTS PLVSLLTAAN MACILFGLTL PEAFAGMTRN AARALGLHGE
IGTLEVGKAA DLAIWDVERP AEIIQWIGRR PLHGRILAGE WQ