Gene Mmar10_2643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_2643 
Symbol 
ID4285974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp2875424 
End bp2876749 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content63% 
IMG OID638142142 
Productcoproporphyrinogen III oxidase, anaerobic 
Protein accessionYP_757867 
Protein GI114571187 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00538] oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGACG ACGCGCCACG CTTCATTGCC AAATACGCGC TGGAAGCCGT TCCGCGCTAC 
ACTTCCTACC CGCCCGCCAC GAAGTTTCAC GGCAAGGTCG GTCTGGCGGA TTGGCAGCGC
TGGATGACGC AGCAAGACAA AAACCCTGCC CTGTCGATCT ACGTCCACAT CCCGTTCTGC
AAATCCATGT GCTGGTATTG TGGCTGTCAC ACCACAATCC CGAACGCACA TCGACGGGTC
GAGCACTATC TGGCCGCGCT CGACATCGAC ATTCGCAGGC GAGCAGAGAG CGCCCCGCCG
GACGGTCGGG TCTGTCACAT TCATTTTGGT GGCGGCTCTC CCGACATGCT GAAGCCGGAA
GAATTCCGGG CCGTCATGCA GCAGATCAGG GCCAGCTTCA ATGTCGCCCC GGACGCGGAA
ATTGCGGTCG AACTGGACCC TCGGGGGCTG GACTCCGAGC TTTGCCAGGC CATGGCGCAG
ACCGGGGTCA ACCGTGCCAG CCTGGGCGTC CAGGACATCT CCCACGAGGT CCAGAAGCTG
ATCCACAGGA TCCAGCCCCT CGAGATCGTA CAGGCCGCCG CCGACAAATT GCGGGCGGTC
GGCATTTCGG CGATCAATAT GGACATGATG TACGGCTTGC CGGCCCAGTC CGTCGCGCGC
GTTTGCGAGA CGGCGAACGC CATCGCTGGA ATGGGAGCAG ACCGCGTGTC CGTGTTTGGC
TACGCCCATG TACCCTGGTT CAAGAAACAC CAACGCGCCA TCCCCGAAGA GCGTCTGCCC
GGAGCGGCTG AACGGTTCGA CCAAATGCTG GCTGCAAACC ACACCCTGAG CGAAGCCGGC
TATGAGCGGA TCGGTTTCGA CCATTTCGCT CGCCCCGCAG ACCCGCTCGC CCGAGCCGCC
CGTGATGGCT CGCTGCGTCG CAATTTCCAG GGTTATACGA CCGACCCTTC GACCATTCTG
ATCGGACTGG GCGCTTCGGC GATCAGTGAG TGCCCGCAGG GATATGTGCA AACCGAACCC
AACCCCGTGC GCTACGCCAA CGCTGTCACG AGCAATGCCG ACATGATTGT GCGCGGCGTC
GGGCGATCAA TGGCCCAACA GGCCGTCGCC TCGCGTATCG CAAACCTGCT GTGCCAGTTC
GAGACCGAGG TTGAGCCAGG CGACCCTGTC GACCTTGCCA AGGATATGCT TGCTGAAGCC
CTGGTGCGAA TCGACGACGG CCGGATCACG CTGACCGAAG CCGGGCGCCC CTATGTCCGC
AATCTGGCTG CACGCATCGA TCCGGCGTTC CGGAGCACAA CCGAACAGCA CAGCGTCGCG
GTCTGA
 
Protein sequence
MPDDAPRFIA KYALEAVPRY TSYPPATKFH GKVGLADWQR WMTQQDKNPA LSIYVHIPFC 
KSMCWYCGCH TTIPNAHRRV EHYLAALDID IRRRAESAPP DGRVCHIHFG GGSPDMLKPE
EFRAVMQQIR ASFNVAPDAE IAVELDPRGL DSELCQAMAQ TGVNRASLGV QDISHEVQKL
IHRIQPLEIV QAAADKLRAV GISAINMDMM YGLPAQSVAR VCETANAIAG MGADRVSVFG
YAHVPWFKKH QRAIPEERLP GAAERFDQML AANHTLSEAG YERIGFDHFA RPADPLARAA
RDGSLRRNFQ GYTTDPSTIL IGLGASAISE CPQGYVQTEP NPVRYANAVT SNADMIVRGV
GRSMAQQAVA SRIANLLCQF ETEVEPGDPV DLAKDMLAEA LVRIDDGRIT LTEAGRPYVR
NLAARIDPAF RSTTEQHSVA V