Gene Mpe_A2495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2495 
Symbol 
ID4784891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2659045 
End bp2660136 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content70% 
IMG OID640091065 
Productferrochelatase 
Protein accessionYP_001021685 
Protein GI124267681 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0276] Protoheme ferro-lyase (ferrochelatase) 
TIGRFAM ID[TIGR00109] ferrochelatase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.292462 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCACG CACCCGAACC CGCGCACCGC CACGGCGGCG CCGAACGCTG TGCGATCCTG 
CTGGTCAACC TGGGCACACC GGACGAACCG AGCGCCCCCG CACTGCGCCG CTACCTCGCC
GAGTTCCTGA GCGACCCGCG CGTCGTCGAG ATCCCGCGCG CAGTGTGGCT GCCGATCCTG
CACGGCGTCG TCCTGCGGGT GCGGCCCGCC AAGTCGGCCG CCAAGTACGC CAGCATCTGG
ACAGCCGAGG GCTCGCCGCT GAAGGTCTGG ACCGAAAAGC AGGCCAAGCT GCTCACCGGC
TATCTGGGCG AGCGCGGCCA CCCGGTGCTG GTGCGCGCGG CGATGCGCTA TGGCCAGCCC
TCGGTCGCGA CCCAGCTCGA CGCGCTGAAG GCCGATGGCG CGACACGCAT CCTCGTGCTG
CCGCTGTACC CGCAGTACGC CGCCGCCACC ACCGCCAGCG TGTTCGACGC CGTCTACGCC
TGGGCCGCTC GCACCCGGCG CGTGCCGGAG CTGCGCTTCG TCAACCACTA CCACGACGAC
CCGGGCTACA TCCTTGCGCT CGGTCGCTGC ATCGAAGACC ACTGGATGCG CAACGGCCGC
GCCGAGCGGC TGGTGCTCAG CTTCCACGGC GTGCCCGAAC GAACGCTGCG CCTCGGTGAT
CCGTATCACT GCGAGTGCCA GAAGACAGCG CGGTTGCTGA CCGAGCGGCT GGCACTGAAG
CCCGAGCAGG TGCTGGTGAC CTTCCAGAGC CGCTTCGGCA AGGCCAAGTG GCTGGAGCCC
TACACCGAGC CGACTTTGGT GCAGCTGGCG CAGCAGGGCA TCCGGCGCGT CGACGTGGCC
TGCCCCGGCT TCACCTCGGA CTGCCTCGAA ACGCTGGAAG AGATTGCGCA GGAAGCCCGT
GAGGCCTACC TGCATGCCGG TGGCGAGACG TTCCACTACA TCCCCTGCCT GAACGATCGG
CACGAGTGGA TCGCCGCGCT CAGCGACATC GCGATCCGCC ACCTGCAAGG CTGGCCGACC
CAAACGGCGC CGGATCCGGC GGCCCTGCAG GCCCAGGCTC TTCGGGCCCG CCAACTGGGG
GCCCAGGCAT GA
 
Protein sequence
MKHAPEPAHR HGGAERCAIL LVNLGTPDEP SAPALRRYLA EFLSDPRVVE IPRAVWLPIL 
HGVVLRVRPA KSAAKYASIW TAEGSPLKVW TEKQAKLLTG YLGERGHPVL VRAAMRYGQP
SVATQLDALK ADGATRILVL PLYPQYAAAT TASVFDAVYA WAARTRRVPE LRFVNHYHDD
PGYILALGRC IEDHWMRNGR AERLVLSFHG VPERTLRLGD PYHCECQKTA RLLTERLALK
PEQVLVTFQS RFGKAKWLEP YTEPTLVQLA QQGIRRVDVA CPGFTSDCLE TLEEIAQEAR
EAYLHAGGET FHYIPCLNDR HEWIAALSDI AIRHLQGWPT QTAPDPAALQ AQALRARQLG
AQA