Gene Mext_4064 details


Gene Information

Locus tag: Mext_4064
Symbol:
ID: 5831643
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 4519442
End bp: 4520968
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 71%
IMG OID: 641369855
Product: magnesium-protoporphyrin IX monomethyl ester anaerobic oxidative cyclase
Protein accession: YP_001641505
Protein GI: 163853462
COG category: [C] Energy production and conversion
COG ID: [COG1032] Fe-S oxidoreductase
TIGRFAM ID: [TIGR02026] magnesium-protoporphyrin IX monomethyl ester anaerobic oxidative cyclase
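
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4519442
end_bp = 4520968
gene_length_bp = 1527
protein_length_aa = 508

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codons = span_bp // 3

# Assumption: the last codon is a stop codon and adds no residue.
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)
```

Under these assumptions the span equals the stated gene length, and the codon count minus one equals the stated protein length.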


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.416317
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATCC TGCTCGTCAA CGTGCCCCAT CCGGCGATCG GCAGCCGGAT CCCCGACGAC 
CATCTGCCTC CGCTGGGGCT TCTCGCCATC GGCGGCCCGT TGATCGACGA CGGGCATGCC
GTCTCGCTCA TCGACGCCGA GTTCGGACCC GTGCCGCTGC CGGACCTCGT CCGGGAGATC
GTGGCGCAGG CACCCGAGGC GGTCCTGTTC GGTCATTCCG GCTCGACCTC GGGGCACCCG
GTCATCGCGG AGGTGTCGTC GAGGGTGGCC GCGGCGATGC CGGGCGTGAC GATCGTCTAC
GGTGGCGTCT TCCCAACCTA TCACGGGCGC GAGATTCTCG AGGCCGAGCC CCACGTCGCC
GCGATCGTGC GCGGGGAGGG CGAGGAGACC GCCCGGCGGC TCATGGCGGC GCTCGCGGCG
GGCCGGTCGC TCGGCACCGT GCCCGGCCTC GCCTACCGGG ACGGCGACGC GATCCGCGAA
ACGCCGCCGG CCCCGCTGAT CCGCGATCTC GACGCCTACC GGATCGGCTG GGAGCTGATC
GACCATGCCC GTTACAGCTA CTGGGGCGGC CTGCGTGCGG TCGTCGTCCA GTTCTCCCGC
GGCTGCCCGC ATCCCTGCAC CTATTGCGGC CAGCGCGGCT TCTGGACCCG CTGGCGCCAC
CGCGATCCCG TCCGCTTCGC CGCCGAACTC GCCCGGCTTC ACCGCGAGCA CGGCGTGCGG
GTGATCAACT TCGCCGACGA GAATCCGACC GTCTCGAAGA AGGTCTGGCG CACCTTCCTG
GAGGCGCTGA TTGCGGAGAA CGTCGATCTC ATCCTGGTCG GCTCGACGCG GGCGGACGAC
ATCGTGCGCG ATGCCGACAT CCTGCCGCTC TACAAGCGGG CGGGGTGGGA GCGCTTCCTG
CTCGGTCTCG AGAGCACCGA TACGACCACC CTCGACCTGA TCCGGAAGGG CGCCACCACG
ACGACCGACC GCGAGGCGAT CCGCCTCCTG CGGGCGAACG GCATCCTCTC CATGGCGACC
TGGGTGGTGG GCTTCGAGGA GGAGCGCGAC CGCGACTACT GGCGCGGCCT GCGGCAGCTC
CTCGCCTACG ACCCCGACCA GATCCAGATG CTCTACGTCA CGCCGCATCG CTGGACGCCC
TATTTCCGGC TTGCCGAGGA GCGCCGCGTG ATCCAGCTCG ACCGGCGGCG GTGGGACTAC
AAGCATCAGG TGCTCGCCAC CCGCCACATG CCGCCCTGGC GGGTGCTGCT CTGGTTCAAG
CTGATCGAGG TGATCCTTCA GGCCCGCCCG AAGGCGCTGG CGCGCATCTA CCTGAACCGC
GATCCCCGCC TGCGCCACGC CATGCGCTGG TACACGCGGA TGGGCCGGCG AGTCTGGCCC
CACGAGATCC TGGCTTGGCT GCGCGATCCG CTGACACGGA CGGGCCCGAC GGTCGCGGCG
TTCTGGGGCA GGGGGCAGGA GCGGGAGGAG GCGATGGCCG TGCGCGCCGC TGCCCGATCC
CGTCCGGATG GTCGCGAGGC TGCCTGA
 
Protein sequence
MKILLVNVPH PAIGSRIPDD HLPPLGLLAI GGPLIDDGHA VSLIDAEFGP VPLPDLVREI 
VAQAPEAVLF GHSGSTSGHP VIAEVSSRVA AAMPGVTIVY GGVFPTYHGR EILEAEPHVA
AIVRGEGEET ARRLMAALAA GRSLGTVPGL AYRDGDAIRE TPPAPLIRDL DAYRIGWELI
DHARYSYWGG LRAVVVQFSR GCPHPCTYCG QRGFWTRWRH RDPVRFAAEL ARLHREHGVR
VINFADENPT VSKKVWRTFL EALIAENVDL ILVGSTRADD IVRDADILPL YKRAGWERFL
LGLESTDTTT LDLIRKGATT TTDREAIRLL RANGILSMAT WVVGFEEERD RDYWRGLRQL
LAYDPDQIQM LYVTPHRWTP YFRLAEERRV IQLDRRRWDY KHQVLATRHM PPWRVLLWFK
LIEVILQARP KALARIYLNR DPRLRHAMRW YTRMGRRVWP HEILAWLRDP LTRTGPTVAA
FWGRGQEREE AMAVRAAARS RPDGREAA
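
The sequence blocks above are printed in groups of ten characters, so they need whitespace removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, summarized for GC content, and translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard NCBI amino-acid string in T, C, A, G order; translation table 11 assigns the same amino acids as the standard code, so this is assumed to be adequate here. Only the first row of the gene sequence is used as sample input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first, second, third base = T, C, A, G).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # "X" marks an unknown codon
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First row of the gene sequence from this record.
first_row = clean(
    "ATGAAGATCC TGCTCGTCAA CGTGCCCCAT CCGGCGATCG GCAGCCGGAT CCCCGACGAC"
)
print(len(first_row))        # 60
print(translate(first_row))  # MKILLVNVPHPAIGSRIPDD
print(round(gc_fraction(first_row), 2))
```

The 20 residues translated from the first row match the first two groups of the protein sequence listed above. Passing the full gene sequence block through `clean` and `gc_fraction` would give a value to compare against the GC content field.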