Gene Mext_4814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4814 
SymbolbchH 
ID5833780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp5375357 
End bp5379073 
Gene Length3717 bp 
Protein Length1238 aa 
Translation table11 
GC content71% 
IMG OID641370611 
Productmagnesium chelatase subunit H 
Protein accessionYP_001642253 
Protein GI163854210 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1429] Cobalamin biosynthesis protein CobN and related Mg-chelatases 
TIGRFAM ID[TIGR02025] magnesium chelatase, H subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.0476197 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAGGC GCATTTCGGC CGATAGGCCC ACCATCCGCG TCGCCATCGT CACGCTCGAC 
AATCACTTGG CGAGCGCGGT GGAACGGGCG CGCATGCGGC TCGCGGCCGA GATGCCGGGG
CTGTGCTTGA GCTTCCACGC TGCCGCGGAG TGGGAGACGG ACAAGGCCGC GCTCCGGGCC
TGCGAGGCGG AGATTGCGCG GGCCGACATC ATCCTCTCGG CGATGCTGTT CCTCGACGAG
CACGTCCGGG CGATCCTGCC GGCGATCCAG GGACGCCGCG CGAGCTGCGA CGCGATGATC
GGCTGCCTCT CGGCCTCCGA GATCGTGCGC ACGACCAAGC TCAACCGCTT CGACATGAGT
GGGACCAAGC GTTCGGCCCT CGACTTCCTG AAGCGGCTGC GCGGCAAGCC GGGCGCAGAG
GGCAACGCCG CCCGGCAGAT GGCGCTGGTG CGCAAGCTGC CGAAGATCCT GCGCTTCATC
CCCGGCTCGG CGCAGGACGT GCGCGCCTAT TTCCTGACCT TGCAATACTG GCTCGCGGGC
TCGGACGAGA ACGTGGCGAG CCTCGTGCGC TTCCTCGTGC AGCGCTACGC CGCCGGCCCC
CGCGCCGTGT GGCGGGAGAT CACCGCCGCG CCCGCACCCC AACACTATCC TGAGACCGGC
CTCTACCATC CGCGCATGGC CGACCGCGTC GGCGAGAGCC TGTCCGCCCT GCCGACCGTG
CCAAGCGCCC GCGGCCGCGT CGGCCTGCTG CTGATGCGCT CCTACGTTCT GGCCGGCAAC
ACGGCGCATT ACGACGGCGT CATCGCCGCC CTGGAGGCCA AGGGCCTCTG CGTGGTGCCG
GCCTTTGCCG CCGGCCTCGA CAACCGCCCC GCGGTGGACG CCTATTTCAC GGTGAACGGC
CGGCCGGCGA TCGACGCATT GGTCTCGCTC ACCGGGTTCT CGCTGGTCGG CGGCCCCGCC
TACAACGACG CGGCGGCGGC CGAGGCGACG CTGGCGCGGC TCGACGTGCC TTATCTCGCC
GCCCACGCGC TGGAATTCCA GACGATCGAG CAATGGGAGG CGGGCAGCCG CGGCCTGTCG
CCGGTCGAGG CGACCATGAT GGTCGCGATC CCCGAACTCG ATGGCGCCAC CGCACCGATG
GTGTTCGGCG GCCGTTCCTC GGCCTCCGGC CCCGGCAACG CCCGCGACAT GCGCGTGCAT
CCCGAGCGCG CCGAGCGGCT CGCCGCCCGG ATCGAACGGC TCGTTGCCCT GCGTCGCCGG
CCGAAGCGGG AGCGGCGCCT GGCCGTCATT CTGTTCAACT TCCCGCCCAA TGCCGGCGCT
ACCGGCACGG CCGCCTTCCT CTCGGTCTAC GCCTCGCTGC TGAACACCCT GCGCGGCCTC
AAGGCCGAGG GCTACGACGT CGCGGTGCCC GACAGCGCCG ATGCGTTGCG TGAGACGATT
CTCGGCGGCA ACGCCACCCG TTACGGCACG CCGGCCAACG TCCACGCCCG CGTATCGGCC
GAGGATCACG TCCGCCGCGA AACCTACCTG CCCGAGATCG AGGCGCAGTG GGGCCCGGCG
CCCGGCCGCC ACCAGAGCAA CGGGGCCGAC ATCCTCGTGC TCGGCGCGCA GTTCGGCAAC
GTCTTCGTCG GGGTGCAGCC CGCTTTCGGC TACGAGGGCG ATCCGATGCG GCTGCTGTTC
GAGCAGGGCT TCGCGCCGAC CCACGCCTTC AGCGCCTTCT ACCGCTGGCT GCGCGAGAAT
TTCTCGGCCG ACGCCGTGCT GCATTTCGGC ACCCACGGCG CGCTCGAATT TATGCCCGGC
AAGCAGGCCG GTTTGTCAGA AGCCTGCTGG CCCGAGCGGC TGATCGGTGC ACTGCCCAAC
ATCTACCTCT ACGCCGCCAA CAACCCGTCC GAGGGCACGC TGGCCAAGCG CCGCTCCGCC
GCGACGCTCG TGAGCTACCT CACGCCGAGC CTCGCCGCCG CCGGGCTCTA CCGCGGCTTG
AGCGACCTGA AGGCCTCGGT CGAGCGCTGG CGCGGTCTGG AGCCGGAAGC GACGGTTGAG
CGGACAGCGC TCGCCGTGGT CATCCAGGCC CAGGGCGCGG CCGTCGACCT AGTCCCGGCC
GAGCCCGCCT GGGAGGGCAA TCCCGCCCCG CACGTCACCG CGCTCGCCGC CGCTTTGTCC
GAACTCGAGC AGACGCTGAT CCCACACGGC CTGCACGTGA TCGGCCAGGG GATGGTCTGC
GAGGAACGGG TCGATCTGCT GCTCGCCCTG GCGGAGGCCT CGCACGGGCT GAAGCCCGCC
CGCACCGGGA TCGAACTCCT GATCGAGGGC GCGTCCCTCA ACGAGGCGCT GGCCGCCGCG
GGTCTGCCCG CCGACGAGAC CCACCGGGCC GCCTTCGCCG ACCTCGCCCG GATCGATGCC
GACCTCGCCC GCGACAGCGA GCTGCCCGCG CTGATGCGGG CCCTCGACGG GCGCTTCGTG
GCCCCGGTCG CGGGCGGCGA CCTGCTGCGC AATCCCGGTA TCCTGCCCAC GGGGCGCAAC
CTGCACGGCT TCGACCCTTA CCGCCTGCCC ACGGCCTTCG CGCTGGCCGA CGGCGCCCGC
CAAGTGGCGC GGGTGCTGGA GCGTTACCTC GAGGAAGGCC GGGCCCTGCC GGAGAGCGTC
GCGCTCGTCC TGTGGGGCAC CGACAACCTG AAGAGCGAGG GCGGCCCCAT CGCCCAGGCC
CTGGCGCTGA TCGGCGCGGC GCCCCGCTTC GACGGCTACG GCCGCCTCGC CGGGGCCGAA
CTGATCCCGC TCGAACAACT CCGCCGCCCG CGCATCGACG CAGTAGTCAC GCTCTCGGGC
ATCTTTCGCG ATCTTCTGCC GTTGCAGACC AAGCTGCTGG CAGAAGCGAG CTTTATGGCG
GCCAGCGCGG ACGAGCCGAC CGACAAGAAC TTCGTGCGCA AGCACGCGCT GGCGATCCAG
GCCGAGCAGG GCTGCGACTT CGAGACGGCG GCCCTGCGGG TCTTCTCCAA TGCCGAGGGC
GCCTACGGCG CCAACGTGAA CCATCTCGTC GAGTCCGGCC GCTGGGACGA CGAGGACGAA
CTCTGCGAAA CCTTTTCGCG CCGCAAGAGC TTCGCCTACG GCCGCACCGG CCGCCCCGCG
CCGCAGCGTG ACCTGATGAA GGCGGTGCTG GCCCGGGTCG ATCTCGCCTA CCAGAACCTC
GATTCGGTCG AACTCGGTGT CACGAGCGTC GATCACTACT TCGACGGGCT CGGCGGCATG
GGCCGGGCTG TGGCCCGCGC CCGCGGCGAG GCGGTGCCGA TCTACATCAG TGACCAGACC
CGCGGGGAGG GGCGCGTGCG CTCCCTCGAC GAGCAGGTGG CGCTGGAAAC CCGCACCCGC
ATGCTCAACC CGAAATGGTA CGAGGGCCTG CTCGGCCACG GCTATGAGGG CGTGCGCCAG
ATCGAGGCGA CGCTGACCAA CACCGTCGGC TGGTCCGCCA CCGCCGGCGC GGTGCAGCCC
TGGATCTACG AGCGCATCAC CGAGACCTTC GTGCTCGACG ACGCCATGCG CGACCGGATG
GCGACGCTCA ACCCCACGGC CTGCGCCAAG GTCGCCAGCC GCCTGATCGA GGCTCACCGC
CGCGGCTTCT GGACTCCCGA CCCGGCGATG CGCGACGCGC TCGACCGTGC CGAGGAGGAA
CTGGAGGACC GGCTGGAGGG CGTGACACCC GGCATCACCG CAGGAGTCGC GGCATGA
 
Protein sequence
MPRRISADRP TIRVAIVTLD NHLASAVERA RMRLAAEMPG LCLSFHAAAE WETDKAALRA 
CEAEIARADI ILSAMLFLDE HVRAILPAIQ GRRASCDAMI GCLSASEIVR TTKLNRFDMS
GTKRSALDFL KRLRGKPGAE GNAARQMALV RKLPKILRFI PGSAQDVRAY FLTLQYWLAG
SDENVASLVR FLVQRYAAGP RAVWREITAA PAPQHYPETG LYHPRMADRV GESLSALPTV
PSARGRVGLL LMRSYVLAGN TAHYDGVIAA LEAKGLCVVP AFAAGLDNRP AVDAYFTVNG
RPAIDALVSL TGFSLVGGPA YNDAAAAEAT LARLDVPYLA AHALEFQTIE QWEAGSRGLS
PVEATMMVAI PELDGATAPM VFGGRSSASG PGNARDMRVH PERAERLAAR IERLVALRRR
PKRERRLAVI LFNFPPNAGA TGTAAFLSVY ASLLNTLRGL KAEGYDVAVP DSADALRETI
LGGNATRYGT PANVHARVSA EDHVRRETYL PEIEAQWGPA PGRHQSNGAD ILVLGAQFGN
VFVGVQPAFG YEGDPMRLLF EQGFAPTHAF SAFYRWLREN FSADAVLHFG THGALEFMPG
KQAGLSEACW PERLIGALPN IYLYAANNPS EGTLAKRRSA ATLVSYLTPS LAAAGLYRGL
SDLKASVERW RGLEPEATVE RTALAVVIQA QGAAVDLVPA EPAWEGNPAP HVTALAAALS
ELEQTLIPHG LHVIGQGMVC EERVDLLLAL AEASHGLKPA RTGIELLIEG ASLNEALAAA
GLPADETHRA AFADLARIDA DLARDSELPA LMRALDGRFV APVAGGDLLR NPGILPTGRN
LHGFDPYRLP TAFALADGAR QVARVLERYL EEGRALPESV ALVLWGTDNL KSEGGPIAQA
LALIGAAPRF DGYGRLAGAE LIPLEQLRRP RIDAVVTLSG IFRDLLPLQT KLLAEASFMA
ASADEPTDKN FVRKHALAIQ AEQGCDFETA ALRVFSNAEG AYGANVNHLV ESGRWDDEDE
LCETFSRRKS FAYGRTGRPA PQRDLMKAVL ARVDLAYQNL DSVELGVTSV DHYFDGLGGM
GRAVARARGE AVPIYISDQT RGEGRVRSLD EQVALETRTR MLNPKWYEGL LGHGYEGVRQ
IEATLTNTVG WSATAGAVQP WIYERITETF VLDDAMRDRM ATLNPTACAK VASRLIEAHR
RGFWTPDPAM RDALDRAEEE LEDRLEGVTP GITAGVAA