Gene RPB_3979 details

Gene Information

Locus tag: RPB_3979
Symbol: bchH
ID: 3911786
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4542243
End bp: 4545989
Gene Length: 3747 bp
Protein Length: 1248 aa
Translation table: 11
GC content: 66%
IMG OID: 637885883
Product: magnesium chelatase subunit H
Protein accession: YP_487583
Protein GI: 162138517
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1429] Cobalamin biosynthesis protein CobN and related Mg-chelatases
TIGRFAM ID: [TIGR02025] magnesium chelatase, H subunit
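
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length_bp(4542243, 4545989)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the listed gene length of 3747 bp and protein length of 1248 aa.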


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.426085
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.16908
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAGC GCACTTCGCA CGCTGACAAG ACGCCAGTTC GTGTCGTCAT CGTTACTATG 
GACAGCCATC TGTCCGGCGC CGCCGCGCGC GCGCGGGATC TGCTGCGTCG CGACTATCCC
GGGCTCGAGT TGACAGTCCA CTCCGCCGAC GAGTGGGGCA CCGACGACAC CGCGCTGTCG
CGCTGTCACG CCGACATCGC CGCCGGCGAT ATCGTCATCG CCACCATGCT GTTCCTCGAC
GACCACGTCC GTGCGGTGAT GCCCGCGCTG CAGGCGCGCC GCAACGATTG CGACGCGCTG
GTGTGCTGCA TGTCGGCGGG CGAGGTGGTC AAGCTCACCC GCGTCGGCAA GTTCGACATG
AGCGCCGAAG CGCTCGGCAT GATCAACTGG CTGAAGAAGC TGCGCGGCAA GAAGACTGAA
GGCGGCGCCG GCAAGGGCGA GATGAAGATG CTGCGGCAGC TGCCCAAGCT GCTGCGCTTC
ATCCCCGGCA CCGCTCAGGA CATGCGGGCG TACTTCCTGA CACTGCAATA CTGGCTGGCC
GGCTCCGAGG CGAACATCGC CAACATGGTC CGCCTGCTGA TCGATCGCTA TGCCAGCGGT
CCGCGCAAGG TTCTGCGTGG TGTCGCCAAG GTCGAGCCGC CGGTCGAATA CGCCGATATC
GGCGTCTATC ATCCGAAGAT GAAGGGGCGG ATCGCCGAGT CGGTCGACAA GCTCCCGGCC
GGCCCCGCCG ACGCCAAGGG TTCGGTCGGC GTCCTGCTGC TGCGCTCCTA TCTGCTCGCC
GGCAATTCCG GTCATTACGA CGGCATGCTG GAAGCGTTCG AGGCCAAGGG CCTTCGCGTC
ATTCCGGCGT TCGCCTCGGG TCTCGACCAG CGCCCGGCGA TCGAGCGCTT CTTCATGAAG
AACGGCCGTC CGACGGTCGA CGCGGTGGTG TCGCTCACCG GCTTCTCGCT GGTCGGTGGT
CCCGCCTACA ACGACTCCAA GGCGGCCGAG CATATTCTCG CCGAGCTCGA CGTGCCGTAT
CTGTCGGCGC ACCCCGTCGA GTTCCAGACG CTCGAGCAGT GGGCCGCGTC CGATCGTGGA
CTGATGCCGG TGGAAAGCAC CATCATGGTG GCGATCCCCG AACTCGACGG CTCGTCGGGC
CCGATGGTCT ATGGCGGCCG TTCGGATGGC GGCGACGTCG CCTGCCCGGG CTGCGACAGG
TTCTGCAAGT TCGACCGCAA CCAGACCGGC GGCGACATGA ACATCTGCAT CGAGCGGGCG
CAGATGCTGG CGTCGCGCAC CGCGCGGCTG GTCGCGCTCC GCCGCAGCGA GCGCAAGGAC
CGCAAGGTCG CGGCCGTGCT GTTCAACTTC CCGCCGAACG CCGGCAACAC CGGCACTGCG
GCCTTCCTCG GCGTGTTCGA GTCGCTGCAC AACACGCTAA AGGCGATGAA GGCCGAGGGC
TACACCGTCG AGGTGCCGGA CAGCGTCGAT GCGCTGCGCG AAGCCATCAT CAACGGCAAC
GCCTCGCGAT TCGGCGCCCA TGCCAACGTC CATGCCCGCG TCCCGGCCGG CGATCACGTC
AAGAACGAGC GCTGGCTGCG CGAGATCGAA GGACAGTGGG GGCCGGCGCC GGGCAAGCAG
CAGAGCGACG GCAGCTCGAT CTTCGTGCTC GGTGAGCGCT TCGGCAACGT CTTCGTCGGC
GTCCAGCCGG CGTTCGGCTA CGAAGGCGAC CCGATGCGGC TGCTGTTCGA GAAGGGATTT
GCGCCCACGC ACGCTTTCTC GGCGTTCTAT CGATGGATCA GGCAGGATTT CGGGGCCCAC
GCCGTGCTGC ATTTCGGCAC CCACGGCGCG CTCGAATTCA TGCCCGGCAA GCAGACCGGC
CTGTCCGGCA CCTGCTGGCC CGACCGCATG ATCGGCGACC TGCCTAACAT GTACATCTAC
GCCTCCAACA ATCCCTCGGA AGGCGCCATC GCCAAGCGGC GCTCGGCGGC GACGCTGATC
AGCTATCTGA CGCCGCCGGT CGCTCATGCC GGTCTGTATC GCGGGCTGCT CGAATTGAAG
TCCTCGATCG AGCGCTGGCG CGGCCTGACG CCGGAGGAAG AAACCGAACG CGCCAATCTC
GCGGTGCTGG TGCAGGCGCA GGCCTCGGCC CTCGACCTGA CCCCGGCCGA ACCGGCCTGG
ACCGCGGAAG AAGCCGGCGC CACCATCGCC AAGCTCGCCG ACTCCGTGCT GGAGATGGAA
TACGCGCTGA TCCCGCATGG TTTGCATGTG ATCGGCAACG TGCCCTCCGA AGAGGAGCGG
GTCGAAACGC TGGAAGCCGT CGCCGATGCC ATGCATGGCA AGCGCCCGGA CAAGGCGCTG
CTCGAAGCGC TGGTGCGCGG CGGCCATCCC GAGCACCTGT CCGGCAACGG CCCGGAGGCG
GAAGCCGATC TGGCAATGCT TCACGAACTC GCCGGCATCG ACCGCATTCT CGCCGAAGAT
CACGAGATCC CGAGTATCCT GCGGGCGCTC GACGCCAAGT TCATCCGGCC GGCGCCGGGC
GGCGATCTGC TGCGCACGCC GGCGGTGCTG CCGACGGGCC GCAATCTGCA CGGCTTCGAT
CCGTTCCGCA TCCCGAGCGC ATTCGCGCTG CAGGACGGCG CCAAACAGGC GCAACGGCTG
ATCGATAAAC ACGTCGCCGA GGGCAATCCG CTGCCGGAGA CCGTCGCGAT CGTGCTGTGG
GGCACCGACA ATCTCAAGAA CGAAGGCGCG CCGATCGGCC AGGCTCTGGC GTTGATGGGG
GCGAAGCCGC GGTTCGACAG CTACGGCCGT CTCGCCGGCG CCGATCTGAT CCCGCTCGAC
GAATTGAAGC GGCCACGGAT CGACGTGATC ATCACCATGT CGGGCATCTT CCGCGACCTG
CTGCCGCTGC AGATCAAGCT GCTCGCGGAA GCCGCGTTCA TGGCGGCGAG CGCCGATGAG
CCGGCCGACC AGAACTTCGT CCGCAAGCAT TCGCTGGCCT ATCAGGCCGA GCACAATTGC
GACATGGAGA CCGCGTCACT GCGGGTGTTC GGCAACGCCG ACGGCGCCTA CGGTTCCAAC
GTCAACCATC TGGTCGAGAA CAGCCGCTGG GAAGACGAGG ACGAACTCGC CGAGACCTAT
ACGCGACGCA AGAGCTTCGC CTACGGACTG AAGGGTCAGC CGGTGCAGCG CGCCGATCTG
CTGAAGAGCG CGCTCGCCGA CGTCGACCTC GCTTATCAGA ATCTCGACTC GGTCGAACTC
GGCGTCACCA CCGTCGATCA CTATTTCGAC ACGCTGGGCG GCATCAGCCG CGCCGTGCGC
AAAGCCAAGG GCGGTCAGGC GGCGCCGGTC TATATCGGCG ACCAGACCCG CGGTGCCGGC
ACGGTGCGGA CGCTGTCGGA GCAGGTCGCT CTCGAGACTC GGACCCGGAT GCTCAATCCG
AAATGGTACG AAGGCATGCT CAAGCACGGC TACGAAGGCG TGCGGCAGAT CGAAGAGCAC
GTCACCAACA CCATGGGCTG GTCGGCCACC ACCGGTGAAG TCGCACCGTG GGTGTATCGC
CAGCTCACCG AAACCTTCGT GCTCGACCCC GAAATGCGGG AACGGCTCGC CTCGCTCAAT
CCGGTGGCGT CGGCGAAAGT CGCCAACCGC CTGATCGAGG CGCATGAACG CAATTACTGG
TCTCCGGACC CTGAAATGCT CGAGGTTCTG CGCAAGGCAG GCGAAGAGCT CGAGGATCGC
CTGGAAGGCG TGGGAGTGGC CGCATGA
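
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation such as GC content. The sketch below shows one way to do that in plain Python. The helper names are hypothetical, and for brevity only the first printed line of the sequence is used, so the resulting fraction is for those 60 bases only and is not expected to match the 66% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First printed line of the gene sequence (assumption: used only as a short sample).
first_line = "ATGCAAAAGC GCACTTCGCA CGCTGACAAG ACGCCAGTTC GTGTCGTCAT CGTTACTATG"
sample = clean_sequence(first_line)
print(len(sample), round(gc_fraction(sample), 3))
```

Passing the whole gene sequence through `clean_sequence` and `gc_fraction` in the same way would give the value to compare against the GC content field.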
 
Protein sequence
MQKRTSHADK TPVRVVIVTM DSHLSGAAAR ARDLLRRDYP GLELTVHSAD EWGTDDTALS 
RCHADIAAGD IVIATMLFLD DHVRAVMPAL QARRNDCDAL VCCMSAGEVV KLTRVGKFDM
SAEALGMINW LKKLRGKKTE GGAGKGEMKM LRQLPKLLRF IPGTAQDMRA YFLTLQYWLA
GSEANIANMV RLLIDRYASG PRKVLRGVAK VEPPVEYADI GVYHPKMKGR IAESVDKLPA
GPADAKGSVG VLLLRSYLLA GNSGHYDGML EAFEAKGLRV IPAFASGLDQ RPAIERFFMK
NGRPTVDAVV SLTGFSLVGG PAYNDSKAAE HILAELDVPY LSAHPVEFQT LEQWAASDRG
LMPVESTIMV AIPELDGSSG PMVYGGRSDG GDVACPGCDR FCKFDRNQTG GDMNICIERA
QMLASRTARL VALRRSERKD RKVAAVLFNF PPNAGNTGTA AFLGVFESLH NTLKAMKAEG
YTVEVPDSVD ALREAIINGN ASRFGAHANV HARVPAGDHV KNERWLREIE GQWGPAPGKQ
QSDGSSIFVL GERFGNVFVG VQPAFGYEGD PMRLLFEKGF APTHAFSAFY RWIRQDFGAH
AVLHFGTHGA LEFMPGKQTG LSGTCWPDRM IGDLPNMYIY ASNNPSEGAI AKRRSAATLI
SYLTPPVAHA GLYRGLLELK SSIERWRGLT PEEETERANL AVLVQAQASA LDLTPAEPAW
TAEEAGATIA KLADSVLEME YALIPHGLHV IGNVPSEEER VETLEAVADA MHGKRPDKAL
LEALVRGGHP EHLSGNGPEA EADLAMLHEL AGIDRILAED HEIPSILRAL DAKFIRPAPG
GDLLRTPAVL PTGRNLHGFD PFRIPSAFAL QDGAKQAQRL IDKHVAEGNP LPETVAIVLW
GTDNLKNEGA PIGQALALMG AKPRFDSYGR LAGADLIPLD ELKRPRIDVI ITMSGIFRDL
LPLQIKLLAE AAFMAASADE PADQNFVRKH SLAYQAEHNC DMETASLRVF GNADGAYGSN
VNHLVENSRW EDEDELAETY TRRKSFAYGL KGQPVQRADL LKSALADVDL AYQNLDSVEL
GVTTVDHYFD TLGGISRAVR KAKGGQAAPV YIGDQTRGAG TVRTLSEQVA LETRTRMLNP
KWYEGMLKHG YEGVRQIEEH VTNTMGWSAT TGEVAPWVYR QLTETFVLDP EMRERLASLN
PVASAKVANR LIEAHERNYW SPDPEMLEVL RKAGEELEDR LEGVGVAA
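
The protein sequence can be cross-checked against the gene sequence by translating the DNA codon by codon. The sketch below is a self-contained illustration using only the standard library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons). The `translate` function is hypothetical, not a database tool, and it stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout, same for tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last printed lines of the gene sequence above.
n_terminal = translate("ATGCAAAAGC GCACTTCGCA CGCTGACAAG ACGCCAGTTC GTGTCGTCAT CGTTACTATG")
c_terminal = translate("CTGGAAGGCG TGGGAGTGGC CGCATGA")
print(n_terminal, c_terminal)
```

The two translated fragments correspond to the first 20 and last 8 residues of the protein sequence listed above. For real work, an established library such as Biopython would be a more robust choice than this hand-built table.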