Gene RPB_2287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2287 
Symbol 
ID3909668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2643180 
End bp2645246 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content65% 
IMG OID637884184 
Productmethyltransferase type 11 
Protein accessionYP_485903 
Protein GI86749407 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1232] Protoporphyrinogen oxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0245071 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.708351 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGACC TTGCGATCGT TGGTGGCGGC CCGGGCGGAC TGATGAGCGC CTGGTACCTG 
AAGAAGAAGC TGGGCCCGCT GTGCCGGGTG ACGATTTTCG AGGCGTCCGA CCGGCTCGGC
GGCAAGATCG TGTCCCGCAC CTTCGACACC GCGCCGGCGC TGTACGAGGC CGGCGTCGCC
GAATTGTACG ACTACTCGAT GACCGGGCCG GACCCGCTGC GCGAACTGGT GCAGCATTTC
GGGCTGCAGA CGATTCCGAT GGACGCCGAG CAGGTCCAGC TCGACGGCGA ATTGCTCGAC
GACGTGCCGG GGATGCGCCG GAAGTACGGC GACAAGACCG CCGACGCCAT TCTCGCCTTC
CGCAAGACCT GCAGCGAGAT GGTGACGCCG CTGGAGTACT ACGAGGGCGT CGGCGCGCAC
GACAACGAGC ATCCCTGGGC CTGGACCAAT TGCGAGCAAC TGCTCGACAA GGAGATCGAC
GACCCGGTCG CCAAGCGTTT CTTCAAGGTG ATGGCGCGGT CCGACATCGC CACCGAGAGC
CACAACACCA ACGGCCTCAA CGCGCTGAAG AATTTCGTGA TGGATATCGA CGGCTATATC
GGCCTGTATT CGATCCAGAA CGGCAACGAG CAACTGATCG AGGGGCTGCG CTCCGAGGTC
GACGCCGAGA TCCAGCTCAA CCACCGCATT CTCAAGGTCG GCAAGACCGC GAGCGGCCGC
TACGAGCTGA ACATGATGAA CGGCAAGGGC CCGGAGACGC GCGACTTCGA TCTCGTGCTG
ATGTGCCTGC CGCACAACTG GCTCGCGACG CTCGGCTGGG GCGACGAGCA GCTGCGCAAG
GCCATGGTCA AGCACGTCGC CTATTTCGAC CGGCCGGCGC ACTATCTGCG GATCTCGCTG
CTGTTCGACA GCCCGTTCTG GGGCGACAAG ATCCCCGGCG CCTGGTTCAT GTCGGAGGCG
TTCGGCGGCT GCTGCGTCTA CAACGAGGGC GCGCGCCACG ACGTCGGCAA ATACGGCGTG
CTGAACTTCC TGGTCGCCGG CTCCGACGCG CTGGCGTTCT CGAATCTGAC CGACGGCGAA
CTGACCGACC TGGCGCTGAA GTCGCTGCCG GCGGCATTCG GCGACGCGCG CGAGCATTTC
ATGGAAGGCA AGACCCATCG CTGGCTGTCG TCGGTGAATT GCATCCCCGG CGGCCTGCCG
GTGCGCGACG TGATGACCAA TCACCGGCCC GAGCCGAAGA ATCATCCGGG CTTCGTCGTG
GTCGGCGACT ATCTGTTCGA TTCGACGCTG AACGGCCTGC TGGATTCCTC CGACGCCGCC
ACCGACATCA TCGTCACCGA GACGATGAAG CTGCGCCGCG CCCGCGCGCT CGCGGGCCAG
CCCGGGTCGG ACAAGATCGA CGCCTCGTAT TTTGCCAATT ATCGCAACCA GGGGCCGTAT
GCCGAGGTCT GGAGCCGGTT CACCGACCCC GACTATCTGC TGGCGATGAT CAGGACGGTC
TGGGACGTGC CGCAGGGCGT CAAACTGCTG GTCGCGGGCT CCGCCAGCGG CGAGCTGGTC
GGCGCGCTGC GCGAGCGCGG CATCGATGCC TGGGGCGTCG ACAACAATCG CGGCATTCTC
GCCAAGACGC CGGAGGCGCT GAAGCCGTTC AACCAGTTCG GCTCGATCGT CGACCTGCCG
TTCGAGGACG CTTCGTTCGA CCTCGTGTTC GAGACCAGCC TGTGCCACGT CCCGGAGAAC
CGCGTGGTCA AGGCGATCCG CGAACTCAAC CGGGTGACGC GCACCGGCCT GGTGTTCGGC
TCGGTGACCA GCGACATGGA ACCCATCGTG ATCGACGACT ACGATCTGCT GCGCGGCGTC
AAGAAGCTCG GCACCTGGTG GGAGTGGTCG GAACTGTTCT TCAGCAACGG CTTCGATCTG
TCGATGCATC GCAACGACGT CAGCGACAAG CTCTGGGAAG CGACGCTGAA GGCCGGCAAG
GGTCCGGGCC AGTGGTACGG CGACGCGGAC AGCCTGCGCT ACTCGTTCTT CGACAAGGTC
GACGACGACG AAGACGACGA AGACTGA
 
Protein sequence
MFDLAIVGGG PGGLMSAWYL KKKLGPLCRV TIFEASDRLG GKIVSRTFDT APALYEAGVA 
ELYDYSMTGP DPLRELVQHF GLQTIPMDAE QVQLDGELLD DVPGMRRKYG DKTADAILAF
RKTCSEMVTP LEYYEGVGAH DNEHPWAWTN CEQLLDKEID DPVAKRFFKV MARSDIATES
HNTNGLNALK NFVMDIDGYI GLYSIQNGNE QLIEGLRSEV DAEIQLNHRI LKVGKTASGR
YELNMMNGKG PETRDFDLVL MCLPHNWLAT LGWGDEQLRK AMVKHVAYFD RPAHYLRISL
LFDSPFWGDK IPGAWFMSEA FGGCCVYNEG ARHDVGKYGV LNFLVAGSDA LAFSNLTDGE
LTDLALKSLP AAFGDAREHF MEGKTHRWLS SVNCIPGGLP VRDVMTNHRP EPKNHPGFVV
VGDYLFDSTL NGLLDSSDAA TDIIVTETMK LRRARALAGQ PGSDKIDASY FANYRNQGPY
AEVWSRFTDP DYLLAMIRTV WDVPQGVKLL VAGSASGELV GALRERGIDA WGVDNNRGIL
AKTPEALKPF NQFGSIVDLP FEDASFDLVF ETSLCHVPEN RVVKAIRELN RVTRTGLVFG
SVTSDMEPIV IDDYDLLRGV KKLGTWWEWS ELFFSNGFDL SMHRNDVSDK LWEATLKAGK
GPGQWYGDAD SLRYSFFDKV DDDEDDED