Gene RPB_2687 details

Gene Information

Locus tag: RPB_2687
Symbol:
ID: 3910480
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3071354
End bp: 3072880
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 66%
IMG OID: 637884587
Product: magnesium-protoporphyrin IX monomethyl ester anaerobic oxidative cyclase
Protein accession: YP_486300
Protein GI: 86749804
COG category: [C] Energy production and conversion
COG ID: [COG1032] Fe-S oxidoreductase
TIGRFAM ID: [TIGR02026] magnesium-protoporphyrin IX monomethyl ester anaerobic oxidative cyclase
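
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and that the final
    codon is a stop codon, which does not contribute a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(3071354, 3072880)
    print(length, "bp")                      # gene length from the coordinates
    print(protein_length_aa(length), "aa")   # residues, excluding the stop codon
```

Under these assumptions the coordinates reproduce the listed gene length of 1527 bp, and 1527 / 3 = 509 codons minus one stop codon gives the listed 508 aa.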


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTATCC TTCTGGTCAA CGTTCCCCAT CCCGCCATCG GCAGCCTGAT TCCGAGCGAT 
CACTTGCCGC CGCTGGGCCT GCTGGCGATC GGCGGACCGC TGATTGATGA CGGCCATGAC
GTGCGTTTGC TTGATGCCGA GTTCGGCCCG ACGTCGACCG CGCAGATCGT CGGACAGGCG
CGCGACTTCC GTCCCGATGC GGTGCTGTTC GGCCATTCCG GGTCGACCTC CGGCCACCCG
GTCATCGCCG AAGTCGCACA GGCCATCGCG CATGCCATTC CCGGCACGCG CATTGTCTAT
GGAGGTGTGT TTCCGACCTA CCACTGGCGG GAGATCCTCG ACGCCGAGCC TTACGTCACG
GCCATCGTGC GTGGGGAGGG CGAGGAGACG GCGCGGCGCT TGATGACCGC ACTCGCTGAT
GGCGATGATC TCGCGGGCGT CCATGGGATT GCCTATCGCA GAGCCGGGCA AGCCTGCGCG
ACGCCGCCGG CCGTGGTGAT CGGAGATCTC GACGCCTATC GGATCGGTTG GGAACTGATC
GACCACGCTC GCTACAGCTA TTGGGGCGGA CTGCGCGCCG TCGTGGTGCA ATTCTCGCGA
GGCTGCCCAC ATCTGTGCAG TTACTGCGGA CAACGCGGCT TCTGGACGCG CTGGCGGCAC
CGCGATCCCG TGCTGTTCGC CAAGGAGCTC GCGCGGCTGC ATCGGGAGCA GGGCGTCCGG
GTCGTCAATT TCGCCGACGA GAACCCGACG GTCTCGAAGA AGGTGTGGCA GACGTTCCTC
GAGGCGTTGA TCGCGGAGGA GGTCGACCTG ATCCTGGTGG GGTCGACCAG GGCCGACGAC
ATCGTCCGCG ACGCCAATAT CTTGCATCTG TACAAGCAGG CCGGCTGGGA TCGCTTCCTG
CTCGGCCTGG AAAACACCGA CGACGCCACG CTGGCGCTGA TCCGCAAGGG CGCGGCAACG
CCCACCGATC GCGAGGCCAT TCGGCTGCTG CGTCGGCACG GCATCCTATC GATGGCCACC
TGGGTGGTCG GCTTCGTCGA GGAGACCGAC CGCGATCACT GGCGCGGGCT GCGCCAGCTT
CTCTCGTACG ACCCGGACCA GATTCAGATG CTGTACGCGA CGCCGCACCG CTGGACGCCA
TATTTCGGGC AGGCGGCCGA ACGCCGGGTG ATCACGACTG ACCGGCGGCA CTGGGACTAC
AAGCATCAGG TCCTCGCCAA TCGCAACATG CCGCCGTGGC GCGTCCTGCT CTGGTTCAAG
TTCACCGAGC TGGTGCTTCA AGCCCGCCCG AAGGCGATGT TTCGCACCTT CTTCGAGCGC
CGCGGGCGCC TGCGTCATGC CATGCAATGG TACACGCGGA TCGGACGCCG GGTCTGGCCC
TACGAGATCT GGCAGTTCCT GCGAGCCCGG CATTTGAAGA CCGGACCGAC CGTCGGCGAA
TTCTGGGGCG ACGGCCAAGT GGTCGATGAG AACGCGATGG CCACATCGCG GCAACGACGC
CAGCTTCCCA ATCAAAGCGC CGCCTGA
 
Protein sequence
MRILLVNVPH PAIGSLIPSD HLPPLGLLAI GGPLIDDGHD VRLLDAEFGP TSTAQIVGQA 
RDFRPDAVLF GHSGSTSGHP VIAEVAQAIA HAIPGTRIVY GGVFPTYHWR EILDAEPYVT
AIVRGEGEET ARRLMTALAD GDDLAGVHGI AYRRAGQACA TPPAVVIGDL DAYRIGWELI
DHARYSYWGG LRAVVVQFSR GCPHLCSYCG QRGFWTRWRH RDPVLFAKEL ARLHREQGVR
VVNFADENPT VSKKVWQTFL EALIAEEVDL ILVGSTRADD IVRDANILHL YKQAGWDRFL
LGLENTDDAT LALIRKGAAT PTDREAIRLL RRHGILSMAT WVVGFVEETD RDHWRGLRQL
LSYDPDQIQM LYATPHRWTP YFGQAAERRV ITTDRRHWDY KHQVLANRNM PPWRVLLWFK
FTELVLQARP KAMFRTFFER RGRLRHAMQW YTRIGRRVWP YEIWQFLRAR HLKTGPTVGE
FWGDGQVVDE NAMATSRQRR QLPNQSAA
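
The gene and protein sequences above are related through translation table 11, and the GC content field is a simple base count. The following is a minimal sketch of both calculations, written to accept the blocks-of-ten layout used on this page. The helper names are hypothetical. The codon assignments follow the standard NCBI ordering, which table 11 shares with the standard code. Alternative start codons, which table 11 reads as methionine at initiation, are not handled here because this gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same codon-to-residue assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Strip spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_30_nt = "ATGCGTATCC TTCTGGTCAA CGTTCCCCAT"
    print(translate(first_30_nt))  # first ten residues of the protein
```

Passing the full gene sequence to `translate` and `gc_fraction` is one way to cross-check the protein sequence and the GC content field against the nucleotide record.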