Gene RPB_3992 details


Gene Information

Locus tag: RPB_3992
Symbol:
ID: 3911799
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4559201
End bp: 4560121
Gene Length: 921 bp
Protein Length: 306 aa
Translation table: 11
GC content: 68%
IMG OID: 637885896
Product: bacteriochlorophyll/chlorophyll a synthase
Protein accession: YP_487596
Protein GI: 86751100
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0382] 4-hydroxybenzoate polyprenyltransferase and related prenyltransferases
TIGRFAM ID: [TIGR01476] bacteriochlorophyll/chlorophyll synthetase
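
The gene and protering lengths listed in this record can be cross-checked against the coordinates. The following is a minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length counts the stop codon. Neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record: Start bp 4559201, End bp 4560121.
START_BP = 4559201
END_BP = 4560121

def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

def protein_length(gene_len_bp):
    """Length in aa, assuming the CDS ends with one stop codon."""
    codons = gene_len_bp // 3
    return codons - 1  # the stop codon is not translated

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 921
print(protein_length(length_bp))  # 306
```

Both results agree with the Gene Length (921 bp) and Protein Length (306 aa) values in the record.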


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.243579
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAGA CCGATGCGAC GCGGATGAGT AACGCCATGA TCACGCGTCC GGCGCCATCG 
GCGGTGCTCG AGGTGCTTCA CCCCATCACC TGGTTTCCGC CGATGTGGGC GTTCGCCTGC
GGCGTGGTGT CGTCGGGCGT GCCGATCTCC GCCCGATGGC CGGAAGTGAT CGCCGGCATC
GTGCTGTGCG GGCCGCTCCT GGTCGCCAGC AGCCAAGTGG TCAACGACTG GTTCGACCGC
GACGTCGACG CCATCAACGA ACCCAACCGG CCGATCCCGT CGGGGCGGAT CCCGGGACGC
TGGGGGCTGT ATCTGTCCTA TCTCTGGACC GGCGCATCGC TGCTGCTCGC GAGCCAGCTC
GGAGTCTGGG TGTTCGGCGC CGCCGCGCTC GGGCTGGTGC TGGCCTGGAT GTATTCGATG
CCGCCGTTCC GGCTGAAGCA GAATGGCTGG CTCGGCAACG GCGCCTGCGC CATCACCTAT
GAAGGCTTCG CCTGGTTCAC CGGCGCCGCG GTGATGCTCG GCGGGCTGCC GCCGTGGTGG
ATCGTGACTT TGGCTCTGCT CTACAGCGCC GGCGCGCACG GCATCATGAC GCTCAACGAC
TTCAAGTCGA TCGAGGGCGA TATCAAGACC GGGGTCGGCT CGCTGCCGGT CAAGCTCGGG
GTCGACAACG CGGCCCGCGT CGCCTGCGCC GTGATGGCGA TCCCGCAGGT GATCGTCATC
GCGCTGCTGC TGGGGTGGGA CCGCCCGATT CAGGCCGCCA TCGTCGGCCT GGTGCTGGCG
GTGCAACTCG GCCTGATGGT GCGGTTTCTC CGCGCGCCGG TCGAGCGCGC CACCTGGTTC
AGCGGCCTCG GTGTCGCGCT CTATGTCATG GGCATGATGG CGAGTGCGGT CGCGGTCTCG
CCGTTCGGAC CGTCCGCATG A
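
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any analysis. The sketch below does that and computes a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the plain (G + C) / length formula is an assumption, since the record does not state how its GC content figure was derived.

```python
def clean_sequence(raw):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()

def gc_fraction(raw):
    """Fraction of G and C bases, assumed here to be (G + C) / length."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)

# Usage: paste the full gene sequence block in place of this short example.
example = "ATGC ATGC\nGGCC"
print(clean_sequence(example))  # ATGCATGCGGCC
print(gc_fraction("ATGC"))      # 0.5
```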
 
Protein sequence
MNKTDATRMS NAMITRPAPS AVLEVLHPIT WFPPMWAFAC GVVSSGVPIS ARWPEVIAGI 
VLCGPLLVAS SQVVNDWFDR DVDAINEPNR PIPSGRIPGR WGLYLSYLWT GASLLLASQL
GVWVFGAAAL GLVLAWMYSM PPFRLKQNGW LGNGACAITY EGFAWFTGAA VMLGGLPPWW
IVTLALLYSA GAHGIMTLND FKSIEGDIKT GVGSLPVKLG VDNAARVACA VMAIPQVIVI
ALLLGWDRPI QAAIVGLVLA VQLGLMVRFL RAPVERATWF SGLGVALYVM GMMASAVAVS
PFGPSA
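
The protein sequence corresponds to translating the gene sequence with translation table 11. The following is a minimal pure-Python sketch, not the database's own method. It builds the codon table from the NCBI amino-acid ordering shared by tables 1 and 11 (the two differ only in their permitted start codons). The name `translate` is illustrative, and the sketch does not remap alternative start codons such as GTG or TTG to M, which is not needed here because this gene starts with ATG.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAATAAGA CCGATGCGAC GCGGATGAGT"))  # MNKTDATRMS
```

Applied to the final line of the gene sequence (CCGTTCGGAC CGTCCGCATG A), the same function returns PFGPSA, which matches the last line of the protein sequence.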