Gene RPD_3754 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3754 
Symbol 
ID4024270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4190097 
End bp4191548 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content64% 
IMG OID637963958 
Productchlorophyllide reductase subunit Z 
Protein accessionYP_570876 
Protein GI91978217 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01278] light-independent protochlorophyllide reductase, B subunit
[TIGR02014] chlorophyllide reductase subunit Z 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00589159 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTTGTCC TGGATCATGA TCGCGCCGGC GGTTATTGGG GCGCCGTCTA TGCCTTCACC 
GCGGTGAAGG GCCTGCAGGT GATCATCGAC GGCCCGGTCG GCTGCGAGAA CCTGCCGGTC
ACCTCGGTGC TGCATTACAC CGACGCGCTG CCCCCGCACG AATTGCCGAT CGTCGTCACC
GGTCTCGGCG AAGACGAACT CGGCAAGCTC GGCACCGAAG GCGCGATGAA GCGCGCGCAC
CGCACGCTCG ACCCGTTCCT GCCTGCCGTG GTGGTGACAG GTTCGATCGC CGAAATGATC
GGCGGCGGCG TCACGCCCGA AGGCACCAAC ATCAAGCGCT TCCTGCCGCG CACCATCGAC
GAAGACCAGT GGCAGAGTGC TGACCGCGCC ATCGTCTGGC TGTGGAAAGA ATACGGCCCG
AAGAAGATTC CGGAGCGCAA GCCGCTGTCG CCGGACGTCA AGCCGCGGGT GAACATCATC
GGCCCGATCT ACGGCACTTT CAACATGCCG TCCGACCTCG CGGAAATCCG CCGCCTGATC
GAAGGCATCG GCGCCGAAGT CAACATGGTG TTTCCGCTCG GCTCGCACCT CGCCGATATT
CCGAAGCTGG TGAATGCCGA CGTCAACGTC TGCATGTACC GCGAGTTCGG CCGCCTGCTG
TGCGAGGCGC TGGAGCGGCC CTATCTGCAG GCGCCGATCG GGTTGCATTC AACCACGCGC
TTCCTGCGCA AGCTCGGCGA GCTCACGGGT CTCGATCCGG AGCCGTTCAT CGAGCGTGAG
AAGAACACCA CGATCAAGCC GCTGTGGGAC CTGTGGCGCT CGGTGACCCA GGACTTCTTC
GGCACTGCGA GCTTTGCGGT CGTCGCCACT GATACTTATG CCCGCGGCGT GCGAAATTTC
CTCGAGACGG AAATGGGCCT GCCGTGCACC TTCGCAGTGT CGCGCAAGGC CGGCGTGAAG
CCGGACAATG ACGCGGTTCG CACCGCGATT CGGCAGACTC CGCCGCTGAT TATGTTCGGT
AGCTACAACG AAAGAATGTA CCTCGCCGAA TCGGGCTCGC GCGCGATCTA CATCCCGGCG
TCGTTTCCGG GCGCGGTGAT CCGCCGCCAT CTCGGTACGC CGTTCATGGG CTACTCGGGC
GCGACCTATC TGGTGCAGGA AGTGTGCAAC GCGCTGTTCG ATGCGCTGTT CAACATCCTG
CCGCTCGGCA GCGATCTCGA TCGCGTCGAT CCGACTCCGG CGCGTCGTCA CGAAGAGCTG
CTCTGGAGCG ACGAAGCCAA GGCGCTGCTC GACGAAGTGC TCGAGGCGCA TCCGGTGCTG
GTGCGGATTA GCGCAGCAAA GCGTTTGCGC GACGCAGCTG AAAACAGCGC GCGCCGTGCC
GGCCAAGAGC AGGTGACGAA AGAATTTGTC AGTAAAGCAC GTGCGGCGCT CTTGGATGGG
CAGTCGGCGT GA
 
Protein sequence
MLVLDHDRAG GYWGAVYAFT AVKGLQVIID GPVGCENLPV TSVLHYTDAL PPHELPIVVT 
GLGEDELGKL GTEGAMKRAH RTLDPFLPAV VVTGSIAEMI GGGVTPEGTN IKRFLPRTID
EDQWQSADRA IVWLWKEYGP KKIPERKPLS PDVKPRVNII GPIYGTFNMP SDLAEIRRLI
EGIGAEVNMV FPLGSHLADI PKLVNADVNV CMYREFGRLL CEALERPYLQ APIGLHSTTR
FLRKLGELTG LDPEPFIERE KNTTIKPLWD LWRSVTQDFF GTASFAVVAT DTYARGVRNF
LETEMGLPCT FAVSRKAGVK PDNDAVRTAI RQTPPLIMFG SYNERMYLAE SGSRAIYIPA
SFPGAVIRRH LGTPFMGYSG ATYLVQEVCN ALFDALFNIL PLGSDLDRVD PTPARRHEEL
LWSDEAKALL DEVLEAHPVL VRISAAKRLR DAAENSARRA GQEQVTKEFV SKARAALLDG
QSA