Gene RPB_3999 details


Gene Information

Locus tag: RPB_3999
Symbol:
ID: 3911806
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4564612
End bp: 4566063
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 65%
IMG OID: 637885903
Product: chlorophyllide reductase subunit Z
Protein accession: YP_487603
Protein GI: 86751107
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01278] light-independent protochlorophyllide reductase, B subunit
            [TIGR02014] chlorophyllide reductase subunit Z
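
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(4564612, 4566063)
length_aa = protein_length(length_bp)
print(length_bp, "bp,", length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.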


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.464812
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00738947
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCTTGTCC TCGACCATGA TCGCGCCGGC GGCTATTGGG GCGCCGTCTA TGCCTTCACC 
GCGGTGAAGG GCCTGCAGGT GATCATCGAC GGCCCGGTCG GCTGTGAAAA CCTGCCGGTG
ACCTCGGTGC TGCACTACAC CGACGCGCTG CCGCCGCACG AACTGCCGAT CGTCGTGACC
GGCCTCGGCG AAGAAGAGCT CGGCAAGCTC GGCACCGAAG GCGCCATGAA GCGCGCGCAT
CGCACGCTCG ACCCGTTCAT GCCCGCGGTC GTGGTGACCG GTTCGATCGC CGAGATGATC
GGCGGCGGCG TGACGCCCGA AGGCACCGGC ATCAAGCGCT TCCTGCCGCG CACCATCGAC
GAAGACCAGT GGCAGAGCGC CGATCGCGCG ATCTCTTGGC TGTGGAAAGA ATACGGCCCG
AAGAAGATTC CGGAGCGCAA GCCGCTGTCG CCGGACGTCA AGCCGCGGGT CAACATCATC
GGCCCGATCT ACGGCACGTT CAACATGCCG TCCGATCTCG CGGAAATCCG CCGCCTGATC
GAGGGCATCG GCGCCGAAGT CAACATGGTG TTTCCGCTCG GGACGCATCT GTCCGATATC
CCGAAGCTGG TGAACGCCGA CGTCAACGTC TGCATGTATC GCGAGTTCGG CCGGCTGCTG
TGCGAAACCT TGGAGCGGCC GTATCTCCAG GCGCCGATCG GACTGCATTC GACGACGCGC
TTCCTGCGCA AGCTCGGCGA ACTCACCGGT CTCGATCCGG AGCCGTTCAT CGAGCGCGAG
AAGAACACCA CGATCAAGCC GTTGTGGGAC CTTTGGCGCT CGGTGACGCA GGACTTCTTC
GGCACCGCCA GCTTCGCGAT CGTCGCGACC GACACGTACG CCCGCGGTGT GCGGCATTTC
CTCGAAGAGG AAATGGGCCT GCCGTGCGCC TTCGCGATGT CGCGCCGGGC CGGCGTCAAG
CCGGACAACG ACGCGGTGCG GACCGCGATC CGGCAGACCC CGCCGTTGAT CATGTTCGGA
AGCTACAACG AAAGAATGTA CCTCGCCGAA TCCGGCTCAC GCGCGATCTA CATCCCGGCG
TCGTTTCCCG GCGCGGTGAT CCGCCGCCAT CTCGGCACGC CCTTCATGGG ATACTCCGGC
GCGACCTATC TGGTGCAGGA GGTCTGCAAC GCGCTGTTCG ACGCGCTGTT CAACATCCTG
CCGCTCGGCA GTGATCTCGA TCGGGTCGAT CCGACCCCGG CGCGCCGTCA CGAAGAGCTG
CTCTGGAGTG ACGAGGCCAA GGCGCTGCTC GACGAGGTTC TCGAAGCTCA TCCGGTGCTG
GTGCGAATTT CCGCGGCGAA GCGCTTGCGC GACGCAGCCG AGAATAGCGC GCGCCGCGCC
GGCCAGGAGC GTGTGACCGA AGAATTCGTC ACGAAAGCGC GTGCAGCGCT GATGGACGGG
CAGACTGTGT AA
 
Protein sequence
MLVLDHDRAG GYWGAVYAFT AVKGLQVIID GPVGCENLPV TSVLHYTDAL PPHELPIVVT 
GLGEEELGKL GTEGAMKRAH RTLDPFMPAV VVTGSIAEMI GGGVTPEGTG IKRFLPRTID
EDQWQSADRA ISWLWKEYGP KKIPERKPLS PDVKPRVNII GPIYGTFNMP SDLAEIRRLI
EGIGAEVNMV FPLGTHLSDI PKLVNADVNV CMYREFGRLL CETLERPYLQ APIGLHSTTR
FLRKLGELTG LDPEPFIERE KNTTIKPLWD LWRSVTQDFF GTASFAIVAT DTYARGVRHF
LEEEMGLPCA FAMSRRAGVK PDNDAVRTAI RQTPPLIMFG SYNERMYLAE SGSRAIYIPA
SFPGAVIRRH LGTPFMGYSG ATYLVQEVCN ALFDALFNIL PLGSDLDRVD PTPARRHEEL
LWSDEAKALL DEVLEAHPVL VRISAAKRLR DAAENSARRA GQERVTEEFV TKARAALMDG
QTV
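
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the tool that produced this record. It builds the codon table from the standard amino acid ordering, which translation table 11 shares with table 1 for codon-to-residue assignments (the two differ in their permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGCTTGTCC TCGACCATGA TCGCGCCGGC"))
# Last 12 bases of the gene sequence above, including the TAA stop codon.
print(translate("CAGACTGTGT AA"))
```

The two fragments translate to MLVLDHDRAG and QTV, matching the first ten and last three residues of the protein sequence listed above.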