Gene RPB_4000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4000 
Symbol 
ID3911807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4566063 
End bp4567664 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content67% 
IMG OID637885904 
Productchlorophyllide reductase subunit Y 
Protein accessionYP_487604 
Protein GI86751108 
COG category 
COG ID 
TIGRFAM ID[TIGR02015] chlorophyllide reductase subunit Y 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.130756 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00873228 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACGTTA TGCCCCGCTA TTCCGGTCCG ATCGGCGAAT CGATCGTCAG CGATCCTGCT 
GATCTCGATG TCGCGTCGGA GTCCGACGGC GCAACGCCCG CGATCAATGC GGCGGCGACC
AAGGCCGACG GCCTCGGCTG CCACGCCGGC GCCGCCGAAA TGAAGGCGGC CGCCGAAGCC
GCGGGCAAGA GCGAGATTCT CGATCGCTAC GCCGCGGACT ATCCGAAGGG GCCGCACGAC
CAGCCGCAGA GCATGTGCCC GGCGTTCGGC TCGCTGCGCG TCGGCCTGCG GATGCGGCGC
ACCGCGACGA TCCTGTCCGG CTCGGCCTGC TGCGTCTACG GCTTGACCTT CGTGTCGCAT
TTCTACGGCG CGCGTCGCAC GGTCGGCTAC GTGCCGTTCA ATTCGGAGAC GCTGGTCACC
GGCAAGCTGT TCGAGGACAT CCGCGACGCA GTGTTCAAGC TCGCCGATCC CGAACATTAC
GACACCATCA TCATCACCAA TCTGTGCGTG CCGACCGCCT CGGGCGTGCC GCTCGATCTA
CTGCCGAACG AGATCAACGG CGTGCGCATC ATCGGCATCG ACGTGCCGGG CTTCGGCGTG
CCGACCCATG CCGAGGCCAA GGACGTGCTG GCCGGGGCGA TGCTGGAATA CGCCCGCAAG
GAAGCCGAAC AGGGCCCGGT GCAGGCGCCG CGCGGCGGTC GCAGCGAGCG CCCCACGGTG
ACGCTGCTCG GCGAGATGTT CCCGGCCGAT CCGGTTGGTA TCAACATGAT GCTGGAGCCG
CTCGGCCTCG CGGCCGGGCC GGTGGTGCCG ACCCGCGAAT GGCGCGAGCT GTATGCGGCG
CTCGACTGCC AGGTCGTCGC CGCGATCCAT CCGTTCTACA AGGCCTGCGT CCGCCAGTTC
GATCTCGCCG GCCGCAAGAC CGTCGGCTCC GCGCCGGTCG GTCATGACGG CACCGAGACC
TGGCTGGAAG CGATCGGCAC CGCCTGCAAC GTCTCGCGCG ACAAGATCGA CGCCGCCAAG
AATCGTTTTC TGCCGGCCAT CAAGGCGGCG CTCGCTGCCA AGCCGATCGA TGCGCGGATC
ACGGTGTCCG GCTATGAGGG CTCCGAGCTT CTGGTGGCGC GCCTGCTGAT CGAAAGCGGC
GCGCGCGTGC CTTATGTCGG CACCGCGGTG CCGCGCACGC CATGGTCCGA CCTCGATCGC
GAATGGCTCG AGGCCAAGGG CGTGCGGGTG CAGTACCGCG CCTCGCTGGA GCAGGACTTC
GCGGCGGTCG ACGAATTCAA GCCGGATCTG GCGATCGGCA CCACGCCCGT GGTGCAGAAG
GCCAAGACCA TGGCGATCCC GGCGCTGTAC TTCACCAATC TGATCTCGGC GCGGCCGCTG
TTCGGCGTCG CCGGTGCCGG TTCGCTGGCG CAGGTGATCA ACGCGGCGCT CGCCAACCAG
GCGCGCTTCG ACGAAATGAA AGAGTTCTTC GGCGACGTCG GGCAGGGTTA TGCGGCCGGC
GTCTGGGAAG ATACGCCGAA GGACAACCCG GCGTTCCGCG AGAAATACAG GAAGCAGATC
GAAGCCGCAG CCAAGAAGCG CAAGGCGGAG GAGATGATCT GA
 
Protein sequence
MNVMPRYSGP IGESIVSDPA DLDVASESDG ATPAINAAAT KADGLGCHAG AAEMKAAAEA 
AGKSEILDRY AADYPKGPHD QPQSMCPAFG SLRVGLRMRR TATILSGSAC CVYGLTFVSH
FYGARRTVGY VPFNSETLVT GKLFEDIRDA VFKLADPEHY DTIIITNLCV PTASGVPLDL
LPNEINGVRI IGIDVPGFGV PTHAEAKDVL AGAMLEYARK EAEQGPVQAP RGGRSERPTV
TLLGEMFPAD PVGINMMLEP LGLAAGPVVP TREWRELYAA LDCQVVAAIH PFYKACVRQF
DLAGRKTVGS APVGHDGTET WLEAIGTACN VSRDKIDAAK NRFLPAIKAA LAAKPIDARI
TVSGYEGSEL LVARLLIESG ARVPYVGTAV PRTPWSDLDR EWLEAKGVRV QYRASLEQDF
AAVDEFKPDL AIGTTPVVQK AKTMAIPALY FTNLISARPL FGVAGAGSLA QVINAALANQ
ARFDEMKEFF GDVGQGYAAG VWEDTPKDNP AFREKYRKQI EAAAKKRKAE EMI