Gene Information |
Locus tag | RPB_4001 |
Symbol | |
ID | 3911808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4567690 |
End bp | 4568688 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637885905 |
Product | chlorophyllide reductase iron protein subunit X |
Protein accession | YP_487605 |
Protein GI | 86751109 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1348] Nitrogenase subunit NifH (ATPase) |
TIGRFAM ID | [TIGR02016] chlorophyllide reductase iron protein subunit X |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.18225 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00239808 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGAACGTCG TTCCGCAGAT CAACCTGCAA GACGCGCAAC TCCGGGCCGA GGCGGCAATC GAGCCCGACG CGCCGCTGAC GACTCCCGTG ACCAAGGAAA CCCAGATCAT CGCGATCTAC GGCAAGGGCG GTATCGGCAA GAGCTTCACG CTCTCCAACC TGTCCTACAT GATGGCGCAG CAGGGCAAGA AAGTGCTGCT GATCGGCTGC GATCCGAAGA GCGATACGAC ATCGCTGCTG TTCGGCGGCA AGGCCTGTCC GACCATCATC GAGACGTCTT CGAAGAAGAA GCTTGCCGGC GAGGAAGTGC AGATCGGCGA CGTCTGCTTC AAGCGCGACG GCGTGTTCGC GATGGAGCTC GGCGGCCCGG AAGTCGGCCG CGGTTGTGGC GGCCGTGGCA TCATTCACGG CTTCGAGACG CTCGAAAAGC TCGGCTTCCA CGAATGGGGC TTCGACTACG TGCTGCTCGA TTTCCTCGGC GACGTGGTGT GCGGCGGCTT CGGCCTGCCG ATCGCCCGCG ACATGTGCCA GAAGGTGATC ATCGTCGGCT CCAACGATCT GCAGTCGCTG TACGTCGCCA ACAACGTCTG CTCCGCGGTT GAATATTTCC GCAAGCTCGG CGGCAATGTC GGCGTCGCCG GTCTGGTGAT CAACAAAGAT GACGGCACCG GCGAGGCGCA GGCCTTCGCC GAAGCGGCCG GCATTCCGGT GCTGGCGGCG ATTCCCGCCG ATGACGACAT CCGCAGGAAG AGCGCCAATT ACGAAATCAT CGGCCTGCCG GACGGGGAGT GGGGTCCGCT GTTCGCGGAG CTGGCCGCCA ACGTCGCCAC CGCGCCGCCG GTACGTCCGA AGCCGCTCAC CCAGGACGGG CTGCTCGGCC TGTTCTCCAG TGACGTGACC GGCCGCGATG TCGTGCTGCT GCCCGCCACC ATGGAAGACA TGTGCGGCGC CGCGGTGCTG AACAAGCCGT CGCTCGAAGT GATCTACGAC GCGGTTTGA
|
Protein sequence | MNVVPQINLQ DAQLRAEAAI EPDAPLTTPV TKETQIIAIY GKGGIGKSFT LSNLSYMMAQ QGKKVLLIGC DPKSDTTSLL FGGKACPTII ETSSKKKLAG EEVQIGDVCF KRDGVFAMEL GGPEVGRGCG GRGIIHGFET LEKLGFHEWG FDYVLLDFLG DVVCGGFGLP IARDMCQKVI IVGSNDLQSL YVANNVCSAV EYFRKLGGNV GVAGLVINKD DGTGEAQAFA EAAGIPVLAA IPADDDIRRK SANYEIIGLP DGEWGPLFAE LAANVATAPP VRPKPLTQDG LLGLFSSDVT GRDVVLLPAT MEDMCGAAVL NKPSLEVIYD AV
|
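The numeric fields in this record can be cross-checked against one another and against the listed sequences. The sketch below is illustrative only and is not part of the database. The helper names (`clean`, `gene_length`, `protein_length`, `gc_content`, `translate`) are hypothetical. It assumes that the coordinates are 1-based and inclusive, that the gene length includes the stop codon, and that the gene sequence is given in coding orientation (it starts with ATG and ends with TGA even though the gene lies on the minus strand). Translation table 11 shares its codon-to-amino-acid assignments with the standard code, so a standard table is used here.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 uses these same assignments and
# differs from table 1 only in which codons may serve as starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Drop the spaces that group the record's sequences into blocks of ten."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming the gene length includes one stop codon."""
    return gene_len_bp // 3 - 1


def gc_content(seq):
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    print(gene_length(4567690, 4568688))  # 999, matching "Gene Length"
    print(protein_length(999))            # 332, matching "Protein Length"
    # First three blocks of the gene sequence listed in the record.
    print(translate("ATGAACGTCG TTCCGCAGAT CAACCTGCAA"))  # MNVVPQINLQ
```

Passing the full gene sequence from the record to `translate` and `gc_content` allows the same comparison against the complete protein sequence and the reported GC content.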