Gene Information |
Locus tag | RPB_3999 |
Symbol | |
ID | 3911806 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4564612 |
End bp | 4566063 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637885903 |
Product | chlorophyllide reductase subunit Z |
Protein accession | YP_487603 |
Protein GI | 86751107 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01278] light-independent protochlorophyllide reductase, B subunit; [TIGR02014] chlorophyllide reductase subunit Z |
|
|
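The coordinate and length fields in the Gene Information table are related by simple arithmetic, and the short sketch below checks that they agree. It assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record itself, but both are consistent with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4564612
end_bp = 4566063

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both end points.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is a stop codon and is
# assumed not to be counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1452
print(protein_length)   # 483
```

Both results match the Gene Length (1452 bp) and Protein Length (483 aa) fields.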
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.464812 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00738947 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTTGTCC TCGACCATGA TCGCGCCGGC GGCTATTGGG GCGCCGTCTA TGCCTTCACC GCGGTGAAGG GCCTGCAGGT GATCATCGAC GGCCCGGTCG GCTGTGAAAA CCTGCCGGTG ACCTCGGTGC TGCACTACAC CGACGCGCTG CCGCCGCACG AACTGCCGAT CGTCGTGACC GGCCTCGGCG AAGAAGAGCT CGGCAAGCTC GGCACCGAAG GCGCCATGAA GCGCGCGCAT CGCACGCTCG ACCCGTTCAT GCCCGCGGTC GTGGTGACCG GTTCGATCGC CGAGATGATC GGCGGCGGCG TGACGCCCGA AGGCACCGGC ATCAAGCGCT TCCTGCCGCG CACCATCGAC GAAGACCAGT GGCAGAGCGC CGATCGCGCG ATCTCTTGGC TGTGGAAAGA ATACGGCCCG AAGAAGATTC CGGAGCGCAA GCCGCTGTCG CCGGACGTCA AGCCGCGGGT CAACATCATC GGCCCGATCT ACGGCACGTT CAACATGCCG TCCGATCTCG CGGAAATCCG CCGCCTGATC GAGGGCATCG GCGCCGAAGT CAACATGGTG TTTCCGCTCG GGACGCATCT GTCCGATATC CCGAAGCTGG TGAACGCCGA CGTCAACGTC TGCATGTATC GCGAGTTCGG CCGGCTGCTG TGCGAAACCT TGGAGCGGCC GTATCTCCAG GCGCCGATCG GACTGCATTC GACGACGCGC TTCCTGCGCA AGCTCGGCGA ACTCACCGGT CTCGATCCGG AGCCGTTCAT CGAGCGCGAG AAGAACACCA CGATCAAGCC GTTGTGGGAC CTTTGGCGCT CGGTGACGCA GGACTTCTTC GGCACCGCCA GCTTCGCGAT CGTCGCGACC GACACGTACG CCCGCGGTGT GCGGCATTTC CTCGAAGAGG AAATGGGCCT GCCGTGCGCC TTCGCGATGT CGCGCCGGGC CGGCGTCAAG CCGGACAACG ACGCGGTGCG GACCGCGATC CGGCAGACCC CGCCGTTGAT CATGTTCGGA AGCTACAACG AAAGAATGTA CCTCGCCGAA TCCGGCTCAC GCGCGATCTA CATCCCGGCG TCGTTTCCCG GCGCGGTGAT CCGCCGCCAT CTCGGCACGC CCTTCATGGG ATACTCCGGC GCGACCTATC TGGTGCAGGA GGTCTGCAAC GCGCTGTTCG ACGCGCTGTT CAACATCCTG CCGCTCGGCA GTGATCTCGA TCGGGTCGAT CCGACCCCGG CGCGCCGTCA CGAAGAGCTG CTCTGGAGTG ACGAGGCCAA GGCGCTGCTC GACGAGGTTC TCGAAGCTCA TCCGGTGCTG GTGCGAATTT CCGCGGCGAA GCGCTTGCGC GACGCAGCCG AGAATAGCGC GCGCCGCGCC GGCCAGGAGC GTGTGACCGA AGAATTCGTC ACGAAAGCGC GTGCAGCGCT GATGGACGGG CAGACTGTGT AA
|
Protein sequence | MLVLDHDRAG GYWGAVYAFT AVKGLQVIID GPVGCENLPV TSVLHYTDAL PPHELPIVVT GLGEEELGKL GTEGAMKRAH RTLDPFMPAV VVTGSIAEMI GGGVTPEGTG IKRFLPRTID EDQWQSADRA ISWLWKEYGP KKIPERKPLS PDVKPRVNII GPIYGTFNMP SDLAEIRRLI EGIGAEVNMV FPLGTHLSDI PKLVNADVNV CMYREFGRLL CETLERPYLQ APIGLHSTTR FLRKLGELTG LDPEPFIERE KNTTIKPLWD LWRSVTQDFF GTASFAIVAT DTYARGVRHF LEEEMGLPCA FAMSRRAGVK PDNDAVRTAI RQTPPLIMFG SYNERMYLAE SGSRAIYIPA SFPGAVIRRH LGTPFMGYSG ATYLVQEVCN ALFDALFNIL PLGSDLDRVD PTPARRHEEL LWSDEAKALL DEVLEAHPVL VRISAAKRLR DAAENSARRA GQERVTEEFV TKARAALMDG QTV
|
| |
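The sequence fields above are printed in space-separated blocks of ten characters. The following is a minimal sketch, not part of the original record, of how such a field could be cleaned, translated, and checked for GC content. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 and last 12 bases of the gene are used here to keep the example short.

```python
# Compact codon table in NCBI order (bases T, C, A, G at each position).
# Translation table 11 shares these amino-acid assignments with table 1.
CODON_BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the block-spacing whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; '*' marks a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        b1, b2, b3 = (CODON_BASES.index(base) for base in seq[i:i + 3])
        protein.append(AMINO_ACIDS[16 * b1 + 4 * b2 + b3])
    return "".join(protein)


# First three blocks and final 12 bases of the gene sequence field.
first_30 = "ATGCTTGTCC TCGACCATGA TCGCGCCGGC"
last_12 = "CAGACTGTGT AA"

print(translate(first_30))  # MLVLDHDRAG
print(translate(last_12))   # QTV*
```

The two translated fragments agree with the start (MLVLDHDRAG) and end (QTV) of the protein sequence field. Passing the full gene sequence string to `gc_fraction` would give a value to compare against the GC content field.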