Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4384 |
Symbol | |
ID | 3912199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4968275 |
End bp | 4969273 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637886290 |
Product | cytochrome-c peroxidase |
Protein accession | YP_487982 |
Protein GI | 86751486 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1858] Cytochrome c peroxidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAAAGCA GCAGGCCGGC CCGCCGTCCC CGATCCAGGC TGAGCGGTCT GTGCGCGGCG CTGCTGTGCT CGGTGGTCGA CTGTGCGGCG GCCGAAATGC GGGAGTTGCG CACGTTCGCC CCGCTCGAGG CGGCATCGTC GCTCGACGCC GACAAGGTCG CGCTCGGCCG GATGTTGTTC GGCGATTCGA TTCTGTCGAG GACGCGGGCG ATCGCGTGCA CATCCTGCCA CGACCTCGCC CGTGGCGGAA CGGTGCCGCT GTCCCGCGCC ATCGCCGAAG ACGGCCGGGA GCACGGCTTC AATGCTCCGA CGATCTTCAA CGTCGCGGCG AACTACCTGC TGGGCTGGCG CGGCAAGCAG ACGTCGCTCG AGGCCGTCAG CGAGAAAGTG CTGCTCGACG GCCGGTTGAT GGCCGCCGAT TGGGAGCTGC TGACCGCAAG GCTCGAACAG AGCCGATCGT ATGTGTCGTG GTTTCGCCGC ATCTACGGGC GCAAGGCCGA TCGGGCGAGC CTGCTCGACG CGCTGGTGAC GTTCCAGCGA TCGCTGCTGA CGCCGAACTC CCGCTTCGAT CGCTACCTGC GCGGCGACGG CTCCGCACTG ACGCCGGGCG AACGCGAGGG GCTGAAGCTG TTCATGAGCT ACGGGTGCGC GTCGTGTCAC CAGGGAGTCA ATCTCGGCGG CAATATGCGT CAGCGCTTCG GAATCTTCCC GGAGCCGGAC GGACCGCCGG AATCCCCTTC GAAGGCTGCG CCGCCGGACG CGTCCGAGCA AAATCTGTTT CGGGTGCCGA GCCTGCGAAA CGTCGCGGTC ACCGCACCGT ACTTCCACGA TGGAGGCGTC GCGAGTCTGT CGGAGGCGGT GTCGATCATG GGGCGACGCC AACTCGGCCA GACCCTCTCC GCCTCGGACA CGGACGCCAT TGTGTCCTTC CTGAAAACGC TCACCGGCGA ATACGATGGT CGCGAACTCG AAAATCCGGC GCCGGCACGT GTTCCGTGA
|
Protein sequence | MQSSRPARRP RSRLSGLCAA LLCSVVDCAA AEMRELRTFA PLEAASSLDA DKVALGRMLF GDSILSRTRA IACTSCHDLA RGGTVPLSRA IAEDGREHGF NAPTIFNVAA NYLLGWRGKQ TSLEAVSEKV LLDGRLMAAD WELLTARLEQ SRSYVSWFRR IYGRKADRAS LLDALVTFQR SLLTPNSRFD RYLRGDGSAL TPGEREGLKL FMSYGCASCH QGVNLGGNMR QRFGIFPEPD GPPESPSKAA PPDASEQNLF RVPSLRNVAV TAPYFHDGGV ASLSEAVSIM GRRQLGQTLS ASDTDAIVSF LKTLTGEYDG RELENPAPAR VP
|
| |