Gene Information |
Locus tag | RPB_3868 |
Symbol | |
ID | 3911672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4422829 |
End bp | 4423680 |
Gene Length | 852 bp |
Protein Length | 283 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637885769 |
Product | 3,4-dihydroxyphenylacetate 2,3-dioxygenase HpaD_Fe |
Protein accession | YP_487472 |
Protein GI | 86750976 |
COG category | [S] Function unknown |
COG ID | [COG3384] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02298] 3,4-dihydroxyphenylacetate 2,3-dioxygenase |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.90869 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAAAC TGGTGCTCGC CGCGAAGGTG ACCCACGTTC CCTCGTTGAT GTTGTCGGAG GCCGGCGACA GTCCACTGAA GCAGGCGCGC GCCGGCGCGG TGGAGTCGCT GCGCGAACTC GGCCGACGCG CCAGGACCCG CGACGTCTCC ACCTTCGTGG TGTTCGACAC CCATTGGCTG TCGAATTTCG GCTACCACGT CAACGCCAAT GCCCGGCATC GCGGCTCCTT CACCAGCCAC GAAGCGCCGC ACATGATCCA GGATCTGCGC TACGATCTGC CGGGCGACAC CGCGCTCGCC GAAGCGATCG CCAGCGAGGC GGAGAGCGCC GGTCTCAAGG TGCTCGCGCA TCAAGTTCCG ACGCTGGGGC TGGAGTACGG CACCATCGTG CCGATGCACT ATCTGAATCC CGACGGCTGG GCCAAGGTGG TGTCGATCGC CTCGCCGTTG TTCACCTCGA TCGAGGAAAG CCGCGCGCTC GGCGAGGCGA CGCGGCGCGC GATCGATCAG TCCGGCGAGC GAGTGGCTCT TCTCGCCAGC GGATCGCTGT CGCATCGGCT GTGGCCGAAC AAAAACCTCG GCCCGGAGGC CTGGACCTCG ATCGCCAGTG AGTTCAATCG CCAGGTCGAC CTGCGCGTGC TGCAACTCTG GCAGGAGCGG CGCTACCGCG AATTCACCGC GATGCTCCCG GACTATGCGG TGAAGTGCAA CGGCGAGGGC GGCATGGGCG ACACCGTGAT GCTGTTCGCC GCGCTCGGCT GGGACAAGTA CGACGGCGAG GCGGAGCCGC TTTGCGACTA CTTTCCGTCG AGCGGCAGCG GGCAGATCAA CGTCGAATTC CATGTCGCGT GA
|
Protein sequence | MGKLVLAAKV THVPSLMLSE AGDSPLKQAR AGAVESLREL GRRARTRDVS TFVVFDTHWL SNFGYHVNAN ARHRGSFTSH EAPHMIQDLR YDLPGDTALA EAIASEAESA GLKVLAHQVP TLGLEYGTIV PMHYLNPDGW AKVVSIASPL FTSIEESRAL GEATRRAIDQ SGERVALLAS GSLSHRLWPN KNLGPEAWTS IASEFNRQVD LRVLQLWQER RYREFTAMLP DYAVKCNGEG GMGDTVMLFA ALGWDKYDGE AEPLCDYFPS SGSGQINVEF HVA
|
| |
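
The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (the gene sequence above ends in TGA). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """GC content in percent, ignoring spaces and any non-ACGT characters."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Values copied from the record above.
length_bp = gene_length(4422829, 4423680)  # 852, matching "Gene Length"
length_aa = protein_length(length_bp)      # 283, matching "Protein Length"
```

Passing the full Gene sequence field to `gc_percent` (the spaces between 10-base blocks are ignored) gives a value to compare against the listed GC content.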