Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3504 |
Symbol | |
ID | 3911306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4010789 |
End bp | 4012585 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637885406 |
Product | chloride peroxidase |
Protein accession | YP_487110 |
Protein GI | 86750614 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATCA GGCGAGTTTT CGTCGCGACG TTCATACTGT CGCTCACTCC CGTGGTTTCC ACGGCCCAGC CGGTGCTTCA GACACAATCG CTCGACGACT TCCGGCGAAT CCTCAGTCCC GGCCTGATTT CGCGCGAGGT GACCAGCGAA AAGGTGAGTC AACTCAACGC GGAGGCTGCG AAGAGCTGCT TCGCCCCCGA CGCGTTTCAG CGCGCGATCC CGGCACTGCC GATCAGATAT AAGCGGCCAC TTCCTTGCGC CCCGCAGAAT GCGTTCGACC GGTTCGTGAT GTGGAATCAG ATCGCACTCG ACACCACATC GATGGATCAT GCTCCGCCGC CATTGGGCAC GCCGGACAAT CCCGCCCTGC ACAATTACGG CCCGCACCGC TCGAGCCGGG TGATGGCGAT CGTTCACATC GCGATGTTCG ACGCGATCAA TCTGGCGATG CGGTTCGACT CGAAATCACC CGACAAGCCG AGATACGTGT TCGCGACCTA TCTGCAGGGC ATTCCCAATC CGAAAGAAGA CGCCTCGGTC GACATGGCGA TCACGTATTC GGCGGGTGAG ACCCTGAAGG CGCTTTATCC TCAACAGGTG GATGTGATCG AGAATCTCAT CGCCATCGAC GAGGGCGCCG TCCTGAGCGG CAAGGATCGC AACGCGCCGA TGTATCGGGC CGGCCGCGAA CTCGGAATCA CCGTCGCGAA GGCGATTCTC CGGGCACGCC GGAACGACGG CTCGGCGCAC GAAGAGCCGC AGGTCGGCGA TCCTGGCTTT CCGCTCGCGG AAAAATCAGG TCAGTGGCGA CCCGACCCCG TCAGCAACAT CGCCGCGGCG CTCGGCGGCA AGTGGAAGGA CGTTCAGCCG TTCGTCATCA AGAACGTGCC GACCTTCCGG CCGCCCCCGC CGCCGTCGAC CGACAGCGCC GAATACGCCC GGGACTTCGA AGAGGTCAGG AAGCTCGGCG GCGAGAACAC CGAGCGCTCG AGGTCTGAAC GCAGTGCGGA CCAGACACTG ATCGGAACGT TCTGGGCTTA TGACGGCACC GCTTTCCTGT GCGCTCCGCC GCGGCTCTAC AATCAGGTGA TCCGGCAGAT CGTGCAGCAG CAGATCAAGG ACGGCGATAC CAAGGACGAG GATCCGAAGC TGCTCAACTA CGCCCGATTG TTCGCGCTCG CCAACATCGC AATGGCGGAC GCGGCGATCG CCGCATGGGA CGCAAAGTAT CACTATCGCT ACTGGCGCCC GGTCGTCGGC ATTCGGGCGG CCGCGACCGA CGGCAACGAT GCCACCCATG CCGACGTCTA CTGGAAGGCG CTGGGGGCCC CGGCCAGCAA CTCCGTTCGC GGTCCCAACT TCACGCCTCC TTTCCCGGCC TATCCTTCCG GACACGCCAC GTTCGGCGGC GCTCTGTTCG AAGTGCTGCG CGCGTTCTAC CCGGACGACA CGTCCTTCAG CTTCATCTCC GACGAGTACA ACGGGCAAAA CAAGCCGGCC GGCTCGGACG TGCCGCGACC GGAAGTGACC CGACGCTTCG TCAACTTCCG CGCGGCGGAG GACGAAAATG CGCGCAGCCG GGTCTATCTC GGTGTGCACT GGCAGTTCGA TGCCGATGCC GGGATCGCGC AGGGCAACCA GGTCGGCAGC TTCGTGGTCG GCAGCACGCT GCGCTGTCTC GACGATGACG GCAGAGCGCT GGATTGCAAA CCGGGCAGCG GATTCGACGT CAAGCGCAAG TTCCTGATCT CGACGGAGAA GCGGGTGCTG AGCACTCCGT TCACTCCGTC CCAGTAA
|
Protein sequence | MTIRRVFVAT FILSLTPVVS TAQPVLQTQS LDDFRRILSP GLISREVTSE KVSQLNAEAA KSCFAPDAFQ RAIPALPIRY KRPLPCAPQN AFDRFVMWNQ IALDTTSMDH APPPLGTPDN PALHNYGPHR SSRVMAIVHI AMFDAINLAM RFDSKSPDKP RYVFATYLQG IPNPKEDASV DMAITYSAGE TLKALYPQQV DVIENLIAID EGAVLSGKDR NAPMYRAGRE LGITVAKAIL RARRNDGSAH EEPQVGDPGF PLAEKSGQWR PDPVSNIAAA LGGKWKDVQP FVIKNVPTFR PPPPPSTDSA EYARDFEEVR KLGGENTERS RSERSADQTL IGTFWAYDGT AFLCAPPRLY NQVIRQIVQQ QIKDGDTKDE DPKLLNYARL FALANIAMAD AAIAAWDAKY HYRYWRPVVG IRAAATDGND ATHADVYWKA LGAPASNSVR GPNFTPPFPA YPSGHATFGG ALFEVLRAFY PDDTSFSFIS DEYNGQNKPA GSDVPRPEVT RRFVNFRAAE DENARSRVYL GVHWQFDADA GIAQGNQVGS FVVGSTLRCL DDDGRALDCK PGSGFDVKRK FLISTEKRVL STPFTPSQ
|
| |