Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3652 |
Symbol | |
ID | 3911454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4192015 |
End bp | 4193235 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637885554 |
Product | cytochrome P450 |
Protein accession | YP_487258 |
Protein GI | 86750762 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2124] Cytochrome P450 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.348066 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAATGC CCTCACGCGA ACTGGCGGCG GAGTTCGAAC TCGAACGGCT GACGCCGGAG TTCTATGACA ATCCCTACCC CACCTATCGC GCCCTGCAGA CGCATCAGCC GGTCAAGCGG CTGCGCAATG GCGGGTACAT CCTGACGCGC TATGACGATC TGGTGACGGT CTACAAGAAC ACCACGCTGT TCAGCTCCGA CAAGAAGCGC GAATTCGCGC CGAAATACGG CGACTCGCTG CTGTTCGAGC ATCACACTAC CAGCCTGGTG TTCAACGACC CGCCGGCGCA TACGCGAGTG CGGCGGCTGA TCACGGGCGC GCTGTCGCCG CGCGCGATCG CCGGCATGCA GCCGGATCTG ATCGCGCTGG TCGACCGCCT GCTCGATGCG ATGGCGGCCA AAGCCGGCGT CGATCTGATC GAGGATTTCG CCGCCGCGAT CCCGATCGAG GTGATCGGCA ATCTGCTCGG CGTGCCGCAC GACGAGCGCG GCCCGCTCCG CGACTGGTCG CTGGCGATTC TCGGCGCACT CGAGCCGGTG ATCGGGCCGG AGACGTTTTC GCGCGGCAAT GAGGCTGTCC GCGACTTCCT CGCCTATCTC GAAATCCTGA TCACGCGCCG TCGCGCCGAG CCCGGCGATC CGGAGCACGA TGTTCTGACC CGGCTGATCC AGGGCGACGA CGGCACCGGC GAGAAGCTCT CCGCCAAGGA GCTGCTGCAC AATTGCATCT TCCTGCTCAA CGCCGGACAT GAAACCACCA CCAACCTGAT CGGCAACGGG CTCGTGGCGC TCGCAGACAA TCCTGCGGAA AAACAGCGGC TGATCGGCCA GCCCGGCCTC GCCCGCACCG CGGTCGAAGA GATCCTGCGC TATGAGAGCT CGAACCAGCT CGGCAACCGC ATCACCACCA CCGAGGTCGA GATCGGAGGC GTAACGATGC AGGCCAACAC CTCGCTGACG CTGTGCATCG GCGCTGCCAA CCGCGATCCG GCGCAGTTTC CCGATCCCGA CCGGTTCGAC GTCGGACGAA CGCCGAACCG GCACCTCGCT TTTGCCACGG GGCCACATCA ATGCGCCGGC ATGGCGCTGG CGCGGCTCGA AGGCGTGATC GCGCTGACGC GATTCCTGGC GCGCTTCCCG AACTACACGC TCGACGGCAC GCCGTCGCGC GGCGGGCGGG TGCGGTTTCG CGGCTATCTG CGCGTGCCAT GCCGCCTGTA G
|
Protein sequence | MEMPSRELAA EFELERLTPE FYDNPYPTYR ALQTHQPVKR LRNGGYILTR YDDLVTVYKN TTLFSSDKKR EFAPKYGDSL LFEHHTTSLV FNDPPAHTRV RRLITGALSP RAIAGMQPDL IALVDRLLDA MAAKAGVDLI EDFAAAIPIE VIGNLLGVPH DERGPLRDWS LAILGALEPV IGPETFSRGN EAVRDFLAYL EILITRRRAE PGDPEHDVLT RLIQGDDGTG EKLSAKELLH NCIFLLNAGH ETTTNLIGNG LVALADNPAE KQRLIGQPGL ARTAVEEILR YESSNQLGNR ITTTEVEIGG VTMQANTSLT LCIGAANRDP AQFPDPDRFD VGRTPNRHLA FATGPHQCAG MALARLEGVI ALTRFLARFP NYTLDGTPSR GGRVRFRGYL RVPCRL
|
| |