Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_0969 |
Symbol | |
ID | 4021444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1096707 |
End bp | 1101167 |
Gene Length | 4461 bp |
Protein Length | 1486 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637961160 |
Product | cytochrome P450-like |
Protein accession | YP_568108 |
Protein GI | 91975449 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2124] Cytochrome P450 |
TIGRFAM ID | [TIGR01413] Dyp-type peroxidase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCACGC TCAAGTACTA TGACGCGATC AAGAAGGCCC AGTCCGACGC AGCCTCCGCT CCTCCTCCGC CGCTCGTCCC GTTCGATCTC GACGATCTCG GCGCCGACAC CACGCTGAAG CGCTGGAGCA CCACGGCGTT CAGCTATGTG TTGCGTGGCG CGCTGTATCT GTTCAGGGAA TTCTGGCCGA ACCCGCAATT CGGTCGCCTC GTGATCGTCA CCCGCGAAAC CGACGTCCGC GAGGTGTTGG CGCAGCCCGG TGTTTTCGAG GTGCCCTACG GGCCGGAGAT GACCGAACTC GCCGGCGGCA CCAATTTCGT GCTCGGCCTC GAAGGTCCGG AGCATGACCG GCAGAACGCC ATCATCCGCA GCGTGCTGCG CCCGACCGAT CTCGACCGCA TCAAGAGTCT CGCCCGCCAT TACGCGCAGA TCCTGATCGA CGGCTCCGGC GGCCGGATCG ACGTGATGAA GGACCTGATG ACGCGGGTCG CCACCGAGAC CTGCTGCGGT TATTTCGGCC TCGCTCCGGA GGACCCGGAC GCGTTCGCCG AATGGGCGAT GTCGATCTCG GCGCTGCTGT TCGCCGATCC GTTCGGAAAT GCCGCCACCC GTCGCCTTGC GCTGAATGGC GCGGCGCAGG TCCGCGAGGT GATCGATCGC GCGATCGCGC GCGCAAAGGC GGCACCGGAG ACCGACACCG TGGTCGGCCG GTTGGTCGCG CAATCCAGCG ACGGCGCAGC GACCGAAGGC GAGATCCGCG CCATTCTGGT CGGGCTCGTC ACCGGTTTCA TCCCGACCAA CACGCTCGCC GCCGGCAAGA TTCTCGAAGA GCTGCTGCGC CGGCCCAAGG TGTGGGCGGA AGCGATCGAC TGCGCCGGCC GTGACGACAG CGCCGGGCTC GAAGCGATCC TGCTCGAAGC CGGCAGGCTC AATCCCGCTC TGGCGCCGGG GCAGTGGCGC TACGCGACGA AAGACGGCGT CATCGCCCAC AACACCTCGC GGCAGCGCAA GGTCCGCGCC GGCTCGGTGC TGATGGTGGC GACGATGTCG GCGCTGCGCG ACAAGCGGGC GTTCGTGTCC GCGGGATCGT TCCGCGCCGA CCGGCCGAAC CAATCGAGCC TGATGTTCGG CGACGGTGTT CATGCCTGCC TCGGTATGCA TGTTGCGATC GCGCAGATCA CCGAAGTCTT TCGCGTGCTG CTGAGGCAGC CGAATTTGCG CAGGGCCTCC GACAGATCGG GCGCGATCGG CTGGGTCGGT CCGTTCCCGC GCCGGCTCGA CATGGAGTTC GAGCCGAAGA TCGCGCCGCA GACGCAGAAC ATGATCGTGA TCTGCGCGCC GGTTCGTCCC GACACCGATC TCGACGCCCT GCGCGCGCAG ATCACCGCGC TCGGCAATCC GGCGCGGCCG GACGTCGTCG CCGCGCTGCA GGCGACAGGA CTGATCCACT TCGCCTCGAT GACGCTGATC GACGCCGGCA CGCCGGATCA GCCCGCGCCG CATCTGTTGC TCGAACTCAA TGTCGACGGC TCGCCGGAGA GTGCGATCCG CGCCGTCGTG AACGAGGCGG GCGAATGGCT GGCGCCGATC TTCGCGCACG CCGACGAACG CGCCGGCGCC GCGCTCGGCG ACATCCTCCG CCGCAACATG CTCGATCTGC AGACGAAGCC ATGGGGCGCG ATCGGGTTGA ACTTCAACGG CACGCCGGAA TTCGCCGTCG CCGACATTGT GCGGCAGCGC GAGCTCGCGC GTTTCACCCA GGACGCGCTC GAGGCGTATC TCGAGAACCA TGCCGGCCTC GGCAGCCGCG CGATGGTCGC GCTCGGCTAT GTCCGGAAGC TGATCCGGCA GGATCCCGCG CTGAAGCGGA TCATCGACCA ATCGCCGGAC TCGACCCGCA AGGCCCGGCT GCAGGCGTTG TTCACGCGCG GCGCTGCATT CACCGACTAT CTGATCCGGC CGAGCCGGCG GCGTTTGCAG ATCTCCGACT GGGTCCCGCG CTCCGGCGCC GAGTCGCTGC TGGCGATGTT CAACTCGACC ACGTTCCGGT GGATCGGCGC GATCGTCATC GGCCTCGTGC TGATCGCGAG CCAGGCGATC TATTTTGCGA TCGAGCCGTA TTCGGACGCG ACCTATATCG GCCGCATCGC GCTGGCGGTG GTCGGCGGTT TGCTGCTGGT AGCGCTGAAG CTCGCCGCGC TGGCCGGCCT GTTCCTGCTG GTGTTGCGCT ATTACGAAAA CGGCGACGTT CCCGACGACA GCGATCCCGA CATCGCCAGG GTTCGCGAGA TCGCCGCCAG CGAAAACCAT CCCGGCTTCG TGCAGAACCA CATCACCGCC GTCACTGCGC TGAAGCCCGG CGCGTTTCGC AAGCTGACGC TTGCGCTGTC GCTGTGGGGC ATCAAGGAGC TGGTGACGAA TTTCTACCGG CCCGGCTTCG TGCTCAACAT GGGCACCATC CACAAGGCCA AATGGTTCAG GCCGCCCGGC GCCGACAAGC TGATCTTCCT CGCCAATTAC GATGGAAGCT GGGAGAGCTA TCTCGAAGAC TTCGTGATGA AGGCCCATGC CGGCCAGTCG GCGGCGTGGA GTAATGGCGT CGGCTTTCCG CGCACCCGCT TCCTGATCTA CGACGGCGCG CAGGACGGCG ACCGCTTCAA GCGCTGGGTG CGGCGGCAGC AGGTGCCGAC GCAGTTCTGG TTCAACCGCT ATCCGCGCCT CACCACCGAC GAGATCCGCC GCAACGCGCT GATCCATGAC GGGCTGGTCC GCGCCTCGAC CGACAGCGCG GCGCAGGCGT GGATCGACTG CTTCGGTTCG ATGACGCGTC CGGTCGACGC GATCGAGACG CCGGAGGTGC AGTCGCTGGT GTTCCGCGGG ATGGGCCAGC TCGCCTACAC GGCGACCGCG CTGCTGCGGC TTCCCGCCGA CAAAGCGGCT AGCAAAACAT GGCTGCGCGC GATCATGCCG GAGCCGGGGC TGCTGCCGGA TTCATCCGGC GCTGCGCCGC GGCCGGCGGT CGGCGCGGTC ACGTTCGGCG ATCGTCCGTT CGCCGGCGGC GATGCGCCGC ATCATGTCGC GACCTTCGTC GCTTTCTCGG CGTCAGGGCT TGCGAGGCTC GGGATGTCGC GGTCCAACGC CAATGACGGG CTGACGACGT TTCCGACCGC GTTCAACATC GGTATGGCAA ATCGCGCCAA CATCCTGCGC GACACCGGCG CTTCTGCGCC GGAGCGGTGG GACTGGGTCG ACGCCGCGCT CGACGGCCGC GACGACGTCG CCGCCGCTGA CGCCGCGTTG TTCGTCTATG GCCGCTCGGC GGAGGACTGC CGCGCTGCGC TCGACAATCA CGCCGCGCTG CTCGGCGGCT CCGACGCGCT GCTCTACGTG GTCGAGACCC GACCCGCCAC GGTCGAAACC GAGGACGGGC CGAAGACCTC GCTCGACTAC GAACATTTCG GTTTCGTCGA TGGCATCTCG CAGCCGGTGA TCCGCGGCAC CCAGCGCTTC GCCAAGGGCG TGGCTGCGCG CGACATCGTC GAGCCGGGCG AATTCATCCT CGGCTATCGC AACAACCAGG GCTACTTTCC GCCGAGCGCG ACGGTGCGGA GCAGCTCGGA TCCCGCCGAT CATCTGCCGA TCCTGCCCGA CGCGCTGCCG GGTCGCTTCC CGAAGTTCTG CTCGGACACG CCGGCGAAAC CGGTGCGCGA TTTCGGCCGC AACGGCACCT TCCTGGCGAT CCGCCACTTC GTGCAGGATG TCGACGGCTT CCGCAGCTTC ACCGAGGCGA AGGCCGCCGA GCTCGGCAGA TATCGCGATC TCGCCGCGGT GATCGGCGAG GAACCGACCG CCGAATGGGT CGCGGCGAAG ATGATGGGCC GCTGGCGCAA CGGCGTGCCG CTGGTCGATC AGCCGAATTC GAAGACGTTC AACAATCGCC GCGGCCCGTC GCGCGACGAC GTCGACCGCG CCTATGATCG CGACAACGAT TTCAGCTACG GCCAGGATGA TCCGCAGGGG CTGCACTGTC CGTTCGGCGC GCATATCCGC CGCGCCAATC CGCGCGACAG CCTGCAGCCC GATGATCCGA CGCAGCAGCG ATTGACCGCG CGTCACCGGC TGCTGCGCCG CGGCCGCTCG TTCGAAAGCC AGCAAGGCGA TGCAGGCAGA CCGGAAAAGG GCCTGCTGTT CGTCGCGGTC TGCGCCGACG TCGAGCGTCA GTTCGAGCTG GTGCAGCAAT CCTGGGTGTC GTCGCCGTCG TTCCACGGCC TCAGCGACGA GCCGGACCCG ATCATTTCGG CGACGCCGGA CGATCCGGCC GAGCAGCGGG TGTTCACCAT CCCGACCGCC GCCGGCCCGC TGACGCTGCA CGGCATCCAG AGCTACGTCA CGGTGAAAGG CGGCGGCTAC TTCTTCATGC CGAGCCGCTC GGCGCTGCAA TATTTGATCG ATCTGGAGTG A
|
Protein sequence | MFTLKYYDAI KKAQSDAASA PPPPLVPFDL DDLGADTTLK RWSTTAFSYV LRGALYLFRE FWPNPQFGRL VIVTRETDVR EVLAQPGVFE VPYGPEMTEL AGGTNFVLGL EGPEHDRQNA IIRSVLRPTD LDRIKSLARH YAQILIDGSG GRIDVMKDLM TRVATETCCG YFGLAPEDPD AFAEWAMSIS ALLFADPFGN AATRRLALNG AAQVREVIDR AIARAKAAPE TDTVVGRLVA QSSDGAATEG EIRAILVGLV TGFIPTNTLA AGKILEELLR RPKVWAEAID CAGRDDSAGL EAILLEAGRL NPALAPGQWR YATKDGVIAH NTSRQRKVRA GSVLMVATMS ALRDKRAFVS AGSFRADRPN QSSLMFGDGV HACLGMHVAI AQITEVFRVL LRQPNLRRAS DRSGAIGWVG PFPRRLDMEF EPKIAPQTQN MIVICAPVRP DTDLDALRAQ ITALGNPARP DVVAALQATG LIHFASMTLI DAGTPDQPAP HLLLELNVDG SPESAIRAVV NEAGEWLAPI FAHADERAGA ALGDILRRNM LDLQTKPWGA IGLNFNGTPE FAVADIVRQR ELARFTQDAL EAYLENHAGL GSRAMVALGY VRKLIRQDPA LKRIIDQSPD STRKARLQAL FTRGAAFTDY LIRPSRRRLQ ISDWVPRSGA ESLLAMFNST TFRWIGAIVI GLVLIASQAI YFAIEPYSDA TYIGRIALAV VGGLLLVALK LAALAGLFLL VLRYYENGDV PDDSDPDIAR VREIAASENH PGFVQNHITA VTALKPGAFR KLTLALSLWG IKELVTNFYR PGFVLNMGTI HKAKWFRPPG ADKLIFLANY DGSWESYLED FVMKAHAGQS AAWSNGVGFP RTRFLIYDGA QDGDRFKRWV RRQQVPTQFW FNRYPRLTTD EIRRNALIHD GLVRASTDSA AQAWIDCFGS MTRPVDAIET PEVQSLVFRG MGQLAYTATA LLRLPADKAA SKTWLRAIMP EPGLLPDSSG AAPRPAVGAV TFGDRPFAGG DAPHHVATFV AFSASGLARL GMSRSNANDG LTTFPTAFNI GMANRANILR DTGASAPERW DWVDAALDGR DDVAAADAAL FVYGRSAEDC RAALDNHAAL LGGSDALLYV VETRPATVET EDGPKTSLDY EHFGFVDGIS QPVIRGTQRF AKGVAARDIV EPGEFILGYR NNQGYFPPSA TVRSSSDPAD HLPILPDALP GRFPKFCSDT PAKPVRDFGR NGTFLAIRHF VQDVDGFRSF TEAKAAELGR YRDLAAVIGE EPTAEWVAAK MMGRWRNGVP LVDQPNSKTF NNRRGPSRDD VDRAYDRDND FSYGQDDPQG LHCPFGAHIR RANPRDSLQP DDPTQQRLTA RHRLLRRGRS FESQQGDAGR PEKGLLFVAV CADVERQFEL VQQSWVSSPS FHGLSDEPDP IISATPDDPA EQRVFTIPTA AGPLTLHGIQ SYVTVKGGGY FFMPSRSALQ YLIDLE
|
| |