Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0862 |
Symbol | |
ID | 3909120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 987425 |
End bp | 991894 |
Gene Length | 4470 bp |
Protein Length | 1489 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637882755 |
Product | cytochrome P450-like |
Protein accession | YP_484484 |
Protein GI | 86747988 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2124] Cytochrome P450 |
TIGRFAM ID | [TIGR01413] Dyp-type peroxidase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCACGC TCAAGTACTA CGACGCGATC AAGAAAGCCC AGTCCGACAC CGCTTCCGCC CCGCCCGCTC CGATCCTCCC GTTCAATCTC GACGACCTCG GCGCCGACAC CACGCTGAAG CGCTGGACCA GCTGGGGCTT CGGCTGGCTG ATGCGCGGAG CGCTGCATGT CTTCCGCGAG GTCTGGCCGA ATCCGCAATT CGGTCGTCTG ATCATCGTCA CCCGCGAGAC CGACGTTCGT GACGTGCTGG CGCAGCCCGG CCTGTACGAG GTGCCCTACG GGCCGGAGAT GACCGAACTC GCCGGCGGCA CCAATTTCGT GCTCGGCCTC GAGGGTCCGG AGCACGACCG CCAGAACGCC ATCATCCGCA GCGTGCTGCG CCCGGCCGAT CTCGATCGCA TCAGGGAATT GTCCGCGCAC TACACGACGA TCCTGCTCGA CGCCTCCGGC GGCCGCATCG ACGTGATGAA GGATCTGATG ACGCGGGTCG CGACCGAAAC CTGCTGCGGC TATTTCGGCC TCGAACCGGA GGATCCGGAC GCCTTCGCCG AATGGGCGAT GTCGATCTCG GCGCTGTTGT TCGCCGATCC GTTCGGCGAC GCCGCGACGC GGCGGCTGGC GCTGAACGGC GCCGCACAGG TCCGCGAGGT CATCGACCGC GCCATCGCGC GCGCCAAGGC GGCGCCGGAA ACCGACACCG TGGTCGGGCG CCTGGTGGCG CAGGCGCACG ACGGCGTCGT CACCGAGGAC GAGATCCGCG CCATCATCGT CGGCCTCGTC ACCGGCTTCA TCCCGACCAA CACGCTCGCC GCCGGCAAGA TGCTCGACGA ATTGCTGCGC CGTCCGAAAG TGTGGGCGGA GGCGATCGCT TGCGCCGGCC GCGACGACGT CGCCGGGCTG CAGGCGATCC TGCTCGAAGC CGGCCGGCTC AATCCGGCGC TGGCGCCGGG GCAGTGGCGC TACGCGACGC AGGACGGCGT GATCGCGCAC AACACCAGCC GGCAGCGCCG GGTCAAGGCC GGCTCGGTGC TGATGGTGGC GACGATGTCG GCGCTGCGCG ACAAGCGCGC TTTCGTGGCG CCGGGGTCGT TCCGCGCCGA CCGGCCGAAC GATTCCGGCC TGATGTTCGG CGACGGCGCC CATGTCTGCC TCGGCAAGCA CGTCGCGATC ACGCAGATCA CGCAGGTGTT CCGCGGGCTG CTGCAACAGC CGAATCTGCG CACCGCTTCC GGCAAGGATG GCGCGATCGG CTGGGTCGGT CCTTTCCCGC GCCGGCTCGA CATGGAGTTC GAATCGCGGG TCGCGCCGCA GACCCAGAAC ATGGTGGTGA TCTGCGCACC GGTGCGTCCC GGCGCCGATC TCGACACCTT GCGGACGCAG ATCACCGCGC TCGGCAATCC GGCGCGGCCG GAGCTGGTCG CGGCGTTCGA GGCGACCGGC ATCGTCCACT TCGCCTCGAT GACGCTGATC GACGCCGGCA CGCCCGAGCA GCCCGCGCCG CATCTGTTGC TCGAGCTCAA TGTCGACGGC ACCCCGGACG GCGCGATCCG CGCCGTGGCC GAGGTCGCCG GACAATGGCT GGCGCCGATC TTCGCGCAGG CCGACGCGCC CGCCGGCGCG GCGCTGATCG ACATCCTGCG CGACAACACG CTCGATCTGC AGACCCGGCC GTGGGGCGCG ATCGGGCTGA ACTTCAACGG CACGCCGGAA TTCGCCGTCG GCGATATCAT CCGGCAGCGC GAGCTGGCGC AATTCGCCCA GGACGCGCTG GAGGACTATC TCGAAAATCA CGCCTGCCTC GGCAGCCGCG CGATGGTGGC GCTCGGCTAT GTCCGCAAGC TGATCCGGCA GGACCCCGCG CTGAAGCGGA CCATCGACGA ATCGCCGGAC TCGCCGCGCA GGGCGCGGCT GCAGGCGCTG TTCGCCCGCG GCGCGGCGTT CACCCAATAT CTGATCCGGC CGAGCCGGCG GCGGCTCAAG ATCTCCGACT GGGTGCCGCG CTCCGGCACC GACTCGCTGC TGTCGCTGCT CGGCTCGCCG ACTTTCCAGT GGATCGGCGC GATCGTCGCC GCGCTGGTGC TGATCGCCGG CCAGGCGATC TATTTCGCGA TCGAGCCGTT CTCGGACGCG ACCTATCTTG GCCGCATCGC GCTCGCCTTG GTCGGCGGGA TTTTGCTGGT GGCGCTGATC CTCGCCGCAC TCGGCGGGCT GTTCCTGCTC GTGCTGAACG ACTACGAATC GCGCGACGTT CCCGACGACA GCGACCCCGA TCTCGGAAAG GTCCGCGAGA TCGCCGCCAG CGAGAACCAT CCCGGCTTCA TCCAGAACCA CATCACCGCG GTCACCACGC TCAAGCCCGG CTGGTTCCGG AAACTGACGC TGGCGCTGTC GCTGTGGGGC ATCAAGGAGC TGGTGACGCA CTGGTATCGC CCCGGCTTCG TGCTCAACAT GGGCACCATC CACAAGGCGA AATGGTTCCG GCCGCCCGGC ACCGACAAGC TGATCTTCCT CGCCAATTAC GACGGCAGCT GGGAGAGCTA TCTCGAGGAC TTCGTGATGA AGGCCCATGC CGGCCAGTCG GCGGCGTGGA GCAACGGCGT CGGCTTTCCG CGCACCCGCT TCCTGATCTT CGACGGCGCG CAGGACGGCG ATCGCTTCAA GCGCTGGGTG CGGCGCCAGC AGGTGCCGAC CCAGTTCTGG TTCAACCGCT ATCCGCAACT CACCACCGAC GACATCCGCC GCAACGCCAT GATCCACGAC GGCCTGGTCC GCGCCTCGAC CGACAGCGCG GCGCGGGCGT GGCTCGATTG CTTCGGCTCG ATGACCCGGC CGAACTATGC GATCGAGACT CCCGAGGTGC AGTCGCTGGT GTTCCGCGCG ATGGGGCAGC TCGACCACAC CGCGACCGCA TTGCTGCGGC TGCCCGCGGA CCGGGGCGCG GGCAAGGAAT GGCTGCGCGC GATCATGCCG GAGGCGGGGC TGCTGCAGGA TCCGCAGGCG CCGCGGCCCG CGATCGGCGC GATCACCTTC GGCGATCGAC CGTTCGTCGG CGGCGACGCC GCGCACAATG TCGCGACCTT CGTCGCCTTC TCGGCGAGCG GGCTCGGCAA GCTCGGCCTG TCGGCGCGCA ACGCCAATGA CGGGCTGACC ACGTTCCCGA CCGCGTTCAA CATTGGCATG TCGCAGCGCG CCAACATCCT GCGCGACACC GGCGCGTCGA AGCCGGAGCG ATGGGATTGG GTCGACGCCG CGCTGGAGGG CAGCGACGGG GCGGCGGCGG CCGACGCCAC GCTGTTCGTC TACGGCAAAT CCGCCGAGGT CTGCCGCAAG GCGCTCGACG CGCATGCGGC GCTGCTCGGC GGCCGCGATG CGCTGCTCTA CGTCGTCGAG ACCAGCCCGC CGACGGTCGA CACCCCGGAC GGTCCGAAGA CCTCGCTGGA GTACGAACAT TTCGGCTTCG TCGACGGCAT CTCGCAGCCG GTGATCAAGG GCACCCAGCG CTTCGCCAAG GGCGTGCCGG CGCGCGACAT CGTCGAGCCC GGCGAGTTCA TTCTCGGCTA TCGCAACAAT CAGGGTTACT TTCCGCCGAG CGCGACGGTG GCCAGCAGCT CCGATCCGGC CAATCATCTG CCGATCCTGC CGGATCTGCT GCCGAGCCGG TTTCCGAATT TCCGCGCCGA TACGCCGGCC AAGCCGGTGC GCGATTTCGG CCGCAACGGG ACTTTTCTCG CGATCCGCCA GTTCGTGCAG GACGTCGACG GCTTCAAGGC GTTCACCGAA GCCAAGGCGC AGGAGCTGTC GAAATATCGC GACCTCGCAG CGGTGATCGG CGAGACGCCG ACGGCCGAAT GGGTGGCGGC GAAGATGATG GGGCGCTGGC GCAACGGCGT GCCGCTGGTC GACAAGCCGA ATTCCACCAC CTTCAACACC CGTCGCGGCA AATCGCGTAG CGACGCGCGC GATCCCTACG ACCGCGACAA CGACTTCGCC TACGGCCAGG ACGATCCGCA GGGGCTGCAT TGTCCGTTCG GCGCGCATAT CCGCCGCGCC AATCCGCGCG ACAGCCTGCA GCCCGACGAT CCGACGCAGC AGCAGCTCAC CGCGCGGCAC CGGCTGCTGC GCCGCGGCCG CTCGTTCGAA GCCGGGCAGG GCAGTGGCGG GCCGGGTCGC GGCGGCAAGC CGGAAAAGGG CCTGCTGTTC GTCGCGGTCT GCGCCGACGT CGAACGCCAG TTCGAACTGG TGCAGCAATC CTGGGTGTCG TCGCCGTCGT TCCACGGCCT CAGCCACGAG CCGGACCCGA TCATCGCGTC GGCGCCCGAC GATCCGGCGA AAAAGCGCGT CTTCACCATC CCGACCGCCG CCGGCCCGCT GACCCTGCAC GGTATCCACA GCTACGTCAC GGTGACAGGC GGCGGCTACT TCTTCATGCC GAGCCGCTCG GCGCTGCAAT ATCTGATCGA TCTGGAGTGA
|
Protein sequence | MFTLKYYDAI KKAQSDTASA PPAPILPFNL DDLGADTTLK RWTSWGFGWL MRGALHVFRE VWPNPQFGRL IIVTRETDVR DVLAQPGLYE VPYGPEMTEL AGGTNFVLGL EGPEHDRQNA IIRSVLRPAD LDRIRELSAH YTTILLDASG GRIDVMKDLM TRVATETCCG YFGLEPEDPD AFAEWAMSIS ALLFADPFGD AATRRLALNG AAQVREVIDR AIARAKAAPE TDTVVGRLVA QAHDGVVTED EIRAIIVGLV TGFIPTNTLA AGKMLDELLR RPKVWAEAIA CAGRDDVAGL QAILLEAGRL NPALAPGQWR YATQDGVIAH NTSRQRRVKA GSVLMVATMS ALRDKRAFVA PGSFRADRPN DSGLMFGDGA HVCLGKHVAI TQITQVFRGL LQQPNLRTAS GKDGAIGWVG PFPRRLDMEF ESRVAPQTQN MVVICAPVRP GADLDTLRTQ ITALGNPARP ELVAAFEATG IVHFASMTLI DAGTPEQPAP HLLLELNVDG TPDGAIRAVA EVAGQWLAPI FAQADAPAGA ALIDILRDNT LDLQTRPWGA IGLNFNGTPE FAVGDIIRQR ELAQFAQDAL EDYLENHACL GSRAMVALGY VRKLIRQDPA LKRTIDESPD SPRRARLQAL FARGAAFTQY LIRPSRRRLK ISDWVPRSGT DSLLSLLGSP TFQWIGAIVA ALVLIAGQAI YFAIEPFSDA TYLGRIALAL VGGILLVALI LAALGGLFLL VLNDYESRDV PDDSDPDLGK VREIAASENH PGFIQNHITA VTTLKPGWFR KLTLALSLWG IKELVTHWYR PGFVLNMGTI HKAKWFRPPG TDKLIFLANY DGSWESYLED FVMKAHAGQS AAWSNGVGFP RTRFLIFDGA QDGDRFKRWV RRQQVPTQFW FNRYPQLTTD DIRRNAMIHD GLVRASTDSA ARAWLDCFGS MTRPNYAIET PEVQSLVFRA MGQLDHTATA LLRLPADRGA GKEWLRAIMP EAGLLQDPQA PRPAIGAITF GDRPFVGGDA AHNVATFVAF SASGLGKLGL SARNANDGLT TFPTAFNIGM SQRANILRDT GASKPERWDW VDAALEGSDG AAAADATLFV YGKSAEVCRK ALDAHAALLG GRDALLYVVE TSPPTVDTPD GPKTSLEYEH FGFVDGISQP VIKGTQRFAK GVPARDIVEP GEFILGYRNN QGYFPPSATV ASSSDPANHL PILPDLLPSR FPNFRADTPA KPVRDFGRNG TFLAIRQFVQ DVDGFKAFTE AKAQELSKYR DLAAVIGETP TAEWVAAKMM GRWRNGVPLV DKPNSTTFNT RRGKSRSDAR DPYDRDNDFA YGQDDPQGLH CPFGAHIRRA NPRDSLQPDD PTQQQLTARH RLLRRGRSFE AGQGSGGPGR GGKPEKGLLF VAVCADVERQ FELVQQSWVS SPSFHGLSHE PDPIIASAPD DPAKKRVFTI PTAAGPLTLH GIHSYVTVTG GGYFFMPSRS ALQYLIDLE
|
| |