Gene Information |
Locus tag | RPB_3645 |
Symbol | |
ID | 3911447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4180778 |
End bp | 4183996 |
Gene Length | 3219 bp |
Protein Length | 1072 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637885547 |
Product | FAD-binding oxidoreductase |
Protein accession | YP_487251 |
Protein GI | 86750755 |
COG category | [P] Inorganic ion transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0369] Sulfite reductase, alpha subunit (flavoprotein); [COG2124] Cytochrome P450 |
TIGRFAM ID | |
|
|
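The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 4180778
END_BP = 4183996
GENE_LENGTH_BP = 3219
PROTEIN_LENGTH_AA = 1072


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate interval (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the interval spans 3219 bp, and 3219 / 3 = 1073 codons, one of which is the stop codon, leaving 1072 amino acids.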
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.368759 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.294023 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTCGT CCAACAAGCT CGCGCCGATT CCGCATCCAC CGAAGCAGCC GGTGGTCGGC AACATGCTGT CGATCGACAC CAAGGCGCCG GTGCAGCATC TGGTGCGTCT CGCCGAGGAA CTCGGGCCGA TCTTCTGGCT CGACATGATG GGCGCGCCGA TCGTGATCGT GTCGGGCTAC GATCTGGTCG ACGAGATCAG CGACGAGAAG CGTTTCGACA AGGCGGTGCG CGGCGCGCTG CGTCGGGTCC GTACGGTCGG CGGCGACGGG CTGTTCACGG CCGACACCAG CGAGCCGAAC TGGAGCAAGG CACACAACAT CCTGCTGACG CCGTTCGGCG GCCGTGCCAT GCAGTCGTAT CATCCGAGTA TGGTCGATAT AGCCGAGCAG CTCGTCAAAA AATGGGAGCG TCTCAACGCC GACGACGAGA TCGACGTCGT TCACGACATG ACCGCGCTGA CGCTCGACAC CATCGGCCTG TGCGGCTTCG ACTATCGCTT CAATTCGTTC TATCGGCGCG ACTACCACCC CTTCGTGGAA TCGCTGGTGC GCTCGCTCGA GACCATCATG ATGACCCGCG GCCTGCCGCT GGAAAATCTC TGGATGAAGA AGCGGCGCGA CACGCTGGCC GAAGACGTCG CCTTCATGAA TGCGATGGTC GACGAGATCA TCGCCGAGCG ACGCAAGGCG GCCGCCGTCG CCGACAAGAT GGACATGCTC GGCGCGATGA TGACCGGCGT CGATAAGGTC ACCGGCGAGC CGCTCGACGA CGTCAACATC CGCTATCAGA TCAACACCTT CCTGATCGCC GGCCACGAGA CCACCAGCGG GCTGCTGTCC TGCGCGATCT ATGCGCTGTT GAAGCATCCC GAGGTTTTGC AGAAGGCCTA TGACGAGGTC GACCGCGTGC TCGGCGCCGA CACGTCGGTC GAGCCGAGCT ATCAGCAGGT CAATCAGCTC GGCTATATCA CCCAGATTCT CAAGGAGACG CTGCGGCTAT GGCCGCCGGC GCCGGCCTAC GGCGTGGCGC CGATCCAGGA CGAGACCATC GGCGGCCAAT ATCATCTGAA ACGCGGCACC TTCACCACGG TGCTGGTGCT GGCGCTGCAT CGCGACCCGA GTATCTGGGG TCCGAATCCG GATGCGTTCG ACCCGGAGAA TTTTTCGCGC GAGGCGGAAT CCAAGCGCCC GGCCAATGCG TGGAAACCGT TCGGCAACGG CCAGCGCGCT TGCATCGGCC GCGGCTTTGC GATGCACGAG GCGGCGCTGG CGCTCGGCAT GATCCTGCAA CGCTTCAAGC TGATCGATCA CACGCGCTAT CGCATGGTGC TGAAGGAAAC GCTGACGATC AAGCCGGAGG GCTTCAAGAT CAAGGTGCGG CCCCGCAGCG ACAAGGATCG AGCCACGCGG ATCGCGTCGG GAGTATCGCA CTCTGTGGCC CCGGCCCCGG CCGCGCCGCG CGCGCGGCCG GGCCACAACA CGCCGCTGCT GGTGCTGTAC GGCTCCAATC TCGGCACCGC CGAGGAGCTG GCGCACCGCG TCGCCGATCT CGCCGACCTG AACGGCTTCG CGACGCGACT CGGCGCGCTC GATCAGTATG TCGGTCAGTT GCCGGAAGAG GGGGGCGTAC TGATCTTCGC CGCCTCCTAC AACGGCGCGC CGCCGGACAA CGCCACGCAG TTCGTGCGCT GGCTGTCGGG CGATTTGCCG CCCGATGCCT TTGCCAAGCT GCGCTATGCC GTGTTCGGCT GCGGCAATCG CGACTGGACC GCGACCTATC AGGCGATCCC GCGGCTGATC 
GACGAGCGCC TCGCCGCGCA TGGCGGCCGC AACATCTTCG TGCGCGGCGA GGGCGACGCC CGTGACGATC TCGAAGGCCA GTTCGAGGCC TGGTTCGCCA CGCTCGGCCC GCTGGCGGTG AAGGAGTTCG GCATCGACGC TGCGTTCGAT CGCGGTGCCG ACGATACGCC GCTGTATGGA ATCGAGCCCC TCGCGCCGGC GGCGTCGCAG CCGCTGGCCG CCACTGGCGT CGCAGTGGCG ATGCGCGTGC TGGAGAACCG CGAGCTGCAG GATCGCGCAG CCTCCGGCCG CTCGACCCGG CACATCGAGA TCGCATTGCC GCAGGGCATG AGCTACCGCG TCGGGGATCA TCTCAGCGTG ATCCCGCGCA ACGATCCGGC GCTGGTCGCC GCCGTCGCGC AGCGCTTCGG CTTTGCGCCC GACGACCAGA TCAGATTGTC GGCGGCGCCC GGGCGCCGCG CGCAATTGCC GGTGGGTGAA GCCGTGTCGA TCGGCGGCCT GCTCGGCGAC CATGTCGAAC TGCAGCAGGT GGCCACCCGC AAGCAGATCG TGGCGCTGGC CGCGCACACG CGCTGTCCGC AGACGCGACC GAAGCTGCAG GCGCTCGCCG GCGGCGACGG CGCGGCCGAC GATGCCTATC GCGCGGAGGT ACTGGGTAAG CGCCGGTCGG TGTTCGATCT CTTGCAGGAA CATCCCGCTT GCGAGTTGCC GTTCGCGGCC TATCTTGAAA TGCTGACGCC GCTGCAGCCG CGTTACTACT CGATCTCGTC GTCACCGGCG CGAGATCCGG CGCGGGCCTC GGTCACCGTC GCGGTGGTCG AGGGACCGGC GCTGTCCGGC CGTGGCATCT ATCGCGGCGC GTGCTCGAGC TGGCTCGCCG GCCGCGGCAG CGGCGATACC GTTCAGGCCA CGGTACGTGC GACCAAGGCC TGCTTCCGTC TGCCGGACGA CGATCGCGTG CCGTTGATCA TGATCGGGCC GGGCACCGGG GTGGCGCCGT TCCGCGGCTT TCTACAGGAG CGTTCCGCGC GCAAGGTCGG CGGCGCAACG CTCGGCCCAG CGCTGCTGTT CTTCGGTTGC CGCCATCCGG CGCAGGACTA TCTCTATGCC GACGAATTGC AGGGCTTCGC GGCCGACGGA ATCGTCGAAT TGCACGCCGC GTTCTCGCGC GGCGACGGGC CCAAGACCTA TGTGCAACAT CTGATCGCCG CGCAAAAGGA TCGGGTGTTC GCATTGATCG AGCAGGGCGC GATCGTTTAT GTCTGCGGCG ACGGCGGCCG GATGGAGCCG GATGTCAAGG CCGCGCTGTG TGCGATCCAT CGCGAGCGCA GCGGCGCCGA CGCGACGGCC GCCGCGGCAT GGATTGCGGA TCTCGGCGCG CGCGATCGCT ACGTGCTCGA TGTTTGGGCG AGCGTGTAA
|
Protein sequence | MSSSNKLAPI PHPPKQPVVG NMLSIDTKAP VQHLVRLAEE LGPIFWLDMM GAPIVIVSGY DLVDEISDEK RFDKAVRGAL RRVRTVGGDG LFTADTSEPN WSKAHNILLT PFGGRAMQSY HPSMVDIAEQ LVKKWERLNA DDEIDVVHDM TALTLDTIGL CGFDYRFNSF YRRDYHPFVE SLVRSLETIM MTRGLPLENL WMKKRRDTLA EDVAFMNAMV DEIIAERRKA AAVADKMDML GAMMTGVDKV TGEPLDDVNI RYQINTFLIA GHETTSGLLS CAIYALLKHP EVLQKAYDEV DRVLGADTSV EPSYQQVNQL GYITQILKET LRLWPPAPAY GVAPIQDETI GGQYHLKRGT FTTVLVLALH RDPSIWGPNP DAFDPENFSR EAESKRPANA WKPFGNGQRA CIGRGFAMHE AALALGMILQ RFKLIDHTRY RMVLKETLTI KPEGFKIKVR PRSDKDRATR IASGVSHSVA PAPAAPRARP GHNTPLLVLY GSNLGTAEEL AHRVADLADL NGFATRLGAL DQYVGQLPEE GGVLIFAASY NGAPPDNATQ FVRWLSGDLP PDAFAKLRYA VFGCGNRDWT ATYQAIPRLI DERLAAHGGR NIFVRGEGDA RDDLEGQFEA WFATLGPLAV KEFGIDAAFD RGADDTPLYG IEPLAPAASQ PLAATGVAVA MRVLENRELQ DRAASGRSTR HIEIALPQGM SYRVGDHLSV IPRNDPALVA AVAQRFGFAP DDQIRLSAAP GRRAQLPVGE AVSIGGLLGD HVELQQVATR KQIVALAAHT RCPQTRPKLQ ALAGGDGAAD DAYRAEVLGK RRSVFDLLQE HPACELPFAA YLEMLTPLQP RYYSISSSPA RDPARASVTV AVVEGPALSG RGIYRGACSS WLAGRGSGDT VQATVRATKA CFRLPDDDRV PLIMIGPGTG VAPFRGFLQE RSARKVGGAT LGPALLFFGC RHPAQDYLYA DELQGFAADG IVELHAAFSR GDGPKTYVQH LIAAQKDRVF ALIEQGAIVY VCGDGGRMEP DVKAALCAIH RERSGADATA AAAWIADLGA RDRYVLDVWA SV
|
| |
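The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The sketch below, with hypothetical helper names, strips the whitespace, computes a GC fraction, and checks for a start and stop codon. It assumes the stop codons of translation table 11 (TAA, TAG, TGA) and, for brevity, uses only the first two and the last few bases of the gene sequence rather than the full record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def starts_with_atg(seq: str) -> bool:
    """True if the sequence begins with the ATG start codon."""
    return seq.startswith("ATG")


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    head = clean_sequence("ATGTCCTCGT CCAACAAGCT")  # first two blocks of the gene
    tail = clean_sequence("AGCGTGTAA")              # final bases of the gene
    print(len(head), gc_fraction(head))
    print(starts_with_atg(head), ends_with_stop(tail))
```

Running `gc_fraction` on the complete cleaned gene sequence would give a value to compare with the 67% GC content listed in the record; the short fragment here is only for illustration.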