Gene RPB_3645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3645 
Symbol 
ID3911447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4180778 
End bp4183996 
Gene Length3219 bp 
Protein Length1072 aa 
Translation table11 
GC content67% 
IMG OID637885547 
ProductFAD-binding oxidoreductase 
Protein accessionYP_487251 
Protein GI86750755 
COG category[P] Inorganic ion transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0369] Sulfite reductase, alpha subunit (flavoprotein)
[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.368759 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.294023 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTCGT CCAACAAGCT CGCGCCGATT CCGCATCCAC CGAAGCAGCC GGTGGTCGGC 
AACATGCTGT CGATCGACAC CAAGGCGCCG GTGCAGCATC TGGTGCGTCT CGCCGAGGAA
CTCGGGCCGA TCTTCTGGCT CGACATGATG GGCGCGCCGA TCGTGATCGT GTCGGGCTAC
GATCTGGTCG ACGAGATCAG CGACGAGAAG CGTTTCGACA AGGCGGTGCG CGGCGCGCTG
CGTCGGGTCC GTACGGTCGG CGGCGACGGG CTGTTCACGG CCGACACCAG CGAGCCGAAC
TGGAGCAAGG CACACAACAT CCTGCTGACG CCGTTCGGCG GCCGTGCCAT GCAGTCGTAT
CATCCGAGTA TGGTCGATAT AGCCGAGCAG CTCGTCAAAA AATGGGAGCG TCTCAACGCC
GACGACGAGA TCGACGTCGT TCACGACATG ACCGCGCTGA CGCTCGACAC CATCGGCCTG
TGCGGCTTCG ACTATCGCTT CAATTCGTTC TATCGGCGCG ACTACCACCC CTTCGTGGAA
TCGCTGGTGC GCTCGCTCGA GACCATCATG ATGACCCGCG GCCTGCCGCT GGAAAATCTC
TGGATGAAGA AGCGGCGCGA CACGCTGGCC GAAGACGTCG CCTTCATGAA TGCGATGGTC
GACGAGATCA TCGCCGAGCG ACGCAAGGCG GCCGCCGTCG CCGACAAGAT GGACATGCTC
GGCGCGATGA TGACCGGCGT CGATAAGGTC ACCGGCGAGC CGCTCGACGA CGTCAACATC
CGCTATCAGA TCAACACCTT CCTGATCGCC GGCCACGAGA CCACCAGCGG GCTGCTGTCC
TGCGCGATCT ATGCGCTGTT GAAGCATCCC GAGGTTTTGC AGAAGGCCTA TGACGAGGTC
GACCGCGTGC TCGGCGCCGA CACGTCGGTC GAGCCGAGCT ATCAGCAGGT CAATCAGCTC
GGCTATATCA CCCAGATTCT CAAGGAGACG CTGCGGCTAT GGCCGCCGGC GCCGGCCTAC
GGCGTGGCGC CGATCCAGGA CGAGACCATC GGCGGCCAAT ATCATCTGAA ACGCGGCACC
TTCACCACGG TGCTGGTGCT GGCGCTGCAT CGCGACCCGA GTATCTGGGG TCCGAATCCG
GATGCGTTCG ACCCGGAGAA TTTTTCGCGC GAGGCGGAAT CCAAGCGCCC GGCCAATGCG
TGGAAACCGT TCGGCAACGG CCAGCGCGCT TGCATCGGCC GCGGCTTTGC GATGCACGAG
GCGGCGCTGG CGCTCGGCAT GATCCTGCAA CGCTTCAAGC TGATCGATCA CACGCGCTAT
CGCATGGTGC TGAAGGAAAC GCTGACGATC AAGCCGGAGG GCTTCAAGAT CAAGGTGCGG
CCCCGCAGCG ACAAGGATCG AGCCACGCGG ATCGCGTCGG GAGTATCGCA CTCTGTGGCC
CCGGCCCCGG CCGCGCCGCG CGCGCGGCCG GGCCACAACA CGCCGCTGCT GGTGCTGTAC
GGCTCCAATC TCGGCACCGC CGAGGAGCTG GCGCACCGCG TCGCCGATCT CGCCGACCTG
AACGGCTTCG CGACGCGACT CGGCGCGCTC GATCAGTATG TCGGTCAGTT GCCGGAAGAG
GGGGGCGTAC TGATCTTCGC CGCCTCCTAC AACGGCGCGC CGCCGGACAA CGCCACGCAG
TTCGTGCGCT GGCTGTCGGG CGATTTGCCG CCCGATGCCT TTGCCAAGCT GCGCTATGCC
GTGTTCGGCT GCGGCAATCG CGACTGGACC GCGACCTATC AGGCGATCCC GCGGCTGATC
GACGAGCGCC TCGCCGCGCA TGGCGGCCGC AACATCTTCG TGCGCGGCGA GGGCGACGCC
CGTGACGATC TCGAAGGCCA GTTCGAGGCC TGGTTCGCCA CGCTCGGCCC GCTGGCGGTG
AAGGAGTTCG GCATCGACGC TGCGTTCGAT CGCGGTGCCG ACGATACGCC GCTGTATGGA
ATCGAGCCCC TCGCGCCGGC GGCGTCGCAG CCGCTGGCCG CCACTGGCGT CGCAGTGGCG
ATGCGCGTGC TGGAGAACCG CGAGCTGCAG GATCGCGCAG CCTCCGGCCG CTCGACCCGG
CACATCGAGA TCGCATTGCC GCAGGGCATG AGCTACCGCG TCGGGGATCA TCTCAGCGTG
ATCCCGCGCA ACGATCCGGC GCTGGTCGCC GCCGTCGCGC AGCGCTTCGG CTTTGCGCCC
GACGACCAGA TCAGATTGTC GGCGGCGCCC GGGCGCCGCG CGCAATTGCC GGTGGGTGAA
GCCGTGTCGA TCGGCGGCCT GCTCGGCGAC CATGTCGAAC TGCAGCAGGT GGCCACCCGC
AAGCAGATCG TGGCGCTGGC CGCGCACACG CGCTGTCCGC AGACGCGACC GAAGCTGCAG
GCGCTCGCCG GCGGCGACGG CGCGGCCGAC GATGCCTATC GCGCGGAGGT ACTGGGTAAG
CGCCGGTCGG TGTTCGATCT CTTGCAGGAA CATCCCGCTT GCGAGTTGCC GTTCGCGGCC
TATCTTGAAA TGCTGACGCC GCTGCAGCCG CGTTACTACT CGATCTCGTC GTCACCGGCG
CGAGATCCGG CGCGGGCCTC GGTCACCGTC GCGGTGGTCG AGGGACCGGC GCTGTCCGGC
CGTGGCATCT ATCGCGGCGC GTGCTCGAGC TGGCTCGCCG GCCGCGGCAG CGGCGATACC
GTTCAGGCCA CGGTACGTGC GACCAAGGCC TGCTTCCGTC TGCCGGACGA CGATCGCGTG
CCGTTGATCA TGATCGGGCC GGGCACCGGG GTGGCGCCGT TCCGCGGCTT TCTACAGGAG
CGTTCCGCGC GCAAGGTCGG CGGCGCAACG CTCGGCCCAG CGCTGCTGTT CTTCGGTTGC
CGCCATCCGG CGCAGGACTA TCTCTATGCC GACGAATTGC AGGGCTTCGC GGCCGACGGA
ATCGTCGAAT TGCACGCCGC GTTCTCGCGC GGCGACGGGC CCAAGACCTA TGTGCAACAT
CTGATCGCCG CGCAAAAGGA TCGGGTGTTC GCATTGATCG AGCAGGGCGC GATCGTTTAT
GTCTGCGGCG ACGGCGGCCG GATGGAGCCG GATGTCAAGG CCGCGCTGTG TGCGATCCAT
CGCGAGCGCA GCGGCGCCGA CGCGACGGCC GCCGCGGCAT GGATTGCGGA TCTCGGCGCG
CGCGATCGCT ACGTGCTCGA TGTTTGGGCG AGCGTGTAA
 
Protein sequence
MSSSNKLAPI PHPPKQPVVG NMLSIDTKAP VQHLVRLAEE LGPIFWLDMM GAPIVIVSGY 
DLVDEISDEK RFDKAVRGAL RRVRTVGGDG LFTADTSEPN WSKAHNILLT PFGGRAMQSY
HPSMVDIAEQ LVKKWERLNA DDEIDVVHDM TALTLDTIGL CGFDYRFNSF YRRDYHPFVE
SLVRSLETIM MTRGLPLENL WMKKRRDTLA EDVAFMNAMV DEIIAERRKA AAVADKMDML
GAMMTGVDKV TGEPLDDVNI RYQINTFLIA GHETTSGLLS CAIYALLKHP EVLQKAYDEV
DRVLGADTSV EPSYQQVNQL GYITQILKET LRLWPPAPAY GVAPIQDETI GGQYHLKRGT
FTTVLVLALH RDPSIWGPNP DAFDPENFSR EAESKRPANA WKPFGNGQRA CIGRGFAMHE
AALALGMILQ RFKLIDHTRY RMVLKETLTI KPEGFKIKVR PRSDKDRATR IASGVSHSVA
PAPAAPRARP GHNTPLLVLY GSNLGTAEEL AHRVADLADL NGFATRLGAL DQYVGQLPEE
GGVLIFAASY NGAPPDNATQ FVRWLSGDLP PDAFAKLRYA VFGCGNRDWT ATYQAIPRLI
DERLAAHGGR NIFVRGEGDA RDDLEGQFEA WFATLGPLAV KEFGIDAAFD RGADDTPLYG
IEPLAPAASQ PLAATGVAVA MRVLENRELQ DRAASGRSTR HIEIALPQGM SYRVGDHLSV
IPRNDPALVA AVAQRFGFAP DDQIRLSAAP GRRAQLPVGE AVSIGGLLGD HVELQQVATR
KQIVALAAHT RCPQTRPKLQ ALAGGDGAAD DAYRAEVLGK RRSVFDLLQE HPACELPFAA
YLEMLTPLQP RYYSISSSPA RDPARASVTV AVVEGPALSG RGIYRGACSS WLAGRGSGDT
VQATVRATKA CFRLPDDDRV PLIMIGPGTG VAPFRGFLQE RSARKVGGAT LGPALLFFGC
RHPAQDYLYA DELQGFAADG IVELHAAFSR GDGPKTYVQH LIAAQKDRVF ALIEQGAIVY
VCGDGGRMEP DVKAALCAIH RERSGADATA AAAWIADLGA RDRYVLDVWA SV