Gene RPB_3641 details

Gene Information

Locus tag: RPB_3641
Symbol: paaA
ID: 3911443
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4177662
End bp: 4178729
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 66%
IMG OID: 637885543
Product: phenylacetate-CoA oxygenase subunit PaaA
Protein accession: YP_487247
Protein GI: 86750751
COG category: [S] Function unknown
COG ID: [COG3396] Uncharacterized conserved protein
TIGRFAM ID: [TIGR02156] phenylacetate-CoA oxygenase, PaaG subunit
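
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the last codon of the CDS is a stop codon, which is the usual convention for records like this one but is not stated in the record. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that is not translated.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


# Start bp and End bp taken from the record above.
gene_len, protein_len = cds_lengths(4177662, 4178729)
print(gene_len, protein_len)  # 1068 355
```

Under these assumptions the result agrees with the Gene Length (1068 bp) and Protein Length (355 aa) fields.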


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.805195
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCGTCC AATTGGTCAA TTATATTTCC AACAAGGCCG GGCCGCGCGC CCGAGCCTGC 
GAGGAAACTG CCGAGGACGC CGCCATGTAC ACACAGGCGC TCAACACCAC CGAGGCCGAG
GATCGCAACC TCGAGGACGC CGGCCGCGCC GCGCTGTTCC AGGCGCGGAT CGATGCCGAA
GAGCGGATCG AGCCCAACGA CTGGATGCCG GCGGCGTATC GCAAGACGCT GACGCGGCAG
ATCTCGCAGC ACGCCCATTC CGAGATCGTC GGCATGTTGC CGGAAGGCAA TTGGATCACC
CGCGCGCCGA CGCTGCGCCG CAAGGCCGCC TTGCTCGCCA AGGTGCAGGA CGAATGCGGC
CACGGGCTGT ATCTCTACGC CGCCGCAGAG ACGCTCGGCT CGTCGCGCGA AGAGCTGGTC
GATCAGATGC TGAGCGGCAA GGCGAAGTAC TCCTCGATCT TCAACTACCC GACGCTGACC
TGGGCGGATA TCGGCGCGAT CGGCTGGCTG GTCGACGGCG CCGCGATCAT GAACCAGATT
CCGCTGTGCC GCTGCTCCTA CGGCCCCTAT GCGCGGGCGA TGATCCGCGT CTGCAAGGAG
GAGTCGTTCC ACCAGCGCCA GGGCTACGAG ATCATGCTGA CGCTGTGCCG CGGTTCGGCC
GAGCAGAAGG CGATGGCGCA GGATGCGCTC GACCGCTGGT GGTGGCCATG CCTGATGATG
TTCGGCCCGC CGGATCAGGC CAGCCAGCAC AGCGACACCT CGACCAGATG GAAGATCAAG
CGCTTCTCCA ACGACGAATT GCGCCAGAAA TTCGTCGATG CGACCGTGCC GCAGGCGCAC
TATCTCGGGC TCACGATTCC CGATCCGGCG TTGACCAGGA ACGAGTCCAC CGGGCACTGG
GACTACGGCA CGATCGACTG GGACGAATTC AAGCAGGTGC TGGCCGGCAA CGGCCCGTGC
AACCGCGACC GGCTGGCGGC GCGGCGCAAG GCGCATGACG ACGGCGCCTG GGTTCGCGAA
GCCGCGATGG CCTTCGCCGA AAAGCGCAAG AAGAAGATCG CGGCCTAG
 
Protein sequence
MTVQLVNYIS NKAGPRARAC EETAEDAAMY TQALNTTEAE DRNLEDAGRA ALFQARIDAE 
ERIEPNDWMP AAYRKTLTRQ ISQHAHSEIV GMLPEGNWIT RAPTLRRKAA LLAKVQDECG
HGLYLYAAAE TLGSSREELV DQMLSGKAKY SSIFNYPTLT WADIGAIGWL VDGAAIMNQI
PLCRCSYGPY ARAMIRVCKE ESFHQRQGYE IMLTLCRGSA EQKAMAQDAL DRWWWPCLMM
FGPPDQASQH SDTSTRWKIK RFSNDELRQK FVDATVPQAH YLGLTIPDPA LTRNESTGHW
DYGTIDWDEF KQVLAGNGPC NRDRLAARRK AHDDGAWVRE AAMAFAEKRK KKIAA
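
The gene sequence starts with TTG while the protein sequence starts with M. That is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), in which TTG is an accepted start codon and the initiator is translated as methionine. The following is a minimal sketch of that translation, not the tool used to produce this record: the name `translate_cds` is hypothetical, and the codon table is built from the standard NCBI ordering of bases (T, C, A, G).

```python
# Amino acids of NCBI translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons listed for table 11; any of them is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS with table 11, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS must be non-empty and a multiple of 3 bases long")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # initiator codon is always translated as Met
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence above.
print(translate_cds("TTGACCGTCC AATTGGTCAA T"))  # MTVQLVN
```

The printed result matches the first seven residues of the protein sequence. Passing the full gene sequence to the same function should reproduce the whole protein if the record is internally consistent.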