Gene RPD_1824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1824 
SymbolpaaA 
ID4022306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2042952 
End bp2043935 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content66% 
IMG OID637962018 
Productphenylacetate-CoA oxygenase subunit PaaA 
Protein accessionYP_568961 
Protein GI91976302 
COG category[S] Function unknown 
COG ID[COG3396] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02156] phenylacetate-CoA oxygenase, PaaG subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.123429 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACACCC AGGCGCTCAA TGTCTCCGAC GGCGACGACC GAAATCTGGA GGATGCCGGC 
CGCGCCGCGC AGTTTCAGGC GCGGATCGAC GCCGAGGAGC GGATCGAGCC GAACGACTGG
ATGCCGGCGG CCTATCGCAA GACGCTGACG CGGCAGATTT CCCAGCACGC CCATTCCGAA
ATTGTCGGGA TGCTGCCGGA AGGCAACTGG ATCACCCGGG CGCCGACGCT GCGCCGCAAG
GCGGCCTTGC TCGCCAAGGT GCAGGACGAG TGCGGCCACG GGCTGTATCT CTACGCCGCC
GCCGAGACGC TCGGCTCCTC GCGCGAGGAG CTGGTCGATC AGATGCTGAG CGGCAAGGCG
AAGTACTCCT CGATCTTCAA CTACCCGACG TTGACCTGGG CGGATATCGG CGCGATCGGC
TGGCTGGTCG ACGGCGCTGC GATCATGAAC CAGATCCCGC TGTGCCGCTG TTCCTACGGT
CCCTATGCCC GCGCGATGAT CCGCGTCTGC AAGGAGGAGT CGTTCCACCA GCGTCAGGGT
TACGAGATCA TGTTGACGCT GTGCCGCGGC TCCGCCGAGC AGAAGGCGAT GGCGCAGGAC
GCGCTGAACC GCTGGTGGTG GCCGTGCCTG ATGATGTTCG GCCCGCCGGA TCAGGCGAGC
CAGCACAGCG ACACCTCGAC CAAATGGAAG ATCAAGCGCT TCTCCAACGA CGAGCTGCGC
CAGAAATTCG TCGATGCCAC CGTGCCGCAG GCGCATTACC TCGGCCTGAC GCTTCCCGAT
CCGGCGCTGA CCAAGAACGA GGCGACCGGG CATTGGGACT ACGGCGCGAT CGACTGGGAT
GAATTCAAGC AGGTGCTGGC CGGCAACGGC CCGTGCAACC GCGATCGCCT CGCGGCGCGG
CGCAAGGCCC ATGACGACGG CGCCTGGGTT CGCGACGCCG CGGTCGCCTA TGCCGAAAAG
CGCAAGAACA GACTGGCGGC GTAA
 
Protein sequence
MYTQALNVSD GDDRNLEDAG RAAQFQARID AEERIEPNDW MPAAYRKTLT RQISQHAHSE 
IVGMLPEGNW ITRAPTLRRK AALLAKVQDE CGHGLYLYAA AETLGSSREE LVDQMLSGKA
KYSSIFNYPT LTWADIGAIG WLVDGAAIMN QIPLCRCSYG PYARAMIRVC KEESFHQRQG
YEIMLTLCRG SAEQKAMAQD ALNRWWWPCL MMFGPPDQAS QHSDTSTKWK IKRFSNDELR
QKFVDATVPQ AHYLGLTLPD PALTKNEATG HWDYGAIDWD EFKQVLAGNG PCNRDRLAAR
RKAHDDGAWV RDAAVAYAEK RKNRLAA