Gene Rpal_4289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4289 
SymbolpaaA 
ID6411973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4616321 
End bp4617307 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content66% 
IMG OID642714171 
Productphenylacetate-CoA oxygenase subunit PaaA 
Protein accessionYP_001993260 
Protein GI192292655 
COG category[S] Function unknown 
COG ID[COG3396] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02156] phenylacetate-CoA oxygenase, PaaG subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00273791 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACACCC AGGCACTGAA TGTGTCGGAC GGTGACGAGC GGACCATCGA GGACGCCGAG 
CGCGCCGCGC GTTTCCAGGC GCGGATCGAT GCCGAGGAGC GGATCGAGCC GAACGATTGG
ATGCCGGCGG CGTATCGCAA GACGTTGGTC CGACAGATCT CCCAGCACGC GCATTCCGAA
GTCGTCGGCA TGCTGCCGGA GGGCAACTGG ATCACCCGCG CGCCGACGCT GCGGCGCAAG
GCGGCGCTGC TCGCCAAGGT CCAGGACGAA TGCGGCCACG GCCTGTACCT TTACGCCGCC
GCCGAAACGC TCGGCGCCTC GCGCGAGGAG CTGGTCGATC AGCTGCTCAG CGGCAAGGCG
AAGTACTCGT CGATCTTCAA CTATCCGACC CTGACCTGGG CCGATATCGG CGCGATCGGC
TGGCTGGTCG ACGGCGCCGC GATCATGAAC CAGATCCCGC TGTGCCGCTG CTCCTACGGG
CCGTATGCGC GCGCGATGAT CCGCGTCTGC AAGGAGGAAT CCTTCCACCA GCGACAGGGC
TACGAGATCA TGCTGACGCT AGCCAAGGGC TCGGCCGAGC AGAAAGCGTT GGCGCAGGAC
GCGCTGAACC GCTGGTGGTG GCCGTGCCTG ATGATGTTCG GCCCGCCCGA TCAGGCCAGC
CAGCACAGCG ACACCTCCAC CAAGTGGAAG ATCAAGCGGT TCTCCAACGA CGAGTTACGG
CAGAAATTCG TCGATGCCAC GGTGCCGCAG GCGCACTATC TCGGCCTGAC GCTTCCCGAT
CCCGATCTGA AGCAGAACGA CGCGACCGGG CATTGGGAGT ACGGCGAAAT TCCTTGGGAC
GAGTTCAAGC AGGTGCTCGC CGGCAACGGC CCTTGCAACC GCGACCGCAT GGCGGCGCGG
CGCAAGGCCC ACGACGACGG CGCCTGGGTG CGCGAGGCGG CCGCTGCCTA CGCCGAGAAA
CGCAAGAAGA AACTGGCGGC GGCGTAA
 
Protein sequence
MYTQALNVSD GDERTIEDAE RAARFQARID AEERIEPNDW MPAAYRKTLV RQISQHAHSE 
VVGMLPEGNW ITRAPTLRRK AALLAKVQDE CGHGLYLYAA AETLGASREE LVDQLLSGKA
KYSSIFNYPT LTWADIGAIG WLVDGAAIMN QIPLCRCSYG PYARAMIRVC KEESFHQRQG
YEIMLTLAKG SAEQKALAQD ALNRWWWPCL MMFGPPDQAS QHSDTSTKWK IKRFSNDELR
QKFVDATVPQ AHYLGLTLPD PDLKQNDATG HWEYGEIPWD EFKQVLAGNG PCNRDRMAAR
RKAHDDGAWV REAAAAYAEK RKKKLAAA