Gene RPB_0525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0525 
Symbol 
ID3909429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp586052 
End bp587260 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content69% 
IMG OID637882413 
Productbeta-ketoadipyl CoA thiolase 
Protein accessionYP_484147 
Protein GI86747651 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases
[TIGR02430] beta-ketoadipyl CoA thiolase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.859351 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACG TCTTCATCTG CGACGCAGTG CGCACCCCAA TCGGCCGGTT CGGCGGATCT 
CTCGCCAAAG TCCGTGCCGA CGACCTCGCC GCGGTTCCGA TCAAGGCGCT GATGGCCAAG
CACCCCGATC TCGACTGGAG CGCGGTGGAC GAGGTGTTCT TCGGCTGCGC CAACCAGGCC
GGCGAAGACA ACCGCAACGT CGCCCGGATG GCGACGCTGC TCGCGGGCCT GCCGGATTCG
GTGCCGGCCC AGACCCTCAA CCGGCTGTGC GCCTCCGGGC TCGACGCGGT CGGCGCCGCG
GGCCGCGCGA TCCGCGCCGG CGAGATCGAT CTGGCGATTG CCGGCGGCGT CGAATCGATG
ACACGGGCGC CGTTCGTGAT GGGCAAAGCC GGCGAGGCGT TTTCCCGCCA GGCGGACATC
TTCGACACCA CGATCGGCTG GCGTTTCATC AATCCGCTGA TGAAGGCGCA ATACGGTGTC
GACGCGATGC CGGAGACCGG CGAGAACGTC GCCGAGGAAT TCCAGATTTC GCGCGCCGAT
CAGGACGCCT TCGCGATCCG ATCCCAGCAG CGCGCAGGCG CCGCCATCGC CGCCGGTTAC
TTCGCGCAGG AGATCGCGCC GGTGTCGGCG CCGGGCGGCA AGGCCGGTCC GATCATCGTC
GACAAGGACG AGCATCCGCG CCCGGAGACG ACGCTGGAAG GCCTCGCCAA GCTGAAGCCG
ATCGTGCGCA ATCCCGGCAC GGTGACCGCC GGCAACGCCT CGGGCGTCAA TGACGGCGCT
GCGGCGATGA TCGTGGCCTC GGAGGCTGCG GTGAAGAAAC ACGGCCTGAC GCCCCGGGCG
AAGATTCTCG GCCTCGCCTC GGCGGCAGTG CCGCCGCGCA TCATGGGCAT CGGCCCGGTG
CCGGCGACCC GCAAGCTGAT GGAGCGGCTG GGGCTGAAGA TCTCCGACTT CGACCTGATC
GAGCTCAACG AAGCCTTCGC CTCGCAGGGC ATCGCCTGCC TGCGCCAGCT CGGCGTCGCC
GACGATGCCG ATTTCGTCAA TCCGCATGGT GGCGCGATCG CGCTCGGCCA CCCGCTCGGC
ATGAGCGGCA CGCGGCTGGC GCTGACGGCG GTGCACGGCA TGGAAGCCCG CGGCGGCAAA
TTGGCGCTGG CGACGATGTG CGTCGGCGTC GGCCAGGGCG TCGCGATGGC GATCGAGAAA
CTGAACTAA
 
Protein sequence
MADVFICDAV RTPIGRFGGS LAKVRADDLA AVPIKALMAK HPDLDWSAVD EVFFGCANQA 
GEDNRNVARM ATLLAGLPDS VPAQTLNRLC ASGLDAVGAA GRAIRAGEID LAIAGGVESM
TRAPFVMGKA GEAFSRQADI FDTTIGWRFI NPLMKAQYGV DAMPETGENV AEEFQISRAD
QDAFAIRSQQ RAGAAIAAGY FAQEIAPVSA PGGKAGPIIV DKDEHPRPET TLEGLAKLKP
IVRNPGTVTA GNASGVNDGA AAMIVASEAA VKKHGLTPRA KILGLASAAV PPRIMGIGPV
PATRKLMERL GLKISDFDLI ELNEAFASQG IACLRQLGVA DDADFVNPHG GAIALGHPLG
MSGTRLALTA VHGMEARGGK LALATMCVGV GQGVAMAIEK LN