Gene RPD_0320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0320 
Symbol 
ID4020779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp368865 
End bp370073 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content70% 
IMG OID637960498 
Productbeta-ketoadipyl CoA thiolase 
Protein accessionYP_567459 
Protein GI91974800 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases
[TIGR02430] beta-ketoadipyl CoA thiolase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.970819 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACG TGTTTGTCTG CGACGCAGTG CGCACCCCGA TCGGCCGGTT CGGCGGGTCG 
CTCGCCCGGG TCCGCGCCGA CGACCTCGCC GCGGTTCCGA TCAAGGCGCT GATCGCCAGG
CACCCCAATC TCGACTGGAG CGCGGTGGAC GAGGTGTTTT TCGGCTGCGC CAACCAAGCC
GGCGAGGACA ACCGCAACGT CGCGCGGATG GCGGCGCTGC TGGCCGGCCT GCCGGATTCG
GTGCCGGGCC AGACCCTGAA CCGGCTGTGC GCCTCGGGCC TCGACGCGGT CGGCGCCGCG
GGCCGCGCGA TCCGCTCCGG CGAGATCGAT CTGGCTATCG CCGGCGGCGT CGAATCGATG
ACGCGGGCGC CGTTCGTGCA AGGCAAGGCG ACCGAGGCGT TCTCGCGCCA GGCCGAGATT
TTCGACACCA CGATCGGCTG GCGTTTCATC AACCCGCTGA TGAAGGCGCA ATATGGCGTC
GACGCGATGC CGGAGACCGG CGAGAACGTC GCCGAGGAAT TCCAGATTTC GCGCGCCGAC
CAGGACGCCT TCGCGATCCG CTCCCAGCAG CGCGCCGGCG CGGCGATCGC CGCGGGTTAT
TTCGCCGAGG AGATCGCGCC GGTGTCGGCG CCGGGCGGCA AAGCGGGGCC GATCATCGTC
GACAAGGACG AGCATCCGCG CCCGGAGACC ACGCTGGAAG GCCTCGCCAA GCTCAAGCCG
ATCGTGCGCA ATCCCGGCAC GGTGACGGCC GGCAACGCCT CGGGCGTCAA TGACGGCGCA
GCGGCGATCA TCGTCGCCTC CGAAGCCGCG GTGAAGAAAC ACGGGCTGAC GCCGCGGGCG
CGCATTCTCG GCCTCGCCTC GGCCGCGGTG CCGCCGCGGA TCATGGGCAT CGGCCCGGTG
CCGGCGACCC GCAAGCTGAT GGACCGGCTC GGCCTGAAGA TCAGCGATTT CGACCTGATC
GAACTCAACG AGGCGTTCGC CTCGCAGGGC ATCGCCTGCC TGCGTCAGCT CGGCGTCGCC
GACGATGCGG ATTTCGTCAA TCCGCATGGC GGCGCGATCG CGCTCGGCCA TCCGCTCGGC
ATGAGCGGCG CGCGGCTGGC GCTGACAGCG GTGCACGGCA TGGAAAAGCG CGGCGGCAGG
CTGGCGCTGG CGACGATGTG CGTCGGCGTC GGCCAGGGCG TCGCGATGGC GATCGAGAAG
TTGAACTAA
 
Protein sequence
MADVFVCDAV RTPIGRFGGS LARVRADDLA AVPIKALIAR HPNLDWSAVD EVFFGCANQA 
GEDNRNVARM AALLAGLPDS VPGQTLNRLC ASGLDAVGAA GRAIRSGEID LAIAGGVESM
TRAPFVQGKA TEAFSRQAEI FDTTIGWRFI NPLMKAQYGV DAMPETGENV AEEFQISRAD
QDAFAIRSQQ RAGAAIAAGY FAEEIAPVSA PGGKAGPIIV DKDEHPRPET TLEGLAKLKP
IVRNPGTVTA GNASGVNDGA AAIIVASEAA VKKHGLTPRA RILGLASAAV PPRIMGIGPV
PATRKLMDRL GLKISDFDLI ELNEAFASQG IACLRQLGVA DDADFVNPHG GAIALGHPLG
MSGARLALTA VHGMEKRGGR LALATMCVGV GQGVAMAIEK LN