Gene Rpal_0514 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_0514 
Symbol 
ID6408163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp561318 
End bp562526 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content68% 
IMG OID642710426 
Productbeta-ketoadipyl CoA thiolase 
Protein accessionYP_001989549 
Protein GI192288944 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases
[TIGR02430] beta-ketoadipyl CoA thiolase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.201521 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGACG TGTTTGTCTG CGACGCGGTG CGCACCCCGA TCGGCCGTTA CGGCGGATCC 
CTCGCCAAGG TGCGGACCGA TGATCTCGCG GCGGTGCCGA TCAAGGCGCT GATGGCCAAA
CATCCGGACA TGGATTGGAG CGCGGTGGAC GAGGTGTTCT TCGGCTGCGC CAATCAGGCC
GGCGAAGACA ACCGCAACGT TGCCCGGATG GCGGCGCTGC TGGCGGGTTT GCCGGATTCG
GTGCCCGGCC AGACTCTCAA CCGGTTGTGC GCCTCCGGCC TCGATGCGGT CGGAGCCGCG
GGCCGTGCGA TCCGCGGCGG CGAGATCGAT CTGGCGATCG CCGGCGGCGT CGAGTCGATG
ACCCGGGCGC CGTTCGTGCA GGGCAAGGCC ACCGAAGCGT TCGCGCGCTC GGCCGACATC
TTCGACACCA CGATCGGTTG GCGCTTCATC AATCCGCTGA TGAAACAGCA ATACGGCGTC
GACTCGATGC CGGAGACCGG TGAGAACGTC GCCGAAGAAT TCCAGATCTC GCGTGCCGAT
CAGGATGCGT TCGCGATCCG CAGCCAGCAG CGCGCCGGCG CGGCGATCAC GGCCGGCTAC
TTCGCCGAGG AGATCGCGCC GGTGACCTAC GCGGGCGGCA AGGCCGGCCC GATCACTGTC
GATAAGGACG AGCATCCGCG CCCCGAAACC ACGCTGGAGG GCCTCGCCAA GCTAAAGCCG
ATCGTGCGCA ATCCGGGCAC CGTGACGGCC GGTAACGCCT CGGGCGTCAA TGACGGCGCC
GCAGCGCTGC TGATCGCCTC GGAAGCCGCG GTGAAGAAAT ACGGCCTGAC GCCGCGCGCC
AAGATTCTGG GCCTCGCCTC GGCGGCGGTG CCGCCGCGGA TCATGGGCAT CGGCCCGGTG
CCGGCGACCC GCAAGCTGAT GGAACGGCTC GGGCTGAAGA TCAGCGACTT CGACCTGATC
GAGCTGAACG AAGCGTTCGC CTCGCAGGGC ATCGCCTGCC TGCGCCAACT CGGCGTTGCC
GACGATGCCG ACTTCGTCAA TCCGCACGGC GGCGCGATCG CACTCGGCCA CCCGCTCGGT
ATGAGCGGCG CCCGCCTGGC GCTGACCGCG GTGCACGGCC TCGAAAAGCG CGGCGGCAAG
CTGGCGCTGG CCACGATGTG CGTCGGCGTC GGCCAGGGCG TTGCGATGGC GATCGAGAAG
CTGAACTAA
 
Protein sequence
MADVFVCDAV RTPIGRYGGS LAKVRTDDLA AVPIKALMAK HPDMDWSAVD EVFFGCANQA 
GEDNRNVARM AALLAGLPDS VPGQTLNRLC ASGLDAVGAA GRAIRGGEID LAIAGGVESM
TRAPFVQGKA TEAFARSADI FDTTIGWRFI NPLMKQQYGV DSMPETGENV AEEFQISRAD
QDAFAIRSQQ RAGAAITAGY FAEEIAPVTY AGGKAGPITV DKDEHPRPET TLEGLAKLKP
IVRNPGTVTA GNASGVNDGA AALLIASEAA VKKYGLTPRA KILGLASAAV PPRIMGIGPV
PATRKLMERL GLKISDFDLI ELNEAFASQG IACLRQLGVA DDADFVNPHG GAIALGHPLG
MSGARLALTA VHGLEKRGGK LALATMCVGV GQGVAMAIEK LN