Gene RPD_3306 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3306 
Symbol 
ID4023816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3660855 
End bp3662042 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content68% 
IMG OID637963510 
ProductAcetyl-CoA C-acetyltransferase 
Protein accessionYP_570431 
Protein GI91977772 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.836004 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGATC CTGTCGTCAT CGTTTCCGCG GCGCGCACGC CGCTCGGCCG GTTCCAGGGC 
GAACTGTCTG CACTCAGCGC TCATCAACTC GGCAGCCAGG TGATCGGCGC AGCGCTGGCG
CGGGGCAAGC TCGCCCCCGA ACGGATCGAC GAAGTCCTGA TGGGCTGTGT TCTCACCGCC
GGCCAGGGTC AGGCACCGGC ACGGCAGGCG GCGCGCGGTG CGAAATTGCC GGACGCCACC
GGCGCCACAA CGGTCAACAA GGTCTGCGGC TCCGGCATGA AAGCGACCAT GCTGGCAAAC
GACCTGATCC GCGCCGGCTC TGCCGACATC GTGCTGTCGG GCGGCATGGA GAGCATGAGC
AACGCCCCCT ATCTGCTGGC CAAGGCGCGC AGCGGCTATC GCGTCGGCCA CGACCGGATC
ATCGACCACA TGCTGATGGA CGGCCTGGAA GACGCCTATG AGAGCGGTCG GTCGATGGGC
GATTTCGGCG AGGCCACCGC CGAGGCCTAT CAATTCACCC GCGCCGACCA GGACGCCTAT
GCGATCGAGA CGCTGACCCG CGCCCGCAAT GCGGTTCAGA CCGGCGCGTT CCATGCGGAG
ATTGTGCCGG TCACCGTGAC CGACAAGGCC GGACCGCGCG AAATCGCCAA TGACGAACAC
CCGCTGAAGG TCGATCCGGC GAAGATCCCC GCTTTGAAGC CGGCGTTCCG AGCCGGCGGC
ACGATCACGC CGGCGGCCTC CTCCGCAAAT GCCGACGGCG CCGCGGCGCT GATTCTGGCG
CGGCGCTCGC TCGCCGAGCG CGACGGCCTG CCGCTACTGG CCGAGATCAA GGGCCATGCC
ACCCACAGCC AGGAGCCGCA ATGGTTCACC ACCGCGCCGA TCCCGGCGAT CCGTAAACTC
CTCGACAAGG TCGGCTGGAA CGTCAAGGAC GTCGACCTGT TCGAAATCAA CGAGGCCTTC
GCGGTGGTGG CAATGGCGGC GCGACAGGAC CTCGATATTC CGCGCGACAG GCTCAACGTC
AATGGCGGCG CCTGCGCGCT CGGCCACCCG ATCGGCGCCA CCGGCGCGCG GCTGATCGTG
ACCCTGCTGC ACGCGCTGCA GGCACGCGGC CTGAAACGCG GCGTCGCGGC GCTGTGCATC
GGCGGCGGTG AAGCCACCGC GATTGCAATC GAGCGCGACG CTCACTAG
 
Protein sequence
MSDPVVIVSA ARTPLGRFQG ELSALSAHQL GSQVIGAALA RGKLAPERID EVLMGCVLTA 
GQGQAPARQA ARGAKLPDAT GATTVNKVCG SGMKATMLAN DLIRAGSADI VLSGGMESMS
NAPYLLAKAR SGYRVGHDRI IDHMLMDGLE DAYESGRSMG DFGEATAEAY QFTRADQDAY
AIETLTRARN AVQTGAFHAE IVPVTVTDKA GPREIANDEH PLKVDPAKIP ALKPAFRAGG
TITPAASSAN ADGAAALILA RRSLAERDGL PLLAEIKGHA THSQEPQWFT TAPIPAIRKL
LDKVGWNVKD VDLFEINEAF AVVAMAARQD LDIPRDRLNV NGGACALGHP IGATGARLIV
TLLHALQARG LKRGVAALCI GGGEATAIAI ERDAH