Gene RPD_3105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3105 
Symbol 
ID4023610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3450390 
End bp3451577 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content69% 
IMG OID637963306 
ProductAcetyl-CoA C-acetyltransferase 
Protein accessionYP_570232 
Protein GI91977573 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0273238 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCACG ATCCCATCGT TATCGTCGGC TCCGCACGCA CGCCGATGGG CGGTTTCCAG 
GGCGAGCTGA AGGACGCCAC TGCGTCCCAG CTCGGCTCCG CCGCCATCGC AGCCGCGGTC
GCGCGCGCCG GGCTGAAACC GGACGCGATC GACGAGGTGG TGTTCGGCTG CGTGCTGCCG
GCCGGCCAGG GCCAGGCTCC GGCGCGGCAG GCCGCGCTCG GCGCCGGGCT GCCGCTGTCG
ACCGGCGCCA CGACCATCAA CAAGATGTGC GGCTCGGGCA TGAAGGCGGC GATGCTGGCC
AATGATCTTC TGATCGCCGG AAGCGCGACA ATCGCGGTCG CCGGCGGCAT GGAGAGCATG
ACCAACGCCC CCTATCTGCT CGACCGTGCC CGCGGCGGCT ATCGCATGGG CCACGGCCGT
GTGCTCGACC ACATGTTCCT CGACGGGCTC GAAGACGCCT ACGACAAGGG CCGGCTGATG
GGCACCTTCG CCGAGGACTG TGCCCAGAAC TACCAGTTCA CCCGTGAGCT GCAGGACAAT
TTCGCCATCA CCTCGCTGAC CCGGGCACAG ACCGCGATCA AGGACGGCTC GTTCGCCGGC
GAGGTAACGC CGGTGACGGT GAAGTCCGGC AGGTCCGAGA TCACCGTGAC TACCGACGAA
CAGCCGCTGA AAGCGAAACT CGACAAGATC CCGACGCTGA AGCCGGCGTT CCGCGACGGC
GGCACGGTGA CAGCGGCCAA CTCCTCGTCG ATCTCCGACG GCGCCGCCGC TCTGGTGCTG
ATGCGTCGCT CGGAGGCCGA ACGGCGCGGG TTGACCCCGC TTGCCGCTAT CGCCGGCCAC
GCCACCCATG CCCATGAGCC CAATCTGTTT GCCACTGCGC CGATCGGCGC GATACGGAAG
CTCGCCGAGC GCACCGGTTG GAACCTCGCC GATGTCGACC TGTTCGAAAT CAACGAGGCG
TTCGCGGTGG TGGCGCTGGC GGCGATGCAC GACCTTGGCC TGCCGCACGA CAAGGTCAAC
GTCCATGGCG GGGCCTGCGC GCTCGGCCAC CCGATCGGCG CCTCCGGCGC ACGCGTGCTG
GTGACGCTGC TGGCGGCGCT CGAAAAATAC GACCTCAAGC GCGGCATCGC CTCGTTGTGC
ATCGGCGGCG GCGAGGCCAC CGCCGTCGCC GTGGAACGGT TGTCCTAA
 
Protein sequence
MSHDPIVIVG SARTPMGGFQ GELKDATASQ LGSAAIAAAV ARAGLKPDAI DEVVFGCVLP 
AGQGQAPARQ AALGAGLPLS TGATTINKMC GSGMKAAMLA NDLLIAGSAT IAVAGGMESM
TNAPYLLDRA RGGYRMGHGR VLDHMFLDGL EDAYDKGRLM GTFAEDCAQN YQFTRELQDN
FAITSLTRAQ TAIKDGSFAG EVTPVTVKSG RSEITVTTDE QPLKAKLDKI PTLKPAFRDG
GTVTAANSSS ISDGAAALVL MRRSEAERRG LTPLAAIAGH ATHAHEPNLF ATAPIGAIRK
LAERTGWNLA DVDLFEINEA FAVVALAAMH DLGLPHDKVN VHGGACALGH PIGASGARVL
VTLLAALEKY DLKRGIASLC IGGGEATAVA VERLS