Gene RPC_2228 details


Gene Information

Locus tag: RPC_2228
Symbol:
ID: 3973226
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2429422
End bp: 2430609
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 69%
IMG OID: 637925336
Product: Acetyl-CoA C-acetyltransferase
Protein accession: YP_532101
Protein GI: 90423731
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
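
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2429422
end_bp = 2430609

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1188
print(protein_length)   # 395
```

Under these assumptions the results match the listed Gene Length (1188 bp) and Protein Length (395 aa).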


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.138441
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.114685
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAGG CCGTTATCGT TTCCACCGCG CGCACCCCGA TCGGCAAGGC CTATCGCGGC 
GCGCTCAATG CCACCGAGGG CGCGACCTTG CTCGGCCACG CCATCGAGCA CGCGGTGAAG
CGCGCCGGCA TCGACCCCAA GGAGGTCGAG GACGTGGTGA TGGGCGCCGC CTTGCAGCAG
GGCTCAACCG GCGGCAACAT CGCCCGCAAG GCGCTGCTGC GCGCCGGGCT GCCGGTGTCG
GTCGCCGGCA CCACCATCGA TCGGCAGTGC GCCTCCGGCC TGCAAGCGAT CGCGCTGGCG
GCGCGTTCGG TGCTGTTCGA TGGCGTCGAG ATCGCGGTCG CCGGCGGCGG CGAGTCGATC
AGCCTGGTGC AGAACGACAA AATGAATACT TTCCACGCCG TCGATCCGGC GCTGCAGGCG
ATCAAGGGCG ACGTCTATAT GGCGATGCTC GACACCGCCG AGGTGGTGGC GAAGCGCTAC
GGCATTTCGC GCGAACGCCA GGACGAGTAC TCGCTGGAGA GCCAGCGCCG CACCGCGGCG
GCGCAGCAGG GCGGCAAGTT CGCCGACGAG ATCGCTGCGA TCTCGACCAA GATGGGCGTG
GTCGACAAGG CCAGCGGCGC GGTGTCGTTC AAGGACATCA CGCTGTCGCA GGACGAAGGC
CCGCGGCCGG ACACCTCGGC GGAAGGCTTG GCGGCGTTGA AGGCGGTGCG CGGCGAGGGC
TTCACCATCA CCGCCGGCAA CGCCTCGCAG CTCTCCGACG GCGCCAGCGC CACCGTGGTG
ATGAGCGACA CGCTCGCCGC CAAGAAGGGC CTAAAGCCGC TCGGCATCTT CCGCGGCTTC
GTCTCGGCGG GCTGCGAGCC GGACGAGATG GGGATCGGCC CGGTCTATGC GGTGCCGCGG
CTGCTCAAGC GCCACGGCTT GAAGATCGAG GACATCGACC TGTGGGAGCT CAACGAAGCC
TTTGCGGTGC AGGTGCTGTA TTGTCGCGAC AAGCTCGGCA TCGATCCGGA GAAGATCAAC
GTCGATGGCG GCGCCATCGC GGTCGGCCAT CCCTACGGCA TGTCGGGCGC CCGGCTCACC
GGCCACGCGC TGATCGAAGG CCGCCGCCGC AAGGCGAAAT ACGCCGTCGT CACCATGTGC
GTCGGCGGCG GCATGGGCTC GGCCGGCCTG TTCGAGATCG TGCAGTAA
 
Protein sequence
MTEAVIVSTA RTPIGKAYRG ALNATEGATL LGHAIEHAVK RAGIDPKEVE DVVMGAALQQ 
GSTGGNIARK ALLRAGLPVS VAGTTIDRQC ASGLQAIALA ARSVLFDGVE IAVAGGGESI
SLVQNDKMNT FHAVDPALQA IKGDVYMAML DTAEVVAKRY GISRERQDEY SLESQRRTAA
AQQGGKFADE IAAISTKMGV VDKASGAVSF KDITLSQDEG PRPDTSAEGL AALKAVRGEG
FTITAGNASQ LSDGASATVV MSDTLAAKKG LKPLGIFRGF VSAGCEPDEM GIGPVYAVPR
LLKRHGLKIE DIDLWELNEA FAVQVLYCRD KLGIDPEKIN VDGGAIAVGH PYGMSGARLT
GHALIEGRRR KAKYAVVTMC VGGGMGSAGL FEIVQ
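
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It is not the database's own pipeline: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is built by hand rather than taken from a library. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as starts; since this gene begins with ATG, that difference does not matter here.

```python
# Codon table in the conventional TCAG ordering (standard assignments,
# which translation table 11 shares for internal codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three and last two 10-base blocks of the gene sequence above.
first_blocks = "ATGACCGAGG CCGTTATCGT TTCCACCGCG"
last_blocks = "TTCGAGATCG TGCAGTAA"

print(translate(first_blocks))  # MTEAVIVSTA
print(translate(last_blocks))   # FEIVQ
```

The two fragments translate to the first ten and last five residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the listed GC content.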