Gene Rpal_4236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4236 
Symbol 
ID6411920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4547984 
End bp4549171 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content69% 
IMG OID642714118 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001993207 
Protein GI192292602 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAGG CCGTTATCGT TTCAACCGCG CGCACGCCGA TCGGCAAGGC GTATCGCGGC 
GCCCTCAACG CCACCGAGGG CGCCACGCTG CTCGGCCACG CCATCGAGCA CGCGGTGAAG
CGCGCCGGAA TCGACCCGAA GGAGGTCGAG GACGTGGTGA TGGGCGCGGC GATGCAGCAG
GGCGCCACCG GCGGCAACAT CGCCCGCAAG GCGCTGCTGC GCGCCGGCCT GCCGGTGACC
ACCGCCGGCA CCACCATCGA CCGGCAGTGC GCGTCCGGCC TGCAGGCGAT CGCGCTCGCT
GCCCGCTCGG TGCTGTTCGA CGGCGTCGAG ATCGCGGTCG GCGGCGGCGG CGAGTCGATC
TCGCTGGTGC AGAACGACAA GATGAACACC TTCCACGCCG TCGATCCGGC GCTCGAGGCG
ATCAAGGGTG ACGTCTACAT GGCGATGCTC GACACCGCCG AAACCGTGGC GAAGCGCTAC
GGCATCTCGC GCGAGCGCCA GGACGAGTAT TCGCTGGAAA GCCAGCGCCG CACCGCAGCG
GCGCAGCAGG GCGGCAAGTT CAACGACGAG ATCGCGCCGA TCTCCACCAA GATGGGCGTC
GTCGACAAGG CCACCGGCGC AGTGTCGTTC AAGGACATCA CGCTGTCGCA GGACGAAGGC
CCGCGGCCGG AGACGACCGC CGAAGGTCTC GCCGGTCTTA AGGCCGTGCG TGGTGAAGGC
TTCACCATCA CTGCCGGCAA TGCCAGCCAG CTGTCGGACG GCGCGTCGGC CACGGTGATC
ATGAGCGACA AGACGGCGGC CGCGAAGGGC CTCAAGCCGC TCGGCATCTT CCGCGGCATG
GTCTCCTACG GCTGCGAGCC GGACGAGATG GGCATCGGCC CGGTGTTCGC GGTGCCGCGC
CTGCTCAAGC GCCACGGCCT GACCGTCGAC GACATCGGCG TTTGGGAGCT GAACGAGGCA
TTCGCCGTGC AGGTGCTGTA CTGCCGCGAT AAGCTCGGCA TCGATCCGGA GAAGCTCAAC
GTCAACGGCG GCGCGATCTC GGTCGGCCAC CCCTACGGCA TGTCGGGCGC CCGCCTGACC
GGCCACGCGC TGATTGAAGG CCGCCGCCGC AAGGCGAAGT ACGCGGTGGT CACGATGTGC
GTTGGCGGCG GCATGGGCTC CGCCGGCCTG TTCGAGATCG TGCACTGA
 
Protein sequence
MTEAVIVSTA RTPIGKAYRG ALNATEGATL LGHAIEHAVK RAGIDPKEVE DVVMGAAMQQ 
GATGGNIARK ALLRAGLPVT TAGTTIDRQC ASGLQAIALA ARSVLFDGVE IAVGGGGESI
SLVQNDKMNT FHAVDPALEA IKGDVYMAML DTAETVAKRY GISRERQDEY SLESQRRTAA
AQQGGKFNDE IAPISTKMGV VDKATGAVSF KDITLSQDEG PRPETTAEGL AGLKAVRGEG
FTITAGNASQ LSDGASATVI MSDKTAAAKG LKPLGIFRGM VSYGCEPDEM GIGPVFAVPR
LLKRHGLTVD DIGVWELNEA FAVQVLYCRD KLGIDPEKLN VNGGAISVGH PYGMSGARLT
GHALIEGRRR KAKYAVVTMC VGGGMGSAGL FEIVH