Gene RPB_4605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4605 
Symbol 
ID3912422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5200023 
End bp5201231 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content66% 
IMG OID637886509 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_488199 
Protein GI86751703 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAAG CCTATATCTA CGATCACGTT CGCACGCCTC GCGGCCGCGG CAAGGCCGAC 
GGCGCCTTGC ACGAGGTGAC CGCGCTGGCG CTCGCCACCG TACCGCTGAA AGCGCTGAAG
GAACGCAACA ACCTGAAGCA GGACGTGGTC GACGACGTCA TCCTCGGCGT GGTCGATCCG
GTCGGTGAGG CCGGCTCGGA CATCGCGCGG TTCGCGGCGA TGAACGCGGG CCTCGGCGAA
GCCGTGCCGG GCATCCAGAT CAGCCGCTTC TGTGCCTCGG GCCTCGACGC GGTGAACTTC
GCTGCGGCGC AGATCATGAG CGGCCAGCAC GAGCTGGTGA TCGGCGGCGG CGCGGAATCG
ATGAGCCGCA TCGGCATCGG CGCCTCCGGC GGCGCCTGGC CGATGGACCC GTCGATGGCT
GTGCCGTCCT ACTTCATGCC GCAGGGCATT TCGGCCGATT TGATCGCGAC CAAATACGGT
TTCTCGCGGG ACGACGTCGA CGCTTATGCG GTGCAGAGCC AGCAGCGCTC GGCGAAGTCG
TGGGAAGAAG GCCGCTTCGC CAAATCGGTC GTGCCGGTCA AGGACATCAA CGGCCTGACC
ATTCTGGCCA AGGACGAGCA CATGCGGCCA TCGACGACGA TGCAGTCGCT CGGGCAGTTG
CAGCCGTCGT TCGCGCCGAT GGCGGTGATG GGCGGTTTCG ACGCGGTGGC GATCCAGTCG
CATCCGGAGA TCGAGAAGGT CAACTACGTC CATCATGCCG GCAACTCTTC CGGCATCGTC
GATGGCGCCG GCGCGGTGCT GCTCGGCAGC AAGGAAGCCG GTGCCAAGCA CGGCCTCAAG
CCGCGCGCAA AAATTCGCGC CTTCGCCAAT ATCGGCTCCG AGCCGGCGAT GATGCTGACC
GGCCCGGTCG ACGTCACCAA GAAGCTGTTC GAGCGCTCCG GCATGAAGAA GAGCGACATC
GACCTGTTCG AGCTCAACGA GGCTTTCGCC TCGGTGGTGC TGCGCTTCAT GCAGGCGTTC
GAGATCGACA ACGACCAGAT CAACGTGACC GGCGGCGCGA TCGCGCTCGG CCATCCGCTC
GGCGCGACCG GCGCGATGAT CCTCGGCACC GTGCTCGACG AGCTCGAGCG CACCGGCAAG
GCGACCGCGC TGGTGACGCT GTGCATCGGC GGCGGCATGG GCACCGCGAC GATCATCGAA
CGCGTCTGA
 
Protein sequence
MPEAYIYDHV RTPRGRGKAD GALHEVTALA LATVPLKALK ERNNLKQDVV DDVILGVVDP 
VGEAGSDIAR FAAMNAGLGE AVPGIQISRF CASGLDAVNF AAAQIMSGQH ELVIGGGAES
MSRIGIGASG GAWPMDPSMA VPSYFMPQGI SADLIATKYG FSRDDVDAYA VQSQQRSAKS
WEEGRFAKSV VPVKDINGLT ILAKDEHMRP STTMQSLGQL QPSFAPMAVM GGFDAVAIQS
HPEIEKVNYV HHAGNSSGIV DGAGAVLLGS KEAGAKHGLK PRAKIRAFAN IGSEPAMMLT
GPVDVTKKLF ERSGMKKSDI DLFELNEAFA SVVLRFMQAF EIDNDQINVT GGAIALGHPL
GATGAMILGT VLDELERTGK ATALVTLCIG GGMGTATIIE RV