Gene RPB_4151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4151 
Symbol 
ID3911959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4726209 
End bp4727462 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content67% 
IMG OID637886055 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_487754 
Protein GI86751258 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.409684 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.916878 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAAG CCTACATCAT CGACGCCGTC CGCACCCCCC GCGGCATCGG CAAAGTCGGC 
AAGGGCAAAC TGGCGGAGAT GCATCCGCAA CACCTTGCCG CCGCCGTGCT CAAGGCCATT
GCCGAGCGCA ACCAGCTCAA CACCGCCGAA GTCGACGATA TCATCTGGTC GACCTCCACC
CAGCGCGGTA AGCAGGGCGG CGACCTCGGC CGCATGGCGG CGCTCGATGC GGGCTACGAC
ATCAAGGCCT CCGGCACCAC GCTCGACCGT TTCTGCGGCG GCGGCATCAC CGCGGTGAAT
TTCGCCGCGG CCCAGATCAT GAGCGGCATG GAAGACGTGG TGATCGCCGG CGGCACCGAG
ATGATGTCGC TGACCGCATC GATGGCCGCC GAGGACATGG CCGCCGGCAA GCCGCCACTC
GGCATGGGCT CGGGCAATGC CCGCCTCGCT CAGGTGCACC CGCAATCGCA TCAGGGCATC
TGCGGCGACG CGATCGCCAC GATGGAAGGC ATCAGCCGCG AGGCGCTCGA CGCGCTCGGG
CTGGAGAGCC AGCGCCGCGC CGCGATCGCC ATCAAGGAAG GCCGCTTCGA CAAGAGCATC
ATCCCGGTCA AGGACGACGA CGGCAACGTC GTGCTGGCGA AGGACGAATA TCCGCGCCCC
GAAACCACCG CCGAAGGCCT CGCCGCGCTG AAGCCGGCCT TCACCGCGAT CGCCGACTAT
CCGCTCGACG ACAAGGGCAC CACCTATCGC AAGCTGATCA ACCAGAAGTA TCCGGACGTC
GACATCAAGC ACGTCCACCA CGCCGGCAAT TCCTCGGGCG TGGTCGACGG CGCCGCCGCG
GTGCTGCTGA CCTCGAAGGC CTATGCCGAC GCCCACGGCC TCAAGCCGCG CGCCAAGATC
GTGGCGATGG CCAATATCGG CGACGACCCG ACGCTGATGC TGAACGCGCC GGTGCCGGCG
GCCAAGAAGG TGCTGGCCAA GGCCGGACTC ACCAAGGACG ATATCGACCT CTGGGAGATC
AACGAAGCCT TCGCCGTGGT CACCGAGAAA TTCATCCGCG ACCTCGACCT CGACCGTGAC
AAGGTCAACG TCAATGGCGG CTCGATCGCC CTCGGCCACC CGATCGGCGC CACCGGCGCG
ATCCTGATCG GCACCGTGCT CGACGAACTG GAGCGCCGCG GCCTGAAGCG CGGCCTCGTC
ACGATGTGCG CCGCCGGCGG CATGGCCCCG GCGATCATCA TCGAGCGGGT GTGA
 
Protein sequence
MAEAYIIDAV RTPRGIGKVG KGKLAEMHPQ HLAAAVLKAI AERNQLNTAE VDDIIWSTST 
QRGKQGGDLG RMAALDAGYD IKASGTTLDR FCGGGITAVN FAAAQIMSGM EDVVIAGGTE
MMSLTASMAA EDMAAGKPPL GMGSGNARLA QVHPQSHQGI CGDAIATMEG ISREALDALG
LESQRRAAIA IKEGRFDKSI IPVKDDDGNV VLAKDEYPRP ETTAEGLAAL KPAFTAIADY
PLDDKGTTYR KLINQKYPDV DIKHVHHAGN SSGVVDGAAA VLLTSKAYAD AHGLKPRAKI
VAMANIGDDP TLMLNAPVPA AKKVLAKAGL TKDDIDLWEI NEAFAVVTEK FIRDLDLDRD
KVNVNGGSIA LGHPIGATGA ILIGTVLDEL ERRGLKRGLV TMCAAGGMAP AIIIERV