Gene Rpal_3956 details


Gene Information

Locus tag: Rpal_3956
Symbol:
ID: 6411637
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4245178
End bp: 4246347
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 66%
IMG OID: 642713837
Product: acetyl-CoA acetyltransferase
Protein accession: YP_001992927
Protein GI: 192292322
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID:
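
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but that reading is consistent with the reported gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4245178
end_bp = 4246347
gene_length_bp = 1170
protein_length_aa = 389

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is the stop
# codon, which does not contribute an amino acid to the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, codons, remainder, expected_protein_length)
```

Under these assumptions the computed span and protein length agree with the Gene Length and Protein Length fields.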


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.358639
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGCCA GCATCGTTGG ATGGGCCCAT ATGCCGTTCG GCAAGTTCGA CGCCGAAACC 
GTGGAAAGCA TGATCGTCCG TGTCGCCACC GAGGCGATCG CCGACGCCGG GATCGCGGCC
TCGGATGTCG ACGAAATCGT GCTCGGGCAT TTCAATGCCG GATTCTCGCC GCAGGACTTC
ACAGCCGCGC TGGTGCTGCA GGCCGATCCG GCGCTCCGCT TCAAGCCGGC GACGCGCGTC
GAGAACGCCT GCGCGACCGG CTCGGCCGCC GTGCATCAGG GCATCCGCGC GATCGAAGCC
GGTGCCGCCA AGATCGTGCT GGTGGTCGGC GTCGAGCAGA TGACCCGCAC GCCCGGGCCG
GAGATCGGCA AGAACCTGCT GCGCGCCTCT TACTTGCCGG AGGACGGCGA CACGCCCGCC
GGGTTCGCTG GTGCGTTCGG CATCATCGCT CAGAAGTACT TCCAGAAATA TGGCGACCAG
TCCGATGCGC TGGCGATGAT CGCCGCCAAG AACCACCACA ACGGCGTTGC CAATCCCTAT
GCGCAGATGC GCAAGGATTT CGGCTTCGAG TTCTGCCGCG CCGAAGGCGA GAAGAATCCA
TTCGTCGCCG GGCCCTTGAA GCGCACCGAT TGCTCGCTGG TCTCGGACGG CGCCGCGGCG
CTGGTGCTGA CCTCGGCCGA GAACGCCAAG GCGATGGGCA AGGCGGTCAA CATCCGCGCC
CGCGCCCATG CGCAGGACTT TCTGCCGATG TCCAAGCGCG ACATCCTGCA GTTCGAAGGC
TGCACCGTCG CCTGGCAGCG CGCGCTGGAG CAGGCCGGCG TCACGCTGAA CGATCTGTCG
TTCGTCGAGA CCCACGATTG CTTCACCATC GCCGAGCTGA TCGAATACGA AGCGATGGGC
CTGACGCCGA AGGGGCAGGG CGCCCGCGCC ATCAAGGAGG GCTGGACCCA GAAGGACGGC
AAGCTGCCGA TCAATCCGTC CGGCGGTCTC AAGGCCAAGG GCCATCCGAT CGGCGCCACC
GGCGTGTCGA TGCACGTGCT GAGCGCGATG CAGCTGCTTG GCCAGGCGCC GGAAGGCATG
CAGATCAAGG ACGCCAAGCT CGCCGGCATC TTCAACATGG GCGGCGCCGC GGTCGCCAAC
TACGTGTCGG TGCTCGAACC GGCCAAGTAA
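
The gene sequence is listed in space-separated blocks of ten bases, so it needs to be cleaned before any analysis. The following is a minimal sketch of how the listing could be normalised and its GC fraction computed for comparison with the GC content field in the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the listing is used here; the full listing would need to be pasted in to check the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing above.
first_line = "ATGACTGCCA GCATCGTTGG ATGGGCCCAT ATGCCGTTCG GCAAGTTCGA CGCCGAAACC"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```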
 
Protein sequence
MTASIVGWAH MPFGKFDAET VESMIVRVAT EAIADAGIAA SDVDEIVLGH FNAGFSPQDF 
TAALVLQADP ALRFKPATRV ENACATGSAA VHQGIRAIEA GAAKIVLVVG VEQMTRTPGP
EIGKNLLRAS YLPEDGDTPA GFAGAFGIIA QKYFQKYGDQ SDALAMIAAK NHHNGVANPY
AQMRKDFGFE FCRAEGEKNP FVAGPLKRTD CSLVSDGAAA LVLTSAENAK AMGKAVNIRA
RAHAQDFLPM SKRDILQFEG CTVAWQRALE QAGVTLNDLS FVETHDCFTI AELIEYEAMG
LTPKGQGARA IKEGWTQKDG KLPINPSGGL KAKGHPIGAT GVSMHVLSAM QLLGQAPEGM
QIKDAKLAGI FNMGGAAVAN YVSVLEPAK
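
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the standard NCBI base ordering; translation table 11 (the one named in the record) uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons, which does not matter here because the gene begins with ATG. The function name `translate` and the use of `X` for unrecognised codons are choices made for this illustration, not part of the record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA listing codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence listed above.
print(translate("ATGACTGCCA GCATCGTTGG ATGGGCCCAT"))  # MTASIVGWAH
print(translate("TACGTGTCGG TGCTCGAACC GGCCAAGTAA"))  # YVSVLEPAK
```

The two fragments correspond to the first ten and last nine residues of the protein sequence above; the final TAA codon is the stop codon.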