Gene RPC_3100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3100 
Symbol 
ID3974051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3442733 
End bp3443899 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content67% 
IMG OID637926208 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_532961 
Protein GI90424591 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.514057 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0765503 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCCA GCATCGTCGG GTGGGCGCAC ACGCCGTTCG GCAAGCTCGA AGCCGAGACC 
GTCGAGAGCC TGATCGTGCG GGTGGCGGCC GACGCCTTGG CGGACGCCGG CATCGCCGCC
GCCGACGTCG ACGAGATCAT CCTCGGGCAT TTCAACGCCG GGTTCTCGCC GCAGGATTTT
ACCGCCGCCT TGGTGCTGCA GGCCGACCCC GAGCTGCGGT TCAAACCGGC GACCCGGGTT
GAGAACGCCT GCGCCACCGG ATCGGCCGCG GTGCACCAGG CGGTGAAGTC GGTCGCCGCC
GGCGCCAAGA TCGTCTTGGT GGTCGGCGTC GAGCAGATGA CCAAGACCCC GGGGCCGGAG
ATCGGCAAGA ACTTGCTGCG CGCCTCCTAT CTGATCGAGG ACGGCGACAC CCCGGCGGGC
TTCGCCGGCG TGTTCGGCAA CATCGCGCAG AAGTACTTTC AGAAATACGG CGACCAGTCC
GACGCGCTGG CGATGATCGC GGCGAAGAAC CACCATAACG GCGTATCGAA TCCCTATGCG
CAGATGCGCA AGGACATCGG CTTCGACTTC TGCCGCGCTG AGGGCGACAA GAACCCGTTC
GTCGCCGGCC CCCTGAAACG CACCGACTGC TCGTTGGTGT CGGACGGCGC TGCGGCCTTG
GTGATCACCG ACGCCGACAC CGCCAAGGCG ATGTCCAAGG CGGTGACGAT CAAAGCCACG
GCGCACGCTC AGGATTTCCT GCCGATGTCG AAGCGCGACA TTCTGCGGTT CGAGGGCTGC
AGCGTGGCCT GGCAGCGCGC GCTGCAATCC GCCGGCGCGA CCCTGCAGGA TCTGTCCTTC
GTCGAGACCC ACGATTGCTT CACCATCGCC GAACTGATCG AATACGAGGC GATGGGCCTG
ACGCCGGCCG GGCAGGGCGC CCGCGCCATC AAGGAAGGTT GGACCCGCAA GGACGGCAAG
CTGCCGATCA ATCCGTCCGG TGGCCTCAAG GCCAAAGGCC ATCCGATCGG CGCCACCGGG
GTGTCGATGC ACGTGCTCTG TGCGATGCAG CTGCTCGGCC AGGCGCCGGA GGGCATGCAG
ATCAAGGACG CCAAGCTCGC CGGCATTTTC AATATGGGCG GCGCCGCGGT CGCCAACTAC
GTCTCGCTGC TCGAGCCGGC GCGCTGA
 
Protein sequence
MTASIVGWAH TPFGKLEAET VESLIVRVAA DALADAGIAA ADVDEIILGH FNAGFSPQDF 
TAALVLQADP ELRFKPATRV ENACATGSAA VHQAVKSVAA GAKIVLVVGV EQMTKTPGPE
IGKNLLRASY LIEDGDTPAG FAGVFGNIAQ KYFQKYGDQS DALAMIAAKN HHNGVSNPYA
QMRKDIGFDF CRAEGDKNPF VAGPLKRTDC SLVSDGAAAL VITDADTAKA MSKAVTIKAT
AHAQDFLPMS KRDILRFEGC SVAWQRALQS AGATLQDLSF VETHDCFTIA ELIEYEAMGL
TPAGQGARAI KEGWTRKDGK LPINPSGGLK AKGHPIGATG VSMHVLCAMQ LLGQAPEGMQ
IKDAKLAGIF NMGGAAVANY VSLLEPAR