Gene RPD_0474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0474 
Symbol 
ID4020942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp545958 
End bp547910 
Gene Length1953 bp 
Protein Length650 aa 
Translation table11 
GC content66% 
IMG OID637960661 
Productacetyl-CoA synthetase 
Protein accessionYP_567613 
Protein GI91974954 
COG category[I] Lipid transport and metabolism 
COG ID[COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases 
TIGRFAM ID[TIGR02188] acetate--CoA ligase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.165738 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAGA AAATCTACGA CGTGCCCTCG GAATGGGCGA GCCGCGCCTT CGTGGACGAT 
GCGAAGTATC GCGAGATGTA CGCCCGCTCG GTCAGCGACC CGAACGGCTT CTGGGCCGAG
CAGGCCAAGC GCCTCGATTG GATCAAGCCG CCGCACAAGA TCGAGAACTG CTCCTTCGCG
CCCGGCAACG TCTCGATCAA ATGGTTCGAG GACGGCGTCC TCAACGTCGC GCATAATTGC
ATCGACCGGC ATCTGCACAC CCGCGGCGAT CAGGTCGCGA TCATCTGGGA GGGCGACGAT
CCCTCGCAGT CGCGCAAGAT CACCTATCGC GAGCTGCATG ACGAGGTCTG CAAATTCGCC
AACATCCTGC GCAACCGCAA TGTCGGCAAG GGCGACCGCG TCACCATCTA TCTGCCGATG
ATTCCGGAAG CCGCCTTCGC GATGCTGGCC TGCGCCCGGA TCGGCGCGAT CCATTCGGTG
GTGTTCGCCG GCTTCTCGCC GGACAGTCTC GCCCAGCGCA TCACCGACTG TCAGTCCAAG
GTGGTGATCA CCGCCGACGA AGGCCTGCGC GGCGGCCGTA AGGTGCCGCT GAAGGCCAAT
GTGGATGCCG CGCTCGCCAA ATGCGGCGGC GTCGACTGGG TGGTCGTGGT GAAGCGCACC
GGCGCTGCGG TCGAAATGGA TGACGTCCGC GACTTCTGGT ACCACGAGGC CGCCGAAATG
GTGACCACGG AGTGCCCGAT CGAGCCGATG CACGCCGAAG ACCCGCTGTT CATCCTCTAC
ACCTCGGGGT CGACCGGCCA GCCCAAGGGC GTGCTGCACA CCACCGGCGG CTATCTGGTG
TTCGCCGGGA TGACGCACCA ATACGTGTTC GATTATCACG ACGGCGACGT CTACTGGTGC
ACCGCCGACG TCGGCTGGGT CACCGGTCAC AGCTACATCC TGTACGGGCC GCTCGCCAAC
GGCGCGACCA CGCTGATGTT CGAAGGCGTG CCGAACTACC CGACCAATTC GCGGTTCTGG
GAAGTGATCG ACAAGCACCA GGTCAACATC TTCTACACCG CGCCGACCGC GATCCGCGCG
CTGATGCAGG CCGGCGACGA GCCGGTGAAG AAGACGTCGC GCAAGAGCCT GCGGCTGCTC
GGTTCGGTCG GCGAGCCGAT CAATCCGGAA GCCTGGGAGT GGTATCACCG CGTCGTCGGC
GACGACCGTT GCCCGATCGT CGACACCTGG TGGCAGACCG AGACGGGCGG CATCCTGATC
ACGCCGCTGC CCGGCGCGAC CAAGCTCAAG CCCGGCTCGG CGACGCGGCC GTTCTTCGGC
GTGGTGCCGG AGATCCTCGA TCCGGAAGGC GTGGTGCTGG AGGGCGAATG CACCGGCAAT
CTGTGTCTGG CGCGATCCTG GCCGGGCCAG ATGCGCACGG TGTACGGCGA TCACGCCCGG
TTCGAGCAGA CCTACTTCTC GGCCTATAAG GGCAAGTACT TCACCGGCGA CGGCTGCCGC
CGCGACGCCG ACGGCTATTA CTGGATCACC GGCCGGGTCG ACGACGTCAT CAACGTCTCC
GGCCACCGCA TGGGCACCGC CGAGGTCGAA AGCTCGCTGG TCGCGCATCC CAAGGTGTCG
GAAGCCGCCG TGGTCGGCTA TCCGCACGAC ATCAAGGGCC AGGGCATCTA CGCCTATGTG
ACGCTGATGG CCGGGATCGA GCCCAGCGAG GAGCTGCGCA AGGAGCTGGT CGCCTGGGTC
CGCAAGGACA TCGGCCCGAT CGCCTCGCCG GACCTGATCC AGTTCGCGCC CGGCCTGCCG
AAGACCCGCT CCGGCAAGAT CATGCGCCGC ATCCTGCGCA AGATCGCCGA GGACGAGCCC
TCGACGCTCG GCGACACCTC GACGCTGGCC GATCCCGCGG TGGTCGACGA TCTCGTCGAG
CACCGGCAGA ACAAGCACCA CAAGGCGGTC TGA
 
Protein sequence
MSEKIYDVPS EWASRAFVDD AKYREMYARS VSDPNGFWAE QAKRLDWIKP PHKIENCSFA 
PGNVSIKWFE DGVLNVAHNC IDRHLHTRGD QVAIIWEGDD PSQSRKITYR ELHDEVCKFA
NILRNRNVGK GDRVTIYLPM IPEAAFAMLA CARIGAIHSV VFAGFSPDSL AQRITDCQSK
VVITADEGLR GGRKVPLKAN VDAALAKCGG VDWVVVVKRT GAAVEMDDVR DFWYHEAAEM
VTTECPIEPM HAEDPLFILY TSGSTGQPKG VLHTTGGYLV FAGMTHQYVF DYHDGDVYWC
TADVGWVTGH SYILYGPLAN GATTLMFEGV PNYPTNSRFW EVIDKHQVNI FYTAPTAIRA
LMQAGDEPVK KTSRKSLRLL GSVGEPINPE AWEWYHRVVG DDRCPIVDTW WQTETGGILI
TPLPGATKLK PGSATRPFFG VVPEILDPEG VVLEGECTGN LCLARSWPGQ MRTVYGDHAR
FEQTYFSAYK GKYFTGDGCR RDADGYYWIT GRVDDVINVS GHRMGTAEVE SSLVAHPKVS
EAAVVGYPHD IKGQGIYAYV TLMAGIEPSE ELRKELVAWV RKDIGPIASP DLIQFAPGLP
KTRSGKIMRR ILRKIAEDEP STLGDTSTLA DPAVVDDLVE HRQNKHHKAV