Gene RPC_1854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1854 
Symbol 
ID3971868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp2017521 
End bp2018801 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content63% 
IMG OID637924967 
Productformyl-coenzyme A transferase 
Protein accessionYP_531732 
Protein GI90423362 
COG category[C] Energy production and conversion 
COG ID[COG1804] Predicted acyl-CoA transferases/carnitine dehydratase 
TIGRFAM ID[TIGR03253] formyl-CoA transferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.780367 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAGG CGCTGAACGG CGTTCGCATT CTGGACTTCA CCCACGTCCA GTCCGGCCCG 
ACCTGCACGC AACTGCTGGC CTGGTTCGGC GCCGACGTGA TCAAGGTCGA GCGCCCCGGT
GTCGGCGACA TTACGCGGGG GCAATTGCAA GACATCCCGA ACGTCGACAG CCTGTATTTC
ACCATGCTGA ACCACAACAA GCGCTCGATC ACGCTCGACA CCAAGAACCC CAAGGGCAAG
GAAGTGCTCA CCGCGCTGAT CAAGAGCTGC GACGTGCTGG TGGAGAATTT CGGCCCCGGC
GTGCTCGACC GCATGGGATT TTCCTGGGAG AAGATCCAGA GCCTCAATCC GAAGATGATC
GTCGCCTCGA TCAAGGGATT CGGCCCCGGC CCCTATGAGG ATTGCAAGGT CTACGAGAAC
GTCGCGCAAT GCACCGGCGG CGCGGCGTCG ACCACCGGCT TCCGCGACGG CCTACCGCTG
GTCACCGGCG CGCAGATCGG CGATAGCGGC ACCGGCCTGC ATCTCGCGCT CGGCATCGTC
ACCGCCTTGT ATCAACGCAC GGTGACCGGC CGCGGCCAGA AGGTGACCGC GGCGATGCAG
GACGGCGTGT TGAATTTGTC GCGGGTGAAA TTGCGCGACC AGCAGCGCCT CGCCCATGGC
CCGCTGAAGG AATACAGCCA GTTCGGCGAA GGCATTCCGT TTGGAGATGC CGTTCCTCGC
GCGGGAAACG ATTCCGGCGG CGGCCAGCCG GGACGGATTC TGAAGTGCAA GGGCTGGGAG
ACCGATCCCA ACGCCTACAT CTACTTCATT ACGCAAGCGC CGGTGTGGGA GAAGATTTGC
GACGTGATCG GCGAGCCGGA TTGGAAAACC CATCCCGACT ACGCCAAGCC GGCGGCGCGG
CTCAAGCACC TCAACGACAT CTTCGCGCGC ATCGAACAAT GGACCATGAC CAAGACCAAG
TTCGAGGCGA TGGACATTCT CAACAAGGAC GACATTCCCT GCGGGCCGAT CCTGTCGATG
AAGGAACTCG CCGAGGATCA ATCGCTGCGC GCCACCGGCA CGGTGGTCGA GGTCGATCAT
CCGACCCGCG GCAAGTATCT GTCGGTCGGC AACCCGATCA AGATGTCGGA TAGCCCGACC
GAGGTGATGC GCTCGCCCTT GCTCGGCGAG CACACCGACG AGATCCTGCG GCAGGTGCTC
GGCTTCAGCG ATCAGCAGGT CGCCGAGGTG CATGATTCCG GCGCGCTGGA ACCACCGCGC
AAGGCGGCTG CGGCGGAATA A
 
Protein sequence
MTKALNGVRI LDFTHVQSGP TCTQLLAWFG ADVIKVERPG VGDITRGQLQ DIPNVDSLYF 
TMLNHNKRSI TLDTKNPKGK EVLTALIKSC DVLVENFGPG VLDRMGFSWE KIQSLNPKMI
VASIKGFGPG PYEDCKVYEN VAQCTGGAAS TTGFRDGLPL VTGAQIGDSG TGLHLALGIV
TALYQRTVTG RGQKVTAAMQ DGVLNLSRVK LRDQQRLAHG PLKEYSQFGE GIPFGDAVPR
AGNDSGGGQP GRILKCKGWE TDPNAYIYFI TQAPVWEKIC DVIGEPDWKT HPDYAKPAAR
LKHLNDIFAR IEQWTMTKTK FEAMDILNKD DIPCGPILSM KELAEDQSLR ATGTVVEVDH
PTRGKYLSVG NPIKMSDSPT EVMRSPLLGE HTDEILRQVL GFSDQQVAEV HDSGALEPPR
KAAAAE