Gene RPD_1526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1526 
Symbol 
ID4022006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1702098 
End bp1703660 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content64% 
IMG OID637961721 
Productbenzoate-CoA ligase family 
Protein accessionYP_568664 
Protein GI91976005 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID[TIGR02262] benzoate-CoA ligase family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.936301 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCTTC GTGACTACAA TGCGGCGGTC GATTTCGTGG ATCGCAATGT CGCGGAAGGT 
CGGGGCGACA AGATCGCCTT CATCGATCCG TTGCGGAGCC TGTCCTATGG CGAGCTACGC
GACGCGGCGG CGCGGGTCGG CCCGATGCTG GCGCGGCTCG GCGTCGAGCA GGAAGACCGC
ATCGCGCTGG TGTTGCAGGA CACCGTCGAT TTTCCGATCC TGTTCTGGGG AGCGATCCGC
GCCGGCGTCG TCCCGGTGCT GCTCAACACA AGGCTGACCA CCGATCAGTA CCGATATCTT
CTGGAGGATT CGCGCGCGAA GGCGGTGTTC GTCTCGACCG ATCTGCTGCC GCTGATCGAG
GAGGCCGCGA CCGACCTGTC GCATCTGCGC TCGATCATTG CGGTCGGCGA GGGAGCATCA
GCGGCGGCGC GGCTGGCCGA TCTGCTCGCC GCCGAAAACG AGGGCGGCGC GCCGGCCCGC
ACCTGCGCCG ACGACGTCGC CTATTGGCAA TATTCGTCCG GCACGACCGG AATGCCCAAG
GGGGTGATGC ACGTCCACTC CAGCCCGCGC GTCATGGCGA CGAGCGCCGG CCAGCGCCGC
ATCGGCTATC GCCAGGACGA TATCGTGTTC TCGGCGGCGA AGCTGTTCTT CGCCTACGGC
CTCGGCAACG CGATGTTCTG CTCGATGTGG GTCGGCGCAA CCTCGGTGCT CTACCCGGAG
CGGCCGACGG CGGAATCGGT GTTCGACGTG CTGCGGCTGC ACCAGCCGAC CCTGCTGTTC
GCGGTGCCGA CGCTGTATGC GGCGATCCTT GCCGATCAGC AGCGCAAGCA CGAGCGGCTG
CCGGAGCGGC TGCGGCTGTG CGTCTCCGCC GGCGAGCCGC TGCCGGCGCA GGTCGGATTG
AACTGGCGCA ATCGGTTCGG CCGCGACATC GTCAACGGCG TCGGCTCGAC CGAGATGGGC
CATCTGTTCC TGACCAATCT TCCGAGCGCG GTGGAATACG GCACGTCGGG TGTGCCCGTC
GATGGCTATC GGCTGAAGCT GGTGGACGAT CAGGGATGCG ACATCGCCGA CGGCGAAATC
GGTGAACTGT TAGTGAATGG CGGCTCCGCC GCTGCGGGCT ACTGGAATCA ACGTGACAAG
TCGCGGATGA CCTTCATCGG CGAATGGACG CGAACGGGCG ACAAATATCA CCGCCGCGCC
GACGGCGTGT ACACTTACCG CGGCCGCACC GACGATATGT TCAAGGTGAG CGGCATCTGG
GTTTCGCCGT TCGAAATCGA GGAAGCGCTG ATGGGGCACC CCAAGGTGGC CGAAGCGGCG
GTGATCCCGG CCGAAGATAT CGACGGACTG ATCAAGCCGA AGGCCTTTAT CGTGCTCGCC
TCGCAGGATG AAGACATCAA CGTATTGATC CAAGACCTCA AGGACCACGT CAAACGCGCG
ATCGGTCCGT GGAAGTATCC ACGCTGGATA CGTGTCGTGA ACGAGCTGCC GAAAACGTCA
AGCGGCAAGC TGCAACGCTA CATGTTGCGT GCGATGGTGC TGGACCAGGA CGGTTCAGTA
TGA
 
Protein sequence
MPLRDYNAAV DFVDRNVAEG RGDKIAFIDP LRSLSYGELR DAAARVGPML ARLGVEQEDR 
IALVLQDTVD FPILFWGAIR AGVVPVLLNT RLTTDQYRYL LEDSRAKAVF VSTDLLPLIE
EAATDLSHLR SIIAVGEGAS AAARLADLLA AENEGGAPAR TCADDVAYWQ YSSGTTGMPK
GVMHVHSSPR VMATSAGQRR IGYRQDDIVF SAAKLFFAYG LGNAMFCSMW VGATSVLYPE
RPTAESVFDV LRLHQPTLLF AVPTLYAAIL ADQQRKHERL PERLRLCVSA GEPLPAQVGL
NWRNRFGRDI VNGVGSTEMG HLFLTNLPSA VEYGTSGVPV DGYRLKLVDD QGCDIADGEI
GELLVNGGSA AAGYWNQRDK SRMTFIGEWT RTGDKYHRRA DGVYTYRGRT DDMFKVSGIW
VSPFEIEEAL MGHPKVAEAA VIPAEDIDGL IKPKAFIVLA SQDEDINVLI QDLKDHVKRA
IGPWKYPRWI RVVNELPKTS SGKLQRYMLR AMVLDQDGSV