Gene RPD_4008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4008 
Symbol 
ID4024525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4453844 
End bp4455178 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content66% 
IMG OID637964211 
Productphenylacetate--CoA ligase 
Protein accessionYP_571128 
Protein GI91978469 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1541] Coenzyme F390 synthetase 
TIGRFAM ID[TIGR02155] phenylacetate-CoA ligase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGCGC AGACCGCTCT CCGCGAAACG CCGATTTATA TGCATGACCG TGCCATCGAG 
ACGATGCCGC GGCCGCAGCT CGCGGCGTTG CAGCTCGAGC GGCTGCGCAG AATCGTCGAG
CGCGCCTATC GCGACGTCCC GCATTATCGC CGGACGTTCG ACGCCGCCGG CGTCAAGCCG
TCCGAGCTGA CGTCGCTCGC CGATCTGGCG AAGTTTCCGT TCACCAAGAA GACAGATCTG
CGCGACAATT ATCCGTTCGA CATGTTCGCG GTGCCGCGCA ATCAGTTGCC GCGAATTCAC
GCATCATCCG GCACCACCGG AAAGCCGACC GTGGTCGGCT ACACCCGGAA CGATCTCGAT
AATTGGGCCG ACCTGATGGC GCGGTCGCTG GTCAGCGCCG GCGCCTCGCC GGACGACATC
GTTCACAACG CCTATGGCTA CGGCCTGTTC ACCGGCGGTC TCGGCGCGCA TTACGGCGCC
GAGCGGCTCG GCTGCACAGT GGTGCCGATC TCCGGCGGCG GCACCGAGCG TCAGGTCACG
CTGATGATGG ACTTCGGCGC CGACGTGCTG TGCAGCACGC CGTCCTACGC ACTCAACATC
GCCGAAGTCG CCGAGCAGAT GGGCGTCGAT CTGCGCAAGG CGCCGCTGCG CGTCGGGCTG
TTCGGCGCCG AGCCGTGGAG CGACGCGATG CGGCGCGACC TCGAGGCGCG GCTCGGCATC
AAGGCGATCG ACATCTACGG CCTGTCGGAG ATCATGGGCC CTGGCGTCGC CTGCGAATGC
CACGTCGCGC AGAATGGCCT GCACGGCTGG GAGGATCACT TCCTGTTCGA GACCATCGAT
CCGGAAACGC TGCAGCCGTT GCCGCTCGGC TCGACCGGCG AACTGGTGAT CACCACGCTC
ACCAAGGAAG CGCTGCCGAT GATCCGGTAT CGCACCCGCG ACATCACCAG CCTCTCGACC
GAGCCCTGCG CCTGCGGTCG CACCCATCTG CGGATCATGC GCGTCACCGG CCGCGACGAC
GACATGCTGA TCATCCGCGG CGTCAACGTC TATCCGTCGC AGGTGGAGTC GGTGCTGGTC
GGCTTCCCCG GCATCGCGCC GCACTACCAG ATCGTGCTGA CCCGCGACAA AGCGCTCGAC
GCCATGACCG TCGAAGTCGA GATCGCCCCG GATGCGCCGC GCGACGACGC CTCGCTGGCG
TACAAGGCCG CCGAGGTCAC GCATCACATC AAGTCGCTGA TCGGCGTCAC CTGCAAGGTC
ACCGTCAAGG CGCCCGGCGA AGTGCCGCGC TCGCAGGGCA AGGCGGTGCG GGTGAAGGAT
CAGCGGAATA TTTGA
 
Protein sequence
MGAQTALRET PIYMHDRAIE TMPRPQLAAL QLERLRRIVE RAYRDVPHYR RTFDAAGVKP 
SELTSLADLA KFPFTKKTDL RDNYPFDMFA VPRNQLPRIH ASSGTTGKPT VVGYTRNDLD
NWADLMARSL VSAGASPDDI VHNAYGYGLF TGGLGAHYGA ERLGCTVVPI SGGGTERQVT
LMMDFGADVL CSTPSYALNI AEVAEQMGVD LRKAPLRVGL FGAEPWSDAM RRDLEARLGI
KAIDIYGLSE IMGPGVACEC HVAQNGLHGW EDHFLFETID PETLQPLPLG STGELVITTL
TKEALPMIRY RTRDITSLST EPCACGRTHL RIMRVTGRDD DMLIIRGVNV YPSQVESVLV
GFPGIAPHYQ IVLTRDKALD AMTVEVEIAP DAPRDDASLA YKAAEVTHHI KSLIGVTCKV
TVKAPGEVPR SQGKAVRVKD QRNI