Gene RPB_1237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1237 
Symbol 
ID3909171 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1415438 
End bp1416772 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content67% 
IMG OID637883131 
Productphenylacetate--CoA ligase 
Protein accessionYP_484858 
Protein GI86748362 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1541] Coenzyme F390 synthetase 
TIGRFAM ID[TIGR02155] phenylacetate-CoA ligase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCC AGCCCGCATT GCGTGAACAG CCCGTCTATA TGCACGATCG CGCCATCGAG 
ATGATGCCGC GGCCGCAGCT CGCGGCGCTG CAGTTCGAGC GGCTGCGCCG AATCGTCGAG
CGGGCCTATC GCGACGTCCC GCATTATCGC CGGACGTTCG ACGAAGCCGG CGTCAAGCCG
GCCGATCTGC AGACGCTCGC GGACATCGCG AAGTTTCCGT TCACCAAGAA AAACGATCTG
CGCGACAACT ATCCGTTCGG CATGTTCGCG GTGCCGCGCA ATCAGCTACC GCGGATTCAC
GCCTCGTCGG GCACCACCGG CAAGCCGACC GTGGTCGGTT ATACCAAGAC CGATCTCGAC
AACTGGGCCG CCCTGATGGC GCGGTCGATG GTCGGGGCGG GCGTGTCGCC CGACGACATC
GTCCACAACG CCTATGGCTA CGGCCTGTTC ACTGGTGGGC TCGGCGCGCA TTACGGCGCC
GAGCGGCTGG GCTGCACCGT GGTGCCGATC TCCGGCGGCG GCACCGAGCG GCAGGTGACG
CTGATGACCG ATTTCGGCGC CAACGTGCTG TGCTGCACGC CGTCCTACGC GCTCAACATC
GCCGAAGTCG CCGAGCAGAT GGGCGTGAAC CTGCGCGCCG CGCCGCTGCG GATCGGCGTG
TTCGGCGCCG AGCCGTGGTC GGATGCGATG CGCCGCGACC TTGAGGCGCG GCTCGGCATC
AAGGCGATCG ACGTCTACGG CCTGTCGGAG ATCATGGGCC CCGGCGTCGC CTGCGAATGC
GCCGTGGCGC AGAACGGTCT GCACGGCTGG GAAGATCACT TCCTGTTCGA GACCATCGAT
CCGGAAACCC TGCAGGTGCT GCCGATGGGG TCGGTCGGCG AATTGGTGAT CACCACGCTG
ACCAAGGAAG CGCTGCCGAT GATCCGGTAT CGCACCCGCG ACATCACCAG CCTGTCGACC
GAACCCTGCG CCTGCGGCCG CACCCATCTG CGGATCATGC GCGTCACCGG CCGTGACGAC
GACATGCTGA TCATCCGCGG CGTCAACGTC TATCCGTCGC AGGTGGAGTC GGTGCTGGTC
GGCTTCCCCG GCATCGCGCC GCACTACCAG ATCGTGCTGA CGCGGGAGAA GGCGCTCGAC
GCCATGACGG TCGAAGTCGA GATCGCCCCC GACGCCCCGC GCGACGAGGC GGCGCTGGTG
AAGAAGGCGG CCGAGGTCAC CCACCACATC AAGTCGCTGA TCGGCGTGAC CTGCAAGGTC
GTAGTGAAGT CGCCGGGCGA CGTCCCGCGC TCGCAGGGCA AGGCGGTGCG GGTGAAGGAT
CAGCGGAATA TTTGA
 
Protein sequence
MSAQPALREQ PVYMHDRAIE MMPRPQLAAL QFERLRRIVE RAYRDVPHYR RTFDEAGVKP 
ADLQTLADIA KFPFTKKNDL RDNYPFGMFA VPRNQLPRIH ASSGTTGKPT VVGYTKTDLD
NWAALMARSM VGAGVSPDDI VHNAYGYGLF TGGLGAHYGA ERLGCTVVPI SGGGTERQVT
LMTDFGANVL CCTPSYALNI AEVAEQMGVN LRAAPLRIGV FGAEPWSDAM RRDLEARLGI
KAIDVYGLSE IMGPGVACEC AVAQNGLHGW EDHFLFETID PETLQVLPMG SVGELVITTL
TKEALPMIRY RTRDITSLST EPCACGRTHL RIMRVTGRDD DMLIIRGVNV YPSQVESVLV
GFPGIAPHYQ IVLTREKALD AMTVEVEIAP DAPRDEAALV KKAAEVTHHI KSLIGVTCKV
VVKSPGDVPR SQGKAVRVKD QRNI