Gene Rpal_0738 details


Gene Information

Locus tag: Rpal_0738
Symbol:
ID: 6408391
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 774125
End bp: 776458
Gene Length: 2334 bp
Protein Length: 777 aa
Translation table: 11
GC content: 66%
IMG OID: 642710653
Product: 4-hydroxybenzoyl-CoA reductase, alpha subunit
Protein accession: YP_001989773
Protein GI: 192289168
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID: [TIGR03194] 4-hydroxybenzoyl-CoA reductase, alpha subunit
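
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 774125
end_bp = 776458

# Inclusive span: both endpoints count as bases of the gene.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be the stop codon, which contributes no amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 2334 bp
print(protein_length, "aa")  # 777 aa
```

Both values match the Gene Length and Protein Length fields of the record.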


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.821542
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGGTC CCGCATTCCA AGGCGGTCTC GACCGCGCCG GCGTGCCGCT AGTCGACGGT 
ACTGATAAAG TCACGGGGCG GGCTCGGTAC ACCGCCGATC TCGATCATAC CGGAGCATTA
GTTGCTCGTA TCCTGCGTAG CCCGATCAGT CATGGCAATA TCGTCCGGCT CGATGTCAGC
AAGGCGTTGG CACTCGACGG TGTCGCGGGG ATCGTCACCG GCGAGGACTG CGCAATCACC
TATGGCGTGC TGCCGATCGC GATGAACGAG TATCCGATGG CGCGCGATCG CGTCCGGTAT
CGCGGCGAAC CGGTTGCGGC AGTGGCCGCG GTCGATGCCG AGACGGCGCA ACAGGCGCTC
GACCTGATCG AACTCGAGCT GCGTGAACTG CCGGCTTACT ACGAGTCGGA GGCCGCCCGC
GCCCCGGGTG CCTGGCTGCT GCACGACAAC AAGCCGGGCA ATATCGAGCG CGAGGTTCAC
AATGAGTTCG GCGATCTTGC CGCAGGCTTC GAAGCCGCCG ACCTGATCCG CACCCACACG
CATCATTGTG CCGAGGTCAA TCACGCGCAG ATCGAGCCGC ACGCCTGCCT GATGGATTAC
GATCCGGTCA CCGGGCGGCT CACCGCCCAG AGCGTGTCGC AGGTCGGCTA TTATCTGCAC
CTGATGCTAG CGCGCTGCCT CGAGATCGAT CCGTCGCGGG TGCGGGTGAT CAAGCCGTTC
GTCGGCGGTG GCTTCGGCGC CCGCGTGGAA GTCTTGAATT TCGAGGTGAT CGCCGCACTG
CTCGCCCGTA AGGCCAGTGG CAGAGTGTTG ATGCGGCTGA GCCGTGAAGA GACCTTCATC
ACCCACCGCG CCCGGCCGCA GACCGACATC ACCTTGACGA TCGGGATGCG GCGCGACGGC
CGGTTCACCG CCTGCTCGTG CGAAGTGGTG CAGCGCGGCG GCGCCTATGC GGGTTACGGC
ATCGTGACCA TCCTGTATGC CGGCGCGCTG CTGCAGGGGC TGTACGACAT CCCCGCCATC
AAATACGACG GCTACCGCGT CTACACCAAC CTGCCACCCT GCGGCGCGAT GCGCGGGCAC
GGCTCGGTCG ACGTTCGTCA CGCCTTCGAG AATCTGGTCG ATCGGATGGC GCGCGAACTC
GGTCTCGATC CTTTTGCGGT GCGCCGCGCC AATCTGCTGG CCGCGCCGAC GCGCACTCTC
AACGATCTGA TGGTCAACAG CTATGGTCTC GCCGAATGCC TCGACAAGGT CGAGCGCGCC
AGCGGTTGGC GTGAGCGAAT TGGCCGGCTG CCGCCGGGCA AGGGTCTCGG CATGGCGTGT
TCGCACTATG TCAGCGGTTC GGCCAAGCCG ATCCATTTCA CCGGCGAGCC CCATGCAGTT
GTGGCGCTGC GGCTCGATTT CGATGGTGGC ATCACCGCGC TCACCGGCGC CGCCGACATC
GGCCAGGGCT CGTCGACGGT GGTGGCGATC ACTGTTGCTG AAACCCTGGG CGTCGCGCTG
AACCGGGTCC GTGTAATTTC CGGCGACTCC GCCATCACTC CGAAGGACAA TGGCGCGTAT
TCGTCGCGCA TCACCTTCAT GGTCGGCAAC GCCGCAATCG ATGCGGCGAC CAAGCTGAAA
CAGATCCTGA TCGAGGCGGC CGCGCGCAAA CTCGAAGCCG CACCGGAGCA GGTCGAGTGT
GCCGGCGAGA GCTTCTTCAT CGGCAGCAGT GCCCAGGCCG CGCTCGGCTT CGCCGAAGTG
GTGAAAGCCG CGCTGGTCGA CGAGGGGGCC ATCACCGTCA AGGGCACCTT CACCTGCCCG
CCGGAGTCGC AGGGCGGTAA ACATCGCGGC GGTGCGGTCG GCTCGACGAT GGGCTTCAGC
TATGCGGCCC AGGTGGTCGA GGTCAGCGTC GACGACGCGA CTGGCCTGAT CACCGTCGAC
AAGGTCTGGG TCGCGCTCGA TTGCGGCCGC GCCATCAATC CGCTCGCGGT GGTCGGGCAG
GTGCAGGGCG CCGTGTGGAT GGGGATGGGG CAGGCGATGT GCGAAGAGAC GCGCTATCTC
GACGGGCTGC CCGCGCATGC CAGCTTCCTC GAATATCGGA TGCCGACGAT GATCGAATCG
CCGCCGATCG AAGTTGCGAT CGTCGAGAGC GTCGATCCGT TCGGGCCGTT CGGCGCCAAG
GAGGCCAGTG AAGGCGCACT CGCTGGGTTC CCGCCGGCGA TGGTGAATGC CGTCGCCAAT
GCGATCGGCA TCGATCTCGA TGATTTGCCG GCAACGCCCG ATCGGGTCGT CGAGGCGCTG
GCACGGCGGC GGCGCGAGGC AAAGCGGGCG GTGGCGGAGA GGGCAGCCTC ATGA
 
Protein sequence
MSGPAFQGGL DRAGVPLVDG TDKVTGRARY TADLDHTGAL VARILRSPIS HGNIVRLDVS 
KALALDGVAG IVTGEDCAIT YGVLPIAMNE YPMARDRVRY RGEPVAAVAA VDAETAQQAL
DLIELELREL PAYYESEAAR APGAWLLHDN KPGNIEREVH NEFGDLAAGF EAADLIRTHT
HHCAEVNHAQ IEPHACLMDY DPVTGRLTAQ SVSQVGYYLH LMLARCLEID PSRVRVIKPF
VGGGFGARVE VLNFEVIAAL LARKASGRVL MRLSREETFI THRARPQTDI TLTIGMRRDG
RFTACSCEVV QRGGAYAGYG IVTILYAGAL LQGLYDIPAI KYDGYRVYTN LPPCGAMRGH
GSVDVRHAFE NLVDRMAREL GLDPFAVRRA NLLAAPTRTL NDLMVNSYGL AECLDKVERA
SGWRERIGRL PPGKGLGMAC SHYVSGSAKP IHFTGEPHAV VALRLDFDGG ITALTGAADI
GQGSSTVVAI TVAETLGVAL NRVRVISGDS AITPKDNGAY SSRITFMVGN AAIDAATKLK
QILIEAAARK LEAAPEQVEC AGESFFIGSS AQAALGFAEV VKAALVDEGA ITVKGTFTCP
PESQGGKHRG GAVGSTMGFS YAAQVVEVSV DDATGLITVD KVWVALDCGR AINPLAVVGQ
VQGAVWMGMG QAMCEETRYL DGLPAHASFL EYRMPTMIES PPIEVAIVES VDPFGPFGAK
EASEGALAGF PPAMVNAVAN AIGIDLDDLP ATPDRVVEAL ARRRREAKRA VAERAAS
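
As a rough cross-check between the two sequence blocks, the gene sequence can be translated codon by codon and compared with the protein sequence. The sketch below is a minimal, standard-library-only illustration, not the database's own pipeline. It assumes the sense strand is shown as printed, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard table (the two differ only in permitted start codons). The helper name `translate` and the variable names are hypothetical. Only the first and last rows of the gene sequence are used to keep the example short.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (the 10-base blocks and line breaks used in the record)
    is ignored. Trailing bases that do not fill a codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
# The last row starts in frame because the 38 rows before it hold 60 bases each.
first_row = "ATGAGCGGTC CCGCATTCCA AGGCGGTCTC GACCGCGCCG GCGTGCCGCT AGTCGACGGT"
last_row = "GCACGGCGGC GGCGCGAGGC AAAGCGGGCG GTGGCGGAGA GGGCAGCCTC ATGA"

print(translate(first_row))  # MSGPAFQGGLDRAGVPLVDG
print(translate(last_row))   # ARRRREAKRAVAERAAS
```

The two outputs agree with the first 20 and last 17 residues of the protein sequence, and the final TGA codon is the stop codon that is excluded from the 777 aa protein length.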