Gene RPC_4003 details

Gene Information

Locus tag: RPC_4003
Symbol:
ID: 3969193
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4451816
End bp: 4452973
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 64%
IMG OID: 637927107
Product: patatin
Protein accession: YP_533848
Protein GI: 90425478
COG category: [R] General function prediction only
COG ID: [COG1752] Predicted esterase of the alpha-beta hydrolase superfamily
TIGRFAM ID:
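
The Start bp, End bp, Gene Length and Protein Length fields are mutually consistent, which a short sketch can confirm. It assumes that the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(4451816, 4452973)
print(length_bp)                  # 1158
print(protein_length(length_bp))  # 385
```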


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.426633
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCCCCC CGATGGATAC AGCTGCCGCC AAACCGGCTT CGAGTCCCGC CGAAGGTCGC 
CGCGTGTTGG TGCTGCAGGG CGGCGGCGCG CTCGGCTCCT ATCAGGCCGG CGCCTATCAG
GCGCTGTGCC ACCACGATTT CGAACCGCAA TGGCTCGCCG GCATCTCGAT CGGCGCCATC
AACGCCGCGA TCATCGCCGG CAATCCGCGC GAGCAGCGGG TGGCGAAGCT GAAGGAATTC
TGGGAGCTGG CGTCGTCGCC GGTGCCGTGG CTGCCCGTGC TGCACGACGA CCGGGTGCAT
TCGCTGTTCA ACGAAACCAG CGCAGCGCTG ACCGCCATGT TCGGCGTGCC CGGCTTCTTC
ACGCCGCGGT TTCCGCCGCC GCTGCTGTTT CCGCCGCAGG ATCTGACCTC GCTGAGCTAT
TACGACACCG CGCCGCTGCG CGCGACGCTG CAGCGGCTGG TGGACTTCGA TCGCATCAAC
GACGAGGCCA AGACCACCCG GCTCAGCCTC GGCGCGGTCA ACATCGCCAC CGGCAATTTC
TGCTACTTCG ACAACACAAG ACAACAGATC GGCCCGGAGC ACGTCATGGC TTCGGCGGCG
CTGCCGCCGG GCTTTCCGGC GGTCGAGATC GACGGCGAAT TCTTCTGGGA CGGCGGCATC
GCCTCCAACA CGCCGCTGGA CTACGTGCTC GGCGAAGAAA CCGTCGACGA TCTGTTGATC
TTCCAGGTCG ATCTGTTCAG CGCCCGCGGC CGGCTGCCGG AGACGCTGCT GGAGGCCGCC
GAGCGCGAAA AGGACATCCG GTTTTCCAGC CGCACCAGGC TCAACACCGA CAAGAACAAG
CAGATCCACA ACACCCGCAA GGCGTTGCGC GATTTGATCG ACAAATTGCC CGACGAACTG
CGCAACGATC CGGCCTACGC CATCCTGCAC GAGGCGGCGG AGGAGAACAC CGTCACCGTG
GTGCATCTGA TCTACCGCAA GCGCAATTAC GAGGCTGCAT CGAAGGACTA TGATTTCTCC
CGCCTCAACA TGCTCGAGCA TTGGAAATCC GGCGAGCAGG ACGTGCATCT GTCGATGCGG
CGTCCGGACT GGCTGCGCCA GCCGCAGGAC GGCGAGACCA TGGTGACCTA CGATCTGACC
AAAGAGGCTT TCCAATAG
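
The gene sequence is printed in blocks of ten bases, so it needs cleaning before any computation. The sketch below strips the grouping whitespace and computes the fraction of G and C bases. Applied to the full gene sequence, it could be compared against the GC content field. The helper names are hypothetical and only the first sequence line is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in this record.
first_line = clean_sequence(
    "ATGGGCCCCC CGATGGATAC AGCTGCCGCC AAACCGGCTT CGAGTCCCGC CGAAGGTCGC"
)
print(len(first_line))  # 60
print(f"{gc_fraction(first_line):.1%}")
```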
 
Protein sequence
MGPPMDTAAA KPASSPAEGR RVLVLQGGGA LGSYQAGAYQ ALCHHDFEPQ WLAGISIGAI 
NAAIIAGNPR EQRVAKLKEF WELASSPVPW LPVLHDDRVH SLFNETSAAL TAMFGVPGFF
TPRFPPPLLF PPQDLTSLSY YDTAPLRATL QRLVDFDRIN DEAKTTRLSL GAVNIATGNF
CYFDNTRQQI GPEHVMASAA LPPGFPAVEI DGEFFWDGGI ASNTPLDYVL GEETVDDLLI
FQVDLFSARG RLPETLLEAA EREKDIRFSS RTRLNTDKNK QIHNTRKALR DLIDKLPDEL
RNDPAYAILH EAAEENTVTV VHLIYRKRNY EAASKDYDFS RLNMLEHWKS GEQDVHLSMR
RPDWLRQPQD GETMVTYDLT KEAFQ
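
The protein sequence can be reproduced from the gene sequence with a minimal translation sketch. It assumes the gene sequence is given 5' to 3' on the coding strand and relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code. Table 11 also allows alternative start codons, which this sketch does not handle, but this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in NCBI order (bases T, C, A, G), paired with the
# amino-acid string shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGGGCCCCC CGATGGATAC A"))  # MGPPMDT
print(translate("AAAGAGGCTT TCCAATAG"))      # KEAFQ
```

Passing the complete gene sequence to `translate` should yield the full protein sequence if the assumptions above hold.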