Gene RPC_3014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3014 
Symbol 
ID3973621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3310218 
End bp3311957 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content66% 
IMG OID637926125 
ProductNitrilase/cyanide hydratase and apolipoprotein N-acyltransferase 
Protein accessionYP_532878 
Protein GI90424508 
COG category[R] General function prediction only 
COG ID[COG0388] Predicted amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.55748 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCGCA AGATCCAGGT TGCCACCGTC CAGTTCGAAC CCACCATGTT CGAAAAGGAG 
CGCAACATCG CCGGTCTCCT CGCTCTCTGC GAGCAGGCGG CGCAATCCGG CGCCCGGTTG
ATCGTGACGC CGGAAATGGG CACCACCGGA TATTGCTGGT TTGATCGCGC CGAGGTTGCG
CCCTATGTCG AGCCGATTCC CGGGCCGAGC ACCGATCGCT TCGCCGCGCT GGCGCGAAAA
TACGATTGCT ACATCGTCGT CGGCCTGCCG GAGGTCGACG ACGACGGCAT CTATTTCAAC
TCCGCGGTGC TGATCGGGCC GGAGGGCGTG ATCGGCCGGC ATCGCAAGAC CCATCCCTAT
ATTGCCGAGC CGAAATGGTC GGCGGCAGGG GATCTGCACA ACCAGGTGTT CGAGACGCCG
ATCGGGCGCA TCGCCATCTT GATTTGCATG GATATCCATT TCATCGAGAC CGCGCGCTTG
ATGGCGCTGG GTGGCGCCGA CATCATTTGC CACATCTCGA ATTGGCTGGC CGAGCGGACC
CCGGCGCCGT ACTGGATCAG CCGGGCGTTC GAGAACGGCT GCTACGTGAT CGAAAGCAAC
CGCTGGGGGC TGGAGCGCAC AGTGCAGTTT TCCGGCGGCA GCTGCGTGAT CGCGCCGGAT
GGCGGGATCG CCGCCGTCAT CGATGGCGGC GACGGCGTGG CGATGGCGGA GATCGATCTC
GATCTGGCGC GGGCGCGCCG TGTCGCCGGC GAGCCGGTGT TCCAGCGGCG CCGGCCGGAG
CTTTACCCTG AGCTGCTGAC CGACACCTTC AGCTGGAATC CGCGCGATTT CTTCAAGCTG
TACGGTCATC AGCCATGGCC GGAGGGCAAG TCGTCGCGGG TCAGCGTCGC GCAATTTGCG
CCGAGCTCCG ATGTGGACGG CAATCTCGAT CACATCGACG CGCTGGCTCG GCAAGCCAAG
GCCGACGGGG TCGAGCTTGT TGTGTTTCCG GAACTGGCGA TCAGCGGTCT GATCGACCCG
GCGCAAGCTG CGCAGGCGAT TCCGGGCCCG GCGACCGATC GGCTCGGCGA CCTCGCCAAG
CAGCTGTCGC TCTATCTGGT CTGCGGCATC GCCGAGCGAG CCGGCGAACT CACCTACAAC
AGCGCGGTCC TGATCGCACC GGACGGCGCA TGGACGGTCT ATCGCAAGAC GCATCTCACC
GAAGACGAGC GCAGCTGGGC GACCGCAGGC GACGACTGGA CCGTGGTCGA TACGCCGCTC
GGTCGGATTG GCCTGCTGAT CGGTCATGAC GCGATGTTTC CGGAAGCAGG CCGCGTGCTG
GCGCTGCGCG GCTGCGATCT GATCGTCTGC CCGGCGGCGA TCGCGACCCG GTTCAGCTCG
CCGCATGCCG GCACGGCGGT CGCGCAGCCG GCGCCGATCC CGACCGGGGC CGATCCGTAT
CACTGGCATC ACTTCCGCGT CCGCGCTGGC GAGAACAACG TGTTCTTCGC CTTCGCCAAT
GTGATCGATC CTGCGCGCGG TTACGCCGGC CTGAGCGGCG TGTTCGGCCC CGACACCTTT
GCCTTTCCAC GCCGCGAAGC CATGGTCGAA GACGGCGAGG GCGTCGCCAC GGCGGTGATC
GACACCAGCA ATCTCGACAG CGTCTATCCG ACCAATGTGG TGCGGCGGAA GGACCTGGTG
GCGATGCGGA TGCCGCACAG CTATCGGCCG CTGATCCAGG CGGTGGCGGG AAATTTCTGA
 
Protein sequence
MSRKIQVATV QFEPTMFEKE RNIAGLLALC EQAAQSGARL IVTPEMGTTG YCWFDRAEVA 
PYVEPIPGPS TDRFAALARK YDCYIVVGLP EVDDDGIYFN SAVLIGPEGV IGRHRKTHPY
IAEPKWSAAG DLHNQVFETP IGRIAILICM DIHFIETARL MALGGADIIC HISNWLAERT
PAPYWISRAF ENGCYVIESN RWGLERTVQF SGGSCVIAPD GGIAAVIDGG DGVAMAEIDL
DLARARRVAG EPVFQRRRPE LYPELLTDTF SWNPRDFFKL YGHQPWPEGK SSRVSVAQFA
PSSDVDGNLD HIDALARQAK ADGVELVVFP ELAISGLIDP AQAAQAIPGP ATDRLGDLAK
QLSLYLVCGI AERAGELTYN SAVLIAPDGA WTVYRKTHLT EDERSWATAG DDWTVVDTPL
GRIGLLIGHD AMFPEAGRVL ALRGCDLIVC PAAIATRFSS PHAGTAVAQP APIPTGADPY
HWHHFRVRAG ENNVFFAFAN VIDPARGYAG LSGVFGPDTF AFPRREAMVE DGEGVATAVI
DTSNLDSVYP TNVVRRKDLV AMRMPHSYRP LIQAVAGNF