Gene Rpal_1751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1751 
Symbol 
ID6409408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1879217 
End bp1880266 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content63% 
IMG OID642711639 
ProductNitrilase 
Protein accessionYP_001990754 
Protein GI192290149 
COG category[R] General function prediction only 
COG ID[COG0388] Predicted amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.134006 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTCCTC ACTTCAAAGC CGCTGCGATA CACGCCGCGC CCGTATTCCT CGATAAGACT 
GCGACTACAA AAAAGGCGAT CTCGCTCATC CGTGAGGCAG TCGCTGCGGG TGCGGAGCTG
GTTGCATTTC CGGAGACTTA CATCCCGGCG TTTCCGGTTT GGGCGGCGTT GTGGGCGCCG
ATCGACAACC ACGATCTGTT CGTGCGAATG GCTGATCAGT CGGTGCTGAT CGATGGTCCC
GAGGTGAAAG CGATCCGGGA CGAGGCTCGG CGGCTCGGCG TCGTGGTGTC GATCGGTATC
AGCGAGAAAT CGCCGGCCAG CGTGGGTGGG ATCTGGAACT CCAATCTATT GATCGGCGAG
GACGGCGAGA TCCTCAACCA TCACCGTAAG CTGGTTCCGA CCTTCTACGA GAAGCTGATC
TGGAGCGCCG GTGACGGCGC GGGTCTCCGC GTCGTCGACA CGCGGCTCGG CAAGATCGGT
CAATTGATCT GCGGCGAAAA CACCAATCCG CTGGCGCGCT ATGCATTGAT GGCGCAGGGC
GAGCAGTTCC ATATCTCGAG CTGGCCGCCG GTCTGGCCGA CCCGGCGTCC GGCCGAAGGC
GGAAACTATC ACATTGCGGC GGCGACCCGG ATTCGCGCCA GCGCGCATTG CTTCGAAGCG
AAGGTCTTTG GTCTTGTCAC GTCCGGCGTG CTCGACAAGG CCGCGCGCGA CATGCTGGTG
GCGCGCGATC CGTCGGCCGC AGCCGTGCTC GACGGCACGC CGCGCGCGGC GACATTCTTC
TTGGACCCGA CAGGCGAGCA GATCGGCGAA GCGCTCTGCG AGGACGAGGG CATTCTGTAT
GCCGATATCG ATCTCACCCG ATGCGTCGAG CCCAAGCAAT TTCACGACGT GGTCGGCTAC
TACAACCGGT TCGATGTTTT CGCCGTCAGC ATCAGCCGTC ACCGGCTGAC GCCGGCGACG
TTCATCGACG ATCTGCCACT CCCCGCAGTC GTGGACAATG TCGAAGACAA GGTCGGGCGC
GCGCCGAACG CGGCGCCCGT AGCCCTTTAA
 
Protein sequence
MLPHFKAAAI HAAPVFLDKT ATTKKAISLI REAVAAGAEL VAFPETYIPA FPVWAALWAP 
IDNHDLFVRM ADQSVLIDGP EVKAIRDEAR RLGVVVSIGI SEKSPASVGG IWNSNLLIGE
DGEILNHHRK LVPTFYEKLI WSAGDGAGLR VVDTRLGKIG QLICGENTNP LARYALMAQG
EQFHISSWPP VWPTRRPAEG GNYHIAAATR IRASAHCFEA KVFGLVTSGV LDKAARDMLV
ARDPSAAAVL DGTPRAATFF LDPTGEQIGE ALCEDEGILY ADIDLTRCVE PKQFHDVVGY
YNRFDVFAVS ISRHRLTPAT FIDDLPLPAV VDNVEDKVGR APNAAPVAL