Gene Rpal_2669 details

Gene Information

Locus tag: Rpal_2669
Symbol:
ID: 6410332
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 2898652
End bp: 2900391
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 66%
IMG OID: 642712545
Product: Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase
Protein accession: YP_001991654
Protein GI: 192291049
COG category: [R] General function prediction only
COG ID: [COG0388] Predicted amidohydrolase
TIGRFAM ID:
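
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the reported gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of the record.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 2898652
END_BP = 2900391


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1740 579"
```

Both results agree with the Gene Length (1740 bp) and Protein Length (579 aa) fields.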


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.538211
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGCAAC TGCTGAAGGT CGCTACCGTC CAGTTCGAGC CGATCATGGC CGAGAAGGAG 
CGCAACGTCG CGCGTCTGCT CGAGCTGTGC GAGGAGGCGG CGGTGGGAGG CGCCAAACTG
ATCGTCACCC CGGAGATGGG CACCACCGGC TATTGCTGGT ACGACCGCGC CGAAGTGGCG
CCGTTCGTCG AGCCGATCCC GGGGCCGACC ACAGCACGGT TTGCCGCGCT GGCGCGCAAG
CACGATTGCT ACATCGTCGT CGGCCTTCCG GAGGTCGATG AGGACGGCAT CTATTACAAT
TCGGCCGTCC TGATCGGCCC GGAGGGATTG ATCGGGCGTC ACCGCAAGAC GCATCCGTAT
ATCTCTGAGC CGAAATGGTC GGCGGCGGGC GATCTGCACA ATCAGGTGTT TGACACGCCG
ATCGGGCGGA TCGCGCTGCT GATCTGCATG GACATCCATT TCGTCGAGAC TGCCCGGTTG
ATGGCCCTCG GTGGCGCCGA CATCATCTGT CACATCTCGA ACTGGCTGGC GGAGCGTACC
CCGGCGCCGT ACTGGATCAG CCGGGCGTTC GAGAACTCCT GCTACGTCAT CGAGAGCAAC
CGCTGGGGGC TTGAGCGGAC CGTGCAGTTC TCCGGCGGAA GCTGCGTGAT CGCGCCGGAC
GGCGGCATCG CTGCGGTGAT CGATGGCGGC GACGGTGTGG CCTTCGCCGA AATCGATCTG
GACACTGCGC GCGCGCGCCA GATCGGCGGC GAGGCGGTGT TTCGGCAGCG GCGGCCGGAG
CTGTATCCGG AGTTGCTGAC CGGCACCTTC AGCTGGAATC CGTACGACTT CTTCGGCCTG
TACGGACACG AGGCCTGGCC GAAGGGCAAG CGCTCCAAGC TCAGCGGCGC GCAGTTCGCG
CCGGTCGCCG ATCTCAGTGC CAATCTCGAC CGGATCGAGG CGTTGGCACG CCAGGCGAAG
GCGGATGGCG CCGAGATGGT GGTGTTTCCG GAACGGAGCC TTACCGGACT GGATGATCCG
GCGCGTACTG CCGTCGCTGT GCCTGGCCCC GCGACCGACC GGCTCGCCGC GCTGGCAAGC
GAGCTGTCGC TGTATCTCGT CTGCGGTCTC GCCGAACGCG ACGGCGATAT CCTGTACAAC
AGTGCCGTGC TGATCGCGCC GGGCGGCACC ATCACCACCT ATCGCAAGAC GCATCTGACC
GAGGACGAGC GGGGCTGGGC GCAGCCTGGC GACAGCTTCG TCGTGAGCGA TACGCCGCTT
GGCCGCGTCG GCCTGCTGAT CGGCCACGAT GCGATGTTTC CTGAAGCCGG GCGCGTGCTG
GCGCTCCGCG GCTGTGACAT CATCGCGTGC CCGGCGGCGA TCGAGACCCG GTTCAGCACG
CCGCACGCCG GCACCAGCGT CAAACAGTCG GCACCGATCC CGACTGGCGC CGATCCGCAC
CATTGGCATC ACTTCCGCGT CCGCGCCGGC GAGAACAATG TGTTCTTCGC TTTCGCCAAT
GTGGTGGATA GAGCGCGCGG CTATCCCGGG CTGAGCGGCG TGTTTGGGCC GGATACGTTC
GAATTCCCGC GCCGCGAGGC ACTGATCGGG AGCGAGGAGG GCATTGCCAC CGCGATGATC
GACACCTCCA ATCTCGACAG CGTGTATCCG ACCAATGTGG TGCGGCGGAA GGATCTGGTG
GCGATGCGGA TGCCGCACAG CTATCGGCCG CTGGTGCAGG CGATGGCCGG CAACTACTAA
 
Protein sequence
MSQLLKVATV QFEPIMAEKE RNVARLLELC EEAAVGGAKL IVTPEMGTTG YCWYDRAEVA 
PFVEPIPGPT TARFAALARK HDCYIVVGLP EVDEDGIYYN SAVLIGPEGL IGRHRKTHPY
ISEPKWSAAG DLHNQVFDTP IGRIALLICM DIHFVETARL MALGGADIIC HISNWLAERT
PAPYWISRAF ENSCYVIESN RWGLERTVQF SGGSCVIAPD GGIAAVIDGG DGVAFAEIDL
DTARARQIGG EAVFRQRRPE LYPELLTGTF SWNPYDFFGL YGHEAWPKGK RSKLSGAQFA
PVADLSANLD RIEALARQAK ADGAEMVVFP ERSLTGLDDP ARTAVAVPGP ATDRLAALAS
ELSLYLVCGL AERDGDILYN SAVLIAPGGT ITTYRKTHLT EDERGWAQPG DSFVVSDTPL
GRVGLLIGHD AMFPEAGRVL ALRGCDIIAC PAAIETRFST PHAGTSVKQS APIPTGADPH
HWHHFRVRAG ENNVFFAFAN VVDRARGYPG LSGVFGPDTF EFPRREALIG SEEGIATAMI
DTSNLDSVYP TNVVRRKDLV AMRMPHSYRP LVQAMAGNY
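
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way such a block could be cleaned, its GC fraction computed, and its reading frame translated. It assumes the amino-acid assignments of NCBI translation table 11, which match the standard genetic code (table 11 differs in its set of permitted start codons, and this gene starts with ATG). All function and variable names are illustrative.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean("ATGTCGCAAC TGCTGAAGGT CGCTACCGTC")
print(translate(first_blocks))  # prints "MSQLLKVATV"
```

The translated residues match the first block of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare with the GC content field.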