Gene Rleg_0020 details


Gene Information

Locus tag: Rleg_0020
Symbol:
ID: 8011268
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 18450
End bp: 19442
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 57%
IMG OID: 644822611
Product: Nitrilase
Protein accession: YP_002973871
Protein GI: 241202775
COG category: [R] General function prediction only
COG ID: [COG0388] Predicted amidohydrolase
TIGRFAM ID:
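
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is an untranslated stop codon; both are assumptions about how this record reports its values, not something the record states.

```python
# Values copied from the gene record.
start_bp = 18450
end_bp = 19442

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the reported gene length of 993 bp and protein length of 330 aa.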


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0301643
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAATCG TAAAGGCCGC CGCGGTCCAG ATCAGTCCCT CGCTTTACAG CCGTGAAGAA 
ACGGTCGACA AAGTCGTCAC CAAGATCGCC GACCTTGGTG ACAAGGGGGT CCAGTTCGCG
ACCTTTCCCG AAACGGTCGT CCCATATTAC CCGTATTTCT CCTTCGTCCA GTCCGCCTAT
GACTTGCGGA CAGGAAAAGA GCATCTGCGA TTGCTGGATC AATCGGTCAC CATCCCATCT
GACACCACAC GCACCATCGC TGAAGCCTGT AAGCGAGCGA GGGTGGTCGT TTCCATAGGG
GTCAATGAAC GCGACGGGGG CACGATCTAC AATACCCAGT TGCTGTTTGA TGCCGATGGT
ACTTTGTTGC AGCGACGCCG CAAGATTTCA CCGACCTTCC ATGAAAGGAT GATCTGGGGA
TATGGAGACG GGTCTGGCCT TCGGGCTGTC GACAGCGCGG TGGGACGTAT CGGCCAACTC
GCATGCTGGG AGCATTACAA TCCGCTTGCG CGCTTCGCGC TCATGGCTGA TGGCGAGCAA
ATTCACTCGG CAATGTATCC CGGCTCGTTT GGGGGGGATC TGTTTTCCGA ACAGATGGCT
GTCAACATCC GGCAGCACGC GCTGGAATCC GGTTGTTTCG TGGTCAATGC AACAGCCTGG
CTCGACCCGC AGCAACAGGC CCAGGTCATG GAAGACACCG GCTGTAGTAT CGGTCCGATT
TCCAGCGGCT GCTTTACCGC GATTGTCGCA CCGGACGGCA GCTTGATCGA GGAACCATTG
CGCTCAGGCG AAGGCGTCGT GATTGCTGAT CTCGACTTCA CCCTGATCGA CAAACGCAAA
CAGCTGATGG ATTCACGCGG ACACTATAGC CGGCCCGAAC TGCTCAGTCT GTTGATCGAT
CGTACGCCGA CAATTCACGT GCATGAGCGC ATCACGCCGT CCGTGCCGAC GAATACTGCC
GAGGTCACTG AAGGAGGTCC TGCGTTGGTC TGA
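
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs cleaning before its length or GC content can be computed. The following is a minimal sketch; the helper names `clean_sequence` and `gc_content` are hypothetical and chosen for illustration, and only the first displayed line is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace from a displayed sequence block and uppercase it."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in an already-cleaned sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "ATGACAATCG TAAAGGCCGC CGCGGTCCAG ATCAGTCCCT CGCTTTACAG CCGTGAAGAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 1))
```

Passing the full sequence block to `clean_sequence` in the same way should give a string whose length matches the reported gene length and whose GC percentage can be compared with the reported 57%.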
 
Protein sequence
MTIVKAAAVQ ISPSLYSREE TVDKVVTKIA DLGDKGVQFA TFPETVVPYY PYFSFVQSAY 
DLRTGKEHLR LLDQSVTIPS DTTRTIAEAC KRARVVVSIG VNERDGGTIY NTQLLFDADG
TLLQRRRKIS PTFHERMIWG YGDGSGLRAV DSAVGRIGQL ACWEHYNPLA RFALMADGEQ
IHSAMYPGSF GGDLFSEQMA VNIRQHALES GCFVVNATAW LDPQQQAQVM EDTGCSIGPI
SSGCFTAIVA PDGSLIEEPL RSGEGVVIAD LDFTLIDKRK QLMDSRGHYS RPELLSLLID
RTPTIHVHER ITPSVPTNTA EVTEGGPALV
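
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table from the compact NCBI amino acid string; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its set of permitted start codons, which does not matter here because this gene begins with ATG. The function name `translate` is an illustrative choice, and unknown codons are mapped to `X` as an assumption.

```python
from itertools import product

# NCBI codon ordering: first, second, third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # assumption: X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGACAATCG TAAAGGCCGC CGCGGTCCAG"))
```

The first 30 bases translate to MTIVKAAAVQ, which matches the first block of the protein sequence shown in the record.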