Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_0020 |
Symbol | |
ID | 8011268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 18450 |
End bp | 19442 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644822611 |
Product | Nitrilase |
Protein accession | YP_002973871 |
Protein GI | 241202775 |
COG category | [R] General function prediction only |
COG ID | [COG0388] Predicted amidohydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0301643 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAATCG TAAAGGCCGC CGCGGTCCAG ATCAGTCCCT CGCTTTACAG CCGTGAAGAA ACGGTCGACA AAGTCGTCAC CAAGATCGCC GACCTTGGTG ACAAGGGGGT CCAGTTCGCG ACCTTTCCCG AAACGGTCGT CCCATATTAC CCGTATTTCT CCTTCGTCCA GTCCGCCTAT GACTTGCGGA CAGGAAAAGA GCATCTGCGA TTGCTGGATC AATCGGTCAC CATCCCATCT GACACCACAC GCACCATCGC TGAAGCCTGT AAGCGAGCGA GGGTGGTCGT TTCCATAGGG GTCAATGAAC GCGACGGGGG CACGATCTAC AATACCCAGT TGCTGTTTGA TGCCGATGGT ACTTTGTTGC AGCGACGCCG CAAGATTTCA CCGACCTTCC ATGAAAGGAT GATCTGGGGA TATGGAGACG GGTCTGGCCT TCGGGCTGTC GACAGCGCGG TGGGACGTAT CGGCCAACTC GCATGCTGGG AGCATTACAA TCCGCTTGCG CGCTTCGCGC TCATGGCTGA TGGCGAGCAA ATTCACTCGG CAATGTATCC CGGCTCGTTT GGGGGGGATC TGTTTTCCGA ACAGATGGCT GTCAACATCC GGCAGCACGC GCTGGAATCC GGTTGTTTCG TGGTCAATGC AACAGCCTGG CTCGACCCGC AGCAACAGGC CCAGGTCATG GAAGACACCG GCTGTAGTAT CGGTCCGATT TCCAGCGGCT GCTTTACCGC GATTGTCGCA CCGGACGGCA GCTTGATCGA GGAACCATTG CGCTCAGGCG AAGGCGTCGT GATTGCTGAT CTCGACTTCA CCCTGATCGA CAAACGCAAA CAGCTGATGG ATTCACGCGG ACACTATAGC CGGCCCGAAC TGCTCAGTCT GTTGATCGAT CGTACGCCGA CAATTCACGT GCATGAGCGC ATCACGCCGT CCGTGCCGAC GAATACTGCC GAGGTCACTG AAGGAGGTCC TGCGTTGGTC TGA
|
Protein sequence | MTIVKAAAVQ ISPSLYSREE TVDKVVTKIA DLGDKGVQFA TFPETVVPYY PYFSFVQSAY DLRTGKEHLR LLDQSVTIPS DTTRTIAEAC KRARVVVSIG VNERDGGTIY NTQLLFDADG TLLQRRRKIS PTFHERMIWG YGDGSGLRAV DSAVGRIGQL ACWEHYNPLA RFALMADGEQ IHSAMYPGSF GGDLFSEQMA VNIRQHALES GCFVVNATAW LDPQQQAQVM EDTGCSIGPI SSGCFTAIVA PDGSLIEEPL RSGEGVVIAD LDFTLIDKRK QLMDSRGHYS RPELLSLLID RTPTIHVHER ITPSVPTNTA EVTEGGPALV
|
| |