Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_30010 |
Symbol | NRL1 |
ID | 4837424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 1728368 |
End bp | 1729351 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640388739 |
Product | Nitrilase, arylacetone-specific (Arylacetonitrilase) |
Protein accession | XP_001383097 |
Protein GI | 126133144 |
COG category | [R] General function prediction only |
COG ID | [COG0388] Predicted amidohydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.224947 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCCT CAGTTTATCC AAAATTGAAA GTTGCTGCTG TTCAAGCTGC TCCAGTTTAT CTTAATTTAG AAGCTACAAT TGCAAAGTCT GTTAAGCTTA TTGAAGAAGC TGCTGCTAAT GGTGCAAAGT TGGTCGCTTT TCCAGAAGCT TTTGTTCCTG GTTACCCTTG GTTTGCCTTC ATTGGTCACC CAGAGTACAC TAGGAAGTGG TACCACAAGT TGTACAAGAA TGCCTTGGAA ATTCCTAGTC CTGCCATTCA AAAGATTTCC AACGCAGCCA GAGACAATGA TATTTTTGTG TGTATTTCTG GTTCAGAAAA GGACAATGGT TCCTTGTTCT TGTGCCAATT GTGGTTTGAC AATAAAGGAA ATTTGATTGG AAAGCACAGA AAGATGAGAG CTTCTGTTGC TGAAAGATTG GTCTGGGGTG ATGGTTGTGG TTCTTTACTT CCAGTCATGA AGACTGAAAT TGGAAACTTG GGAGGCTTGA TGTGCTGGGA ACATCAAGTT CCTTTGGATC TTGCCGCTAT GAATAACCAG AACGAGCAGA TTCATGTTGC TGCTTGGCCA GGATATTTTG ACGATGAAAT CTCTTCTCGT TACTATGCTA TTTCTACTCA AAGCTTTGTT GTCATGACAT CGTCCATTTA TAGTGAAGAA ATGAAGCAGT TAATCTGTGA AGACGCAGAA CAAAGAAAGT ATTTTGATTC TTTCAAGAGT GGTCACACTT GTATCTACGG TCCTGATGGG GAGCCAGTTT CAGAAATGAT TCCTGCTGAA ACGGAAGGTA TTGCCTACGC TGACATCGAT ATTGCCAGAA CTATCGACTT CAAATACTAC ATTGACCCTG CTGGCCACTA CAGTAACAAA TCCTTGACTG CTACGCACAA TGTTTCCGAT ACCAGACCAA TCAAGCAAAT CGGCTCCTCT CCTTCCCAAT TCATTGGCCA CGATGACTTG AACAGAGTCG ACGTTGCAGC TTAA
|
Protein sequence | MSASVYPKLK VAAVQAAPVY LNLEATIAKS VKLIEEAAAN GAKLVAFPEA FVPGYPWFAF IGHPEYTRKW YHKLYKNALE IPSPAIQKIS NAARDNDIFV CISGSEKDNG SLFLCQLWFD NKGNLIGKHR KMRASVAERL VWGDGCGSLL PVMKTEIGNL GGLMCWEHQV PLDLAAMNNQ NEQIHVAAWP GYFDDEISSR YYAISTQSFV VMTSSIYSEE MKQLICEDAE QRKYFDSFKS GHTCIYGPDG EPVSEMIPAE TEGIAYADID IARTIDFKYY IDPAGHYSNK SLTATHNVSD TRPIKQIGSS PSQFIGHDDL NRVDVAA
|
| |