Gene PICST_30010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30010 
SymbolNRL1 
ID4837424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1728368 
End bp1729351 
Gene Length984 bp 
Protein Length327 aa 
Translation table12 
GC content42% 
IMG OID640388739 
ProductNitrilase, arylacetone-specific (Arylacetonitrilase) 
Protein accessionXP_001383097 
Protein GI126133144 
COG category[R] General function prediction only 
COG ID[COG0388] Predicted amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.224947 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCCT CAGTTTATCC AAAATTGAAA GTTGCTGCTG TTCAAGCTGC TCCAGTTTAT 
CTTAATTTAG AAGCTACAAT TGCAAAGTCT GTTAAGCTTA TTGAAGAAGC TGCTGCTAAT
GGTGCAAAGT TGGTCGCTTT TCCAGAAGCT TTTGTTCCTG GTTACCCTTG GTTTGCCTTC
ATTGGTCACC CAGAGTACAC TAGGAAGTGG TACCACAAGT TGTACAAGAA TGCCTTGGAA
ATTCCTAGTC CTGCCATTCA AAAGATTTCC AACGCAGCCA GAGACAATGA TATTTTTGTG
TGTATTTCTG GTTCAGAAAA GGACAATGGT TCCTTGTTCT TGTGCCAATT GTGGTTTGAC
AATAAAGGAA ATTTGATTGG AAAGCACAGA AAGATGAGAG CTTCTGTTGC TGAAAGATTG
GTCTGGGGTG ATGGTTGTGG TTCTTTACTT CCAGTCATGA AGACTGAAAT TGGAAACTTG
GGAGGCTTGA TGTGCTGGGA ACATCAAGTT CCTTTGGATC TTGCCGCTAT GAATAACCAG
AACGAGCAGA TTCATGTTGC TGCTTGGCCA GGATATTTTG ACGATGAAAT CTCTTCTCGT
TACTATGCTA TTTCTACTCA AAGCTTTGTT GTCATGACAT CGTCCATTTA TAGTGAAGAA
ATGAAGCAGT TAATCTGTGA AGACGCAGAA CAAAGAAAGT ATTTTGATTC TTTCAAGAGT
GGTCACACTT GTATCTACGG TCCTGATGGG GAGCCAGTTT CAGAAATGAT TCCTGCTGAA
ACGGAAGGTA TTGCCTACGC TGACATCGAT ATTGCCAGAA CTATCGACTT CAAATACTAC
ATTGACCCTG CTGGCCACTA CAGTAACAAA TCCTTGACTG CTACGCACAA TGTTTCCGAT
ACCAGACCAA TCAAGCAAAT CGGCTCCTCT CCTTCCCAAT TCATTGGCCA CGATGACTTG
AACAGAGTCG ACGTTGCAGC TTAA
 
Protein sequence
MSASVYPKLK VAAVQAAPVY LNLEATIAKS VKLIEEAAAN GAKLVAFPEA FVPGYPWFAF 
IGHPEYTRKW YHKLYKNALE IPSPAIQKIS NAARDNDIFV CISGSEKDNG SLFLCQLWFD
NKGNLIGKHR KMRASVAERL VWGDGCGSLL PVMKTEIGNL GGLMCWEHQV PLDLAAMNNQ
NEQIHVAAWP GYFDDEISSR YYAISTQSFV VMTSSIYSEE MKQLICEDAE QRKYFDSFKS
GHTCIYGPDG EPVSEMIPAE TEGIAYADID IARTIDFKYY IDPAGHYSNK SLTATHNVSD
TRPIKQIGSS PSQFIGHDDL NRVDVAA