Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_36697 |
Symbol | NIT1 |
ID | 4840390 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 788164 |
End bp | 789087 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640391705 |
Product | nitrilase |
Protein accession | XP_001385512 |
Protein GI | 150866043 |
COG category | [R] General function prediction only |
COG ID | [COG0388] Predicted amidohydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.801519 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTAAAC ACGTCGTAGC TGCATTGCAA ATCGGAGCTG ATCCCAAGGG AACCCAAGCG ACCCTTGGAA AGATCTTGAG CTACGAAGCT GAGTTGAAGG AAAAAAAAGT TGAATTGGTT GTGATCCCTG AAGCCACTTT GGGAGGATAT CCTAAGGGAT CGCACTTTGG GACATATTTG GGCTATAGAT TGCAAGCTGG CAGAGAGAAG TTTGCTGAAT ATTTCAAGGG TGCAATTGAT GTTCCTGGTC CGGAAATTGA GCAATTGGAA ACATTAAGTG CTTCCACAGG AGCGTCCATT GTGATTGGAG TAATCGAAAG AGGTGGTTCT TCATTGTACT GTACCGCTGT ATACATTGAC TCTGTAAAGG GTTATGTTGG CAAACACAGA AAGATTATGC CTACTGCAAC TGAGAGATTG ATCTGGGGTC AAGGAGATGG TTCTACATTA ATTACGGCCG ATTTTGAAGG TTTGGGTAAA GTTGGTGGTG CTATCTGCTG GGAGAACTTT ATGCCTTTGT TGAGAGCTTC CTTCTACGCG AAAGGACTCA ACATCTACGT TGCACCTACA GTGGATGATA GAGATGGATG GACAGCTTTG ATCAGAACTA TCGGAAACGA GGGTCGTCTT TTTGTGGTCT CTGCAGTTGC ATTCTTGCCT ACAGCACAAG CTGCCCAATT GGACATGCCT GGCTGGCCAG AAGGAAAGAA TGCTATCGAT GGAGGTTCGC TCATCGTCAA TCCTTACGGA GATATCATTG CTGGACCATT GAGGGGTAAA GAAGGTTTGT TGACTGCTGA GATTGACTAC GATATCATTC CTCAGGCCAA ATACGACATG GATCCAGTGG GCCATTATCT GAGAGGAGAC ATTTTCCAGT TGACAGTAGA CCAGACCCCG AGAGATGCTG TTGTCTTCAA GTAG
|
Protein sequence | MSKHVVAALQ IGADPKGTQA TLGKILSYEA ELKEKKVELV VIPEATLGGY PKGSHFGTYL GYRLQAGREK FAEYFKGAID VPGPEIEQLE TLSASTGASI VIGVIERGGS SLYCTAVYID SVKGYVGKHR KIMPTATERL IWGQGDGSTL ITADFEGLGK VGGAICWENF MPLLRASFYA KGLNIYVAPT VDDRDGWTAL IRTIGNEGRL FVVSAVAFLP TAQAAQLDMP GWPEGKNAID GGSLIVNPYG DIIAGPLRGK EGLLTAEIDY DIIPQAKYDM DPVGHYSRGD IFQLTVDQTP RDAVVFK
|
| |