Gene Information |
Locus tag | PICST_34891 |
Symbol | ALN1 |
ID | 4836964 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 589621 |
End bp | 591525 |
Gene Length | 1905 bp |
Protein Length | 579 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640388279 |
Product | Allantoinase |
Protein accession | XP_001382874 |
Protein GI | 150864159 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR03178] allantoinase |
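The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only a consistency check on the listed values. It assumes the coordinates are inclusive and that the listed gene sequence includes the stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 589621
end_bp = 591525
gene_length_bp = 1905
protein_length_aa = 579

# Inclusive coordinates: span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Coding length: one codon per residue plus one stop codon
# (assumption: the gene sequence includes the stop codon).
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that are not accounted for by coding sequence.
noncoding_bp = gene_length_bp - coding_bp

print(span_bp, coding_bp, noncoding_bp)  # 1905 1740 165
```

The 165 bp remainder is consistent with the gene being flagged as spliced: the gene span is longer than the coding sequence implied by the protein length.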
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.93559 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGTG CTATTTCCTC CACTCAGGTA CTTATTGGCA CTGAAGTTGT TCCAGGCACC GTAATTTTTC TGATTGAATC AGGCAAAATT CTCTACATTG AGTCAGGAAA GCAGCTTGAA GTCGACGATC CGCTTCTTGA GCTCTATAAC GTATTACCGG TAGACCACAG AGATGTTTCA CCATTGGTAG TTATGCCTGG TTTGGTGGAT GCTCATGTCC ATCTTAACGA GCCGGGAAGA ACAGAATGGG AAGGTTTTGC AACGGGAACA AAGGCTGCTG CTGCTGGCGG TGTCACCACA GTGATAGATA TGCCTCTCAA TGCCATTCCA CCCACTACGA CAATTGCTAA CTTTAATCTC AAGATCGATG CTGCGAAAGA TCAGACTTGG GTTGATGTAG GATTCTGGGG AGGATTGGTT CCAGACAATT TGCACCATTT ACGTCCCTTG ATTTCAATGG GTGTTAGAGG GTTCAAGGGG TTTATGATCG AAAGTGGAGT AGACGAGTTT CCAGCTATAG ATCCCAGCTA TATAGTAAAG GCAATGGAGC GAGTGGAAGG TCAGAAGACG GTGTTGATGT TTCATGCTGA AATGCAACCT GGTCAAGCTT CGGGACCTGT TAGTGAAGAT TCCCTTAGAA TCGGACACCA GGGAGCTTCT TTATCCAGTG TTATATCGCC TGTGGATGTT CCTTTGACTC CTAAAGCCTC TAAAGGTTTC TTTTTGCAAG AAGATGAGGA TGAAACTGTA AAAAGATCTA CTCAAGTGGA GCCTCTTTCG GGTGATATAG AGTCTATCAG CTTGGGAATG TCTGCTTCTT TTATCCAACG AGCCCCCAAA CCGGTGGTAG AGTTGTTGTC GATGGACGGA AGCGGAAACT GTCAGGAAGA GCATGAAAAC TGCCACTTGC CTCACAACCA CGCCGGTTCC ATTGATCACA AAGTCTTATC AGATGCTCAG GCTACAGCCT TGGCCAAAAG TCCGATCTTA GCAGCTGTCG AACCCACCTT TGGAAAGTTT GCCAGAAAGG CTAACCACTT TGATTCGCCT TTTTTCAGGG CTGTAGAAGA AAAGCCCTTG GATTCTCCTC TCTTGATAGC CCAGAGTGAA GACGCCCTTT TGGAAGATAT CGACCCAACA GCCTACGCAT CATTTTTGGC TTCAAGGCCC GACAACTTTG AGACTACGGC TATTGCAGAA ATCATCAACT GCTCTACAAA GTTTCCTACT GTTCCTTTGC ACATTGTACA TTTGGCCACT CATGAAGCTG TTCCGTTGAT CAGAGCAGCC AAAGCCAAGG GTTTGCCTAT CACCGCGGAA ACATGTTTCC ATTACTTGTC GTTGTACGCC GAGTCCATTG CTAACTGCTC CACTCATTTC AAGTGCTGTC CACCCATAAG AACTAACGAT AATAGAAAGC TTTTGTGGCG AGCACTTAGA AATGATATCA TCACCACTGT GGTCTCAGAC CACTCGCCTT GTACTCCAGA CTTGAAGGGT TTAGAAAAGG GAGACTTCTT CGAGGCCTGG GGAGGTATCT CTTCTGTGGG TTTTGGATTG CCGATATTGT ACACTGAGGG AAAGAAGTTG TCCCCTCCAA TTACCTTTGC TGAGATCAAC AAATGGTGTT CGCTCAATAC TGCCAAGCAA GTTGGTTTGA GTCACAGAAA AGGTAAGCTT GCTGTAGGCT ATGATGCTGA CTTGTTGGTT TTTGATCCTA ACGACAAGTA CATTGTCCAG AATCAAGACA CCTACTTCAA AAACAAGTTG ACCGCCTACG CTGGAAAGGA ATTCCTGGGC 
AGAGTCATCG AAACCATTGT TGGAGGTAAT TCTGTGTATG CTTTTGGAAA AGGGCATTCT GATGTTCCAA TGGGTAAGTT AATCTTGGAG CCAAGATTTG CATAA
|
Protein sequence | MSRAISSTQV LIGTEVVPGT VIFSIESGKI LYIESGKQLE VDDPLLELYN VLPVDHRDVS PLVVMPGLVD AHVHLNEPGR TEWEGFATGT KAAAAGGVTT VIDMPLNAIP PTTTIANFNL KIDAAKDQTW VDVGFWGGLV PDNLHHLRPL ISMGVRGFKG FMIESGVDEF PAIDPSYIVK AMERVEGQKT VLMFHAEMQP VEPLSGDIES ISLGMSASFI QRAPKPVVEL LSMDGSGNCQ EEHENCHLPH NHAGSIDHKV LSDAQATALA KSPILAAVEP TFGKFARKAN HFDSPFFRAV EEKPLDSPLL IAQSEDALLE DIDPTAYASF LASRPDNFET TAIAEIINCS TKFPTVPLHI VHLATHEAVP LIRAAKAKGL PITAETCFHY LSLYAESIAN CSTHFKCCPP IRTNDNRKLL WRALRNDIIT TVVSDHSPCT PDLKGLEKGD FFEAWGGISS VGFGLPILYT EGKKLSPPIT FAEINKWCSL NTAKQVGLSH RKGKLAVGYD ADLLVFDPND KYIVQNQDTY FKNKLTAYAG KEFSGRVIET IVGGNSVYAF GKGHSDVPMG KLILEPRFA
|
| |
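Translation table 12 (the NCBI alternative yeast nuclear code) reads the codon CTG as serine rather than leucine, which is why the protein sequence shows S at positions where the gene sequence has CTG. The following minimal sketch illustrates this on the first 90 bases of the gene sequence, which lie before the intron. The helper `translate` and the table names are illustrative assumptions, not part of the record or of any library.

```python
# Standard genetic code in NCBI order (bases T, C, A, G).
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
STANDARD = dict(zip(CODONS, STANDARD_AAS))

# Table 12 differs from the standard code in its sense codons only at CTG (Ser).
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna, table):
    """Translate a DNA string codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 90 bases of the gene sequence, copied from the record.
first_90 = ("ATGTCCCGTG CTATTTCCTC CACTCAGGTA CTTATTGGCA CTGAAGTTGT "
            "TCCAGGCACC GTAATTTTTC TGATTGAATC AGGCAAAATT")

print(translate(first_90, TABLE_12))  # MSRAISSTQVLIGTEVVPGTVIFSIESGKI
print(translate(first_90, STANDARD))  # MSRAISSTQVLIGTEVVPGTVIFLIESGKI
```

Only the table 12 result matches the first 30 residues of the listed protein sequence. Because the gene is spliced, translating the full gene sequence in one pass would not reproduce the protein; the intron would have to be removed first.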