Gene PICST_34891 details


Gene Information

Locus tag: PICST_34891
Symbol: ALN1
ID: 4836964
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 589621
End bp: 591525
Gene Length: 1905 bp
Protein Length: 579 aa
Translation table: 12
GC content: 46%
IMG OID: 640388279
Product: Allantoinase
Protein accession: XP_001382874
Protein GI: 150864159
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR03178] allantoinase
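
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the stop codon is counted in the gene length (the gene sequence in this record ends in TAA). The function names are hypothetical. The bases left over after accounting for the protein are consistent with the gene being marked as spliced, but the record itself does not give intron positions.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 589621
END_BP = 591525
GENE_LENGTH_BP = 1905
PROTEIN_LENGTH_AA = 579


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def coding_length(protein_aa, include_stop=True):
    """Bases needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_span = span_length(START_BP, END_BP)
cds_bp = coding_length(PROTEIN_LENGTH_AA)
non_coding_bp = GENE_LENGTH_BP - cds_bp

print("span from coordinates:", gene_span, "bp")
print("coding bases incl. stop:", cds_bp, "bp")
print("bases not accounted for by the protein:", non_coding_bp, "bp")
```

Under these assumptions the coordinate span equals the listed gene length, and 165 bp of the gene are not accounted for by the 579-residue protein plus its stop codon.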


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.93559
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCGTG CTATTTCCTC CACTCAGGTA CTTATTGGCA CTGAAGTTGT TCCAGGCACC 
GTAATTTTTC TGATTGAATC AGGCAAAATT CTCTACATTG AGTCAGGAAA GCAGCTTGAA
GTCGACGATC CGCTTCTTGA GCTCTATAAC GTATTACCGG TAGACCACAG AGATGTTTCA
CCATTGGTAG TTATGCCTGG TTTGGTGGAT GCTCATGTCC ATCTTAACGA GCCGGGAAGA
ACAGAATGGG AAGGTTTTGC AACGGGAACA AAGGCTGCTG CTGCTGGCGG TGTCACCACA
GTGATAGATA TGCCTCTCAA TGCCATTCCA CCCACTACGA CAATTGCTAA CTTTAATCTC
AAGATCGATG CTGCGAAAGA TCAGACTTGG GTTGATGTAG GATTCTGGGG AGGATTGGTT
CCAGACAATT TGCACCATTT ACGTCCCTTG ATTTCAATGG GTGTTAGAGG GTTCAAGGGG
TTTATGATCG AAAGTGGAGT AGACGAGTTT CCAGCTATAG ATCCCAGCTA TATAGTAAAG
GCAATGGAGC GAGTGGAAGG TCAGAAGACG GTGTTGATGT TTCATGCTGA AATGCAACCT
GGTCAAGCTT CGGGACCTGT TAGTGAAGAT TCCCTTAGAA TCGGACACCA GGGAGCTTCT
TTATCCAGTG TTATATCGCC TGTGGATGTT CCTTTGACTC CTAAAGCCTC TAAAGGTTTC
TTTTTGCAAG AAGATGAGGA TGAAACTGTA AAAAGATCTA CTCAAGTGGA GCCTCTTTCG
GGTGATATAG AGTCTATCAG CTTGGGAATG TCTGCTTCTT TTATCCAACG AGCCCCCAAA
CCGGTGGTAG AGTTGTTGTC GATGGACGGA AGCGGAAACT GTCAGGAAGA GCATGAAAAC
TGCCACTTGC CTCACAACCA CGCCGGTTCC ATTGATCACA AAGTCTTATC AGATGCTCAG
GCTACAGCCT TGGCCAAAAG TCCGATCTTA GCAGCTGTCG AACCCACCTT TGGAAAGTTT
GCCAGAAAGG CTAACCACTT TGATTCGCCT TTTTTCAGGG CTGTAGAAGA AAAGCCCTTG
GATTCTCCTC TCTTGATAGC CCAGAGTGAA GACGCCCTTT TGGAAGATAT CGACCCAACA
GCCTACGCAT CATTTTTGGC TTCAAGGCCC GACAACTTTG AGACTACGGC TATTGCAGAA
ATCATCAACT GCTCTACAAA GTTTCCTACT GTTCCTTTGC ACATTGTACA TTTGGCCACT
CATGAAGCTG TTCCGTTGAT CAGAGCAGCC AAAGCCAAGG GTTTGCCTAT CACCGCGGAA
ACATGTTTCC ATTACTTGTC GTTGTACGCC GAGTCCATTG CTAACTGCTC CACTCATTTC
AAGTGCTGTC CACCCATAAG AACTAACGAT AATAGAAAGC TTTTGTGGCG AGCACTTAGA
AATGATATCA TCACCACTGT GGTCTCAGAC CACTCGCCTT GTACTCCAGA CTTGAAGGGT
TTAGAAAAGG GAGACTTCTT CGAGGCCTGG GGAGGTATCT CTTCTGTGGG TTTTGGATTG
CCGATATTGT ACACTGAGGG AAAGAAGTTG TCCCCTCCAA TTACCTTTGC TGAGATCAAC
AAATGGTGTT CGCTCAATAC TGCCAAGCAA GTTGGTTTGA GTCACAGAAA AGGTAAGCTT
GCTGTAGGCT ATGATGCTGA CTTGTTGGTT TTTGATCCTA ACGACAAGTA CATTGTCCAG
AATCAAGACA CCTACTTCAA AAACAAGTTG ACCGCCTACG CTGGAAAGGA ATTCCTGGGC
AGAGTCATCG AAACCATTGT TGGAGGTAAT TCTGTGTATG CTTTTGGAAA AGGGCATTCT
GATGTTCCAA TGGGTAAGTT AATCTTGGAG CCAAGATTTG CATAA
 
Protein sequence
MSRAISSTQV LIGTEVVPGT VIFSIESGKI LYIESGKQLE VDDPLLELYN VLPVDHRDVS 
PLVVMPGLVD AHVHLNEPGR TEWEGFATGT KAAAAGGVTT VIDMPLNAIP PTTTIANFNL
KIDAAKDQTW VDVGFWGGLV PDNLHHLRPL ISMGVRGFKG FMIESGVDEF PAIDPSYIVK
AMERVEGQKT VLMFHAEMQP VEPLSGDIES ISLGMSASFI QRAPKPVVEL LSMDGSGNCQ
EEHENCHLPH NHAGSIDHKV LSDAQATALA KSPILAAVEP TFGKFARKAN HFDSPFFRAV
EEKPLDSPLL IAQSEDALLE DIDPTAYASF LASRPDNFET TAIAEIINCS TKFPTVPLHI
VHLATHEAVP LIRAAKAKGL PITAETCFHY LSLYAESIAN CSTHFKCCPP IRTNDNRKLL
WRALRNDIIT TVVSDHSPCT PDLKGLEKGD FFEAWGGISS VGFGLPILYT EGKKLSPPIT
FAEINKWCSL NTAKQVGLSH RKGKLAVGYD ADLLVFDPND KYIVQNQDTY FKNKLTAYAG
KEFSGRVIET IVGGNSVYAF GKGHSDVPMG KLILEPRFA
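
The record lists translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below is a minimal illustration, not the database's own method: it builds the standard codon table from the usual compact amino-acid string, derives table 12 by overriding the single CTG codon, and translates only the first 90 bases of the gene sequence above. The helper names are hypothetical. Because the gene is spliced and the record gives no intron coordinates, translating the full gene sequence this way is not expected to reproduce the full protein.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

STANDARD = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AAS)
}

# Table 12 differs from the standard code only in CTG (Ser instead of Leu).
TABLE_12 = dict(STANDARD)
TABLE_12["CTG"] = "S"


def translate(dna, table):
    """Translate an in-frame DNA string codon by codon, stopping at a stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 90 bases of the gene sequence in this record (spacing as displayed).
FIRST_90_NT = (
    "ATGTCCCGTG CTATTTCCTC CACTCAGGTA CTTATTGGCA CTGAAGTTGT TCCAGGCACC "
    "GTAATTTTTC TGATTGAATC AGGCAAAATT"
)

print("table 12:", translate(FIRST_90_NT, TABLE_12))
print("standard:", translate(FIRST_90_NT, STANDARD))
```

With table 12 the output matches the first 30 residues of the protein sequence above (MSRAISSTQV LIGTEVVPGT VIFSIESGKI); with the standard code, residue 24 comes out as L instead of S.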