Gene PICST_28241 details

Gene Information

Locus tag: PICST_28241
Symbol: SIZ1
ID: 4851017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 720870
End bp: 725801
Gene Length: 4932 bp
Protein Length: 1643 aa
Translation table:
GC content: 44%
IMG OID: 640392725
Product: hypothetical protein
Protein accession: XP_001387777
Protein GI: 126273979
COG category:
COG ID:
TIGRFAM ID:
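
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 720870
END_BP = 725801


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 4932
print(protein_length(length_bp))  # 1643
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields of the record.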


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.531382
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGCG ACATAGCGCG GTTGACTCCA ACGGAGTTCA GCGATACCAT CGCACGGCTC 
AACCAGATGA AGGTAGCAGA AATCAAGGAT ATCTTGCGAC TGCTAGACTT CAAGCTTACA
GGGCGTCGAC ACGAGGTGAT AGCGCGGATT GAAGACTACT TCAAGCATGG GATCGAGTTG
CAAGATCCAG TAAGGCTTTT GGCCGTAAGA GTCCTCATTC TCATGAGAGG GGAGGGCCGT
CAGCTTCCTA TCTATAAGGA CTTGTATCGT AAGATATCGA AGGGAGAGTT TGTTCCCGGA
CAGATGAACA ACTTTGCTAC TATGATGAGC AGTAGTGGAC GTAATGTGAC ACGTCAGAAT
GGTAATGTTA ATGCTACAAA TCAAAATTCG AATTCGTCTC GTTCACATTC AAACCATCTG
GGCTCAGATT CTAATCATAA TCAGTATGGA GTCAACGAGT CATCACCGTA TCGCCGTCAC
GCGCTTTACT TCGCCGATAA CCCCTTCTAT AAGCTTCGAA GACTCATTCA CGGTTCACCG
CAGCCCTGTT TTCCCCAATC TGGGAGAGGA ACATGTGATA TTGCATTCCT GTTGCTGGAC
AGCGAATCAA GTCTTCTAGC TCTGGACAGC ACCATTAAGG CGTACGTGCT CTGTGGCGTC
CAGGCAAACT CTCTGCCAGC TCTGACGGAT GTTGCAGTGC ATTTTCCACA ACCTTGTGAA
CTCCACGTAA ATGGATCTCA AGTTCAGCAT TTCCGGGGGA TCAAGAATAA GATAGGAACT
TCGAAACCAG CCGATATAAC TAACCATCTT CGTGCGCGTC CCGCACTCAA TAAGATCCAG
TTCGTCTACA CCAGAACGAC CGAGACCTTC TTGCTTTATG TCTATATCGT CCAGACTGTG
TCTATTCAAG AGATCCTACA GGGCATTTTG AATAGACCTA TGATCCACAA GAACTCGACT
CGTGATAGAA TACGAGCCCA GAATGACGAC GACGATATTG TAGTTTCCGA CATTTCTGTA
ACGCTCAGAG ACCCCGTGTC ATACACAAGG ATGAAGTATC CCGTGCAATC TGTCTATTGC
GACCATACAC AGTGCTTTGA TGCGTTGATT TTCCTCCAGT CTCAGGCACA GATTCCTACA
TGGAGCTGCC CTTACTGTCA ACGCAATGTG AAGGTGGACG ATCTTGCCAT ATCTGAGTAC
TTCACAGACA TATTGAACAC AGTGCTGGCA GATGTGGAAC AAGTTCTCAT ACATTCCGAT
GGCTCGTGGT CACTAGAAGG CTCTGCAACT CCTGCTCCAA GGCTGAATAC TCCGGCAAGG
TCCCCTTCCA GCTCTCCAAG AGTTGTAACA AAAATTGAAG AGGATGCCAA TTCTAGTATT
CCTCCAGATG TAGAAATAGT CCTGTTAGAT AGTGAATCCG AAAGCGAACT GGAAATTCCT
CTTTCCAACA TTCCTCGTCG TCATCACCCC CAAGCCATAT CAAGGGATCT GTCAACTTCA
CCTATACAAA GAGATGACTC TCCGCAAATT TCAGAGACTA CACGTCCTTT GGCACTTGTA
AATCCTCCAG AAGCAACGGT ACAACCTCAA CATGTTTCTA TGGATGTTAA TGACATTTCT
GGTGTCGGTA ATTCAGTTCC TCCTGTTCGA ACAGTTCCAA CAGCTAGTGA TGTTTCTGTT
GCTAGTGGGA TTGCCATCGG TAGTATAGTT ACTCCGGCCT CTACATCGGC TTCACAAGTT
CCGACTGCAA TTGCTCCGTC ACCACCACCA CCTCCTCCTC CTCCTCCTCC TCCACCACCA
CCACCACCGA GCAGTCACCT ACAAGAAACA AGTGCATCTA ATTATATAGC TACTCAACTA
CCTCTACCTT CTGTGTTATC TCAATCACAA TCTACATCTT CTGAACTGCC TCTAGTACCT
CCTCCACTTC AACCTCCTAG AGAGCATCCA CAATCCAATA TTCATAAATC ACATTCGCAT
TCAAATACTC CATCAGCTTC AAATCATCTA GTTAAGCTTG ACAGCCATCC ATTTAAAGAC
AAAAACGTCT CAGCTACTGC CTTTTCTTCC ACAATACCTT CGACTACTGA TGATTCACAC
ATGGAAATCG ACAGTGACAC TCCAATATCT TCTATCCATG CTCCGACCCA ATCATCATCT
CACTTATCTT CGCAGATACC TTTGCACTCA ACCACAAACC CCACAACTCA GCCTTTGACT
GAGGAACCTC CACATTTGTC GCTGACAACA CCTGTGAAAA GACCAACTGG TGCTAGATCA
AAACCTACGC CTAACATTTT CATCACCTCA CATTCAAGAT TGAACAATAG ACCCATTTAT
CGTCCCAAAC CAGCGACTGG TAAAAGACCT CTACAGGAGG CAAATATACG CGTGGATAAC
AATTTGCGAA TTCTAACTGC TCGCAATCCA GCCATTCTGG CTCGGAGTGA GAATGCTTCA
AGCCCAAGTA TCTCTGCTTC TAAGAAGACA AGAACTGAAA ATTCATCTTC TGAAATGTCC
TCTGAAAGGA CCACCGTCGA AGTGTCAGCT TCTTCATCCT CTGAGAAGTC AAGCTCTGAA
AGTTCATTAG AAAAGTCACA TCCACCACCA CCACCACCAC CACCTCCTCC TCCTCCTCCT
CCAGTAAGTG ATAATTCTAC TAATAATTCA AATCAGAGTG AACATACCTC TTCAACTGAA
AATTCACAAA CTCAAGATCA ACCAGCAATT GAAAATCATA GTCAGAATAA CAATTCTGCT
GTGGAACATA GTCATGCTGT TGGAATGACC ATTCTGAATG AAAGATCTTC TGAGAACCAA
AAATCAGTTG AGGAGCTTCC TGTTAACACG ATACCATCCG GAAGACCTTC TACCACAATA
GATCTGCAAG TTCGTTCTGC ACATGTAACC TCTGAGTCTT TCAAGAGCCA GCATATCTCA
CTAGCTAAAA GTATAAATGT CGTTGCGACT GCAGCAAACG TTACTTCTTC TACTTCATCC
ACTGCTCAGA AAGCTAGTGA AACCAACGGA ACAGGCAAGG AATTAACTTT GGCTGGTGAT
ACTTTCATTG GTGATAGTCA AAAGGAAGTC ACGGCCAAGA AGAATGCAAA TACTAGCGCT
GTAATTTTGT CTCTCCCATC TCCGATCAAC TCCAGTACTC TGCATGCAGA TGGTGTGGAA
GACAAGAATG GTATTCATAG TGCAGTTATT GGTAGATCAG CTGAAAGTAC TCTTCGACCC
ACTGAAAATT TGCCGATGGC CTCCAGTCCT TCAAATAATG GAACTGGAAT CAGAGCGGAC
ATTGTACAAC ATTCAAAAAC TACTTCTCCT TTATCTGCAT TGGAAGTCAA TAATGATAAT
AGTACTATTA CACCGGATTT ATGCATTGAT AAGATTAGAC CAGGTGTAAC TTCTCATTCT
GAAGCTCCTC TGGCAGACAA ACTAAGACAT CTGGTTGCAA GAATGATCGA AATTGAGCAG
CAACTTCAAG GTGTACGGAA TGTGCGCAAT CAACAGTGCG CAGACAGCCG AGAACATATT
ATTTCATACA ATGACTACTT ACAGGAATCT GCTAATGCGA TATTGCAACT TAAAGATTTG
ATGACACAAC TAGAGCAGCA GTCTCGTATT CAAACACATC AGTTGGTGGC GTTACAAGAT
GAACAATTAC AAGAAAAGGG AAATCTTGAA ATGAAAATTG CTGCCTCCCT TCTCAATTCC
CCCGAGAAGG AAAGCAGAGT TAGTTATCCA GATATTTTCC GGAAAGCTTT GGAAGATCAG
GAGAAGAATC GAAGTCAAAG AAAAAGACAC GAGTTTGATG TGCAAAGACA GGCACTTGTT
GAAAGGCATA GATTGCAATT GAAAACACTT CAGGAAAGAC AACAACAAGA AAAAGACATC
TTGTCGGAGC AGGTTCGTCG TCTTGTGGTA GAAGTTGATC TGAGAAAACT GTTGAGTGTG
GGCAATGTCT TCTCCCATGT CTTAGACTTG CCTGAATTCC AACAACTTAA AGCACACATA
AAGAGTATGA CTCCATTTGC AGCTAGTCCA TCATTGAGTA CCAACTATGG CAGTGTATCA
CTGGAATCGG GAATGCGAAG TATTTCGTAT GCAAATTTGG CACCATCACA AAAAGGCAAC
AGTTTGATAG CTCCAAATTC AAACCATGGG TATCTGATAT CTTTGAGTCC AGATATTAGA
AGAGGACATG AACAACAACA ACTTCTGCAT AGTCAGATTC AACAACAGAT TCTGCAACTT
CAGCCTGAAA GCCGGCACGC TCGTCTACTG GAAGAACAAC TTAAGCAAAG CATGAAGCGT
CATGATCCGT CATTTGGCAG ATCTGTAGTA TCTACTCGTT ATGAACAACC TTTGCATTCG
TCACAAGCTG CAACAGTTTC ACAAAGCCAA ACACCTACAC CTACACCTTC TTTGCAAAGG
TATGCTTCAA TGCCAACTGT CCAGTATTCA AAGTCTGTAT TCGATCGTGT AGCAGACAGA
ACGTCGTCAG CATCAGAATC ACCAGCTTTG AAGAAGCAGC GTATTACTGG ATTGTTGGGT
ATAGAATTGA ACCAAGATGC CATCAGTTCC AACGTCACCG TCACAGATCC AGTATCACCT
ATAAATGGCA AGATTGGACA CATGACCATT AACTCACCGG TTTCAACAAG ACATTTGTCT
ATTGGTAGCA GAAACTTCAC TGCCATTGAG AGTCCAAGAG AACCCGGAAA TAGAAGAGTT
TCCGATGGTT CTGTTCCGCC TCAGATACAT GCTGGGGTTG CACGTAATGT ACCTGAAGGT
TTCATCACAG AAGTTTTCAG ACAGAAAAAC ACTGAATCCA ATAAGTCTAC TTCAGCACCG
ACTGCTGCTA ATGTAGATCA TAAGGCAGGG ATGGAAGTGA TATCCCTCTT GTCTGACAGT
GAAGAAGAGT AG
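
The gene sequence above is printed in space-separated blocks of 10 bases, 60 per line, so it needs to be joined before any length or composition check. The sketch below is illustrative only and not part of the original record; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the listing is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence listing and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing above.
first_line = "ATGAGCAGCG ACATAGCGCG GTTGACTCCA ACGGAGTTCA GCGATACCAT CGCACGGCTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Applying `clean_sequence` to the full listing and then `gc_fraction` to the result gives a value that can be compared with the GC content field, and the cleaned length can be compared with the Gene Length field.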
 
Protein sequence
MSSDIARLTP TEFSDTIARL NQMKVAEIKD ILRLLDFKLT GRRHEVIARI EDYFKHGIEL 
QDPVRLLAVR VLILMRGEGR QLPIYKDLYR KISKGEFVPG QMNNFATMMS SSGRNVTRQN
GNVNATNQNS NSSRSHSNHL GSDSNHNQYG VNESSPYRRH ALYFADNPFY KLRRLIHGSP
QPCFPQSGRG TCDIAFLLLD SESSLLALDS TIKAYVLCGV QANSLPALTD VAVHFPQPCE
LHVNGSQVQH FRGIKNKIGT SKPADITNHL RARPALNKIQ FVYTRTTETF LLYVYIVQTV
SIQEILQGIL NRPMIHKNST RDRIRAQNDD DDIVVSDISV TLRDPVSYTR MKYPVQSVYC
DHTQCFDALI FLQSQAQIPT WSCPYCQRNV KVDDLAISEY FTDILNTVLA DVEQVLIHSD
GSWSLEGSAT PAPRLNTPAR SPSSSPRVVT KIEEDANSSI PPDVEIVLLD SESESELEIP
LSNIPRRHHP QAISRDLSTS PIQRDDSPQI SETTRPLALV NPPEATVQPQ HVSMDVNDIS
GVGNSVPPVR TVPTASDVSV ASGIAIGSIV TPASTSASQV PTAIAPSPPP PPPPPPPPPP
PPPSSHLQET SASNYIATQL PLPSVLSQSQ STSSELPLVP PPLQPPREHP QSNIHKSHSH
SNTPSASNHL VKLDSHPFKD KNVSATAFSS TIPSTTDDSH MEIDSDTPIS SIHAPTQSSS
HLSSQIPLHS TTNPTTQPLT EEPPHLSLTT PVKRPTGARS KPTPNIFITS HSRLNNRPIY
RPKPATGKRP LQEANIRVDN NLRILTARNP AILARSENAS SPSISASKKT RTENSSSEMS
SERTTVEVSA SSSSEKSSSE SSLEKSHPPP PPPPPPPPPP PVSDNSTNNS NQSEHTSSTE
NSQTQDQPAI ENHSQNNNSA VEHSHAVGMT ILNERSSENQ KSVEELPVNT IPSGRPSTTI
DLQVRSAHVT SESFKSQHIS LAKSINVVAT AANVTSSTSS TAQKASETNG TGKELTLAGD
TFIGDSQKEV TAKKNANTSA VILSLPSPIN SSTLHADGVE DKNGIHSAVI GRSAESTLRP
TENLPMASSP SNNGTGIRAD IVQHSKTTSP LSALEVNNDN STITPDLCID KIRPGVTSHS
EAPLADKLRH LVARMIEIEQ QLQGVRNVRN QQCADSREHI ISYNDYLQES ANAILQLKDL
MTQLEQQSRI QTHQLVALQD EQLQEKGNLE MKIAASLLNS PEKESRVSYP DIFRKALEDQ
EKNRSQRKRH EFDVQRQALV ERHRLQLKTL QERQQQEKDI LSEQVRRLVV EVDLRKLLSV
GNVFSHVLDL PEFQQLKAHI KSMTPFAASP SLSTNYGSVS LESGMRSISY ANLAPSQKGN
SLIAPNSNHG YLISLSPDIR RGHEQQQLLH SQIQQQILQL QPESRHARLL EEQLKQSMKR
HDPSFGRSVV STRYEQPLHS SQAATVSQSQ TPTPTPSLQR YASMPTVQYS KSVFDRVADR
TSSASESPAL KKQRITGLLG IELNQDAISS NVTVTDPVSP INGKIGHMTI NSPVSTRHLS
IGSRNFTAIE SPREPGNRRV SDGSVPPQIH AGVARNVPEG FITEVFRQKN TESNKSTSAP
TAANVDHKAG MEVISLLSDS EEE