Gene PICST_82620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_82620 
Symbol 
ID4838111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp530998 
End bp534058 
Gene Length3061 bp 
Protein Length892 aa 
Translation table12 
GC content42% 
IMG OID640389426 
Productpredicted protein 
Protein accessionXP_001383376 
Protein GI150864528 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.338646 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.971769 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CAGTTCAGAA TAGTTGGTTG AAATATCCAG AACTTCGTGA AACAGACACA ATTTTTGGAG 
TTTACCCACC TTCGTTTGAA CCCTCGAAAA AGTAGAAACT CACGACGTTT CCCCAGTCTT
TTTTGGTTGC TCTATCTCGA TTTTGATAGA GTAAATTTTT CATAAAAACT CCATCCCATC
ATCAACTTTC AACAACTCAT CTTCATTCAC AGTATTTAAC TCAATTTCGT GACTTTCGTT
CAGTATCACA TCTCAATAAG AATACCAGTA TGCGGAAGTA TTTTTCCATG GTGAATGAAC
GAGTGGTCAA AACGGCCTTT TTGCTGGTGT TGCTCATCAA CGTCTTGTAC TTTGTTAGGG
TGTATTCTTC TGAAGAAATC CAGCTCACAC GTGCTGCTCT TGCTCATGCT GCTTCTACAG
TCTTTTCCAC TGACGACCAG ATCGAGTTCC CCAAGGACGT TCCGTTGGAA CAGATTACAG
ATTCCAAGCA GAGAGTCTCG TATCTTTTGC ACGAAGTAGA CGAGAACAAA GATGGCAATT
ACTGGTTGGC CCATACGAAC GTGAAACACT CGAGTTTGAA AATCAAGCCC AGCGACTTTT
TGCCATCTGA TTCTCCTAAT TGGCTGAATC GCCCCGAGCT TTTCTTTGAT CCCAGGTTCA
CCCTTTCAAT CTACTTAGAT GAGTTAAAGC ACCAGTTGTT GAGTAGAAAC CCCAAAAATG
AGAAGTCTCT CGACTCTGTG ATCCTGATGC CATTTGCATG GTCGGACTGG GTCGATTTGA
CAATGCTTAA CGAAGAGTTG TCGAAACCTG TTGATGAACG CAGAGACTGT GAATGGCTTC
AATCTCAAGT AAATAAGCCT ACAAAGAGAC CATTTTTCTG TGTAAACCTT AAGGATGCCA
CGGATGAAGA AATCGAAGAA ACAGGCATTT CCAAAGAGAG CTTACCTGGT TTCTTGGTAA
AGACTTCGCC CATGAACAAA GCTCCTCATA AACAGGTTAT GATGCAAGGA AAGGCACACT
TGTATGCCTA TCAACAGAAC CCATTCACCA TTATTTTCTT AACCAAAAGT GGTACATATG
AGGCCCAAAT TAGGGACAAT CGTAAGAGAC TTGTACACAC GGACATGTTC GAAAACTATT
TGAGCAGAAG AGGAATCAAT CCAAATCATT TGGAAGACTA CGGCAATGAA ATTGTATTGA
ATCCCCATGA TGAATTCTCA AGTTTGTTGA CCACTGTTGC TCCTAGACCT TTGAACTGGG
ATGACGATAT TCACAAGATG AATGCCATCA CACAGCAAAC GGAAGTCAAT GCTTCTCGTG
AATTGCATTT GAATCCCGAA TCGTTCAACT ACAAGCACGA GGCTGTGCTC AAGCAGTTGC
ACGATTACCA GTATCGTTTA GACAAGCTTG AAGAAGCATT TTCCAACGAG TTACGTTACA
GCCCTGAAGT TTTCAACGAG TTTTCGCTTG ATCGTCATGA GTTCAACCAC TACTCTGGTC
TTAAAACTGC ATCGGAAACT CCAATTCAAG AGGAGCCAAC ATACTATAAG TTAGCTACTT
TGCTCAAGAA GAACGGTAAT GTAGATGCCG GCTGGCACTA TGAATGGAGA TTCTTCAACG
GTGCCTTAAG ATTTATCAAA GACGATACTT GGACCATGAA CCAGTTGGAA ATCAGAGAAC
AAATCATCTT AGATAGATTG TTGCGTAACT GGTTCAGATT TGCAGAACAA AAGGGCATCA
TCTCGTGGAT AGCCCACGGA CCTCTCTTAT CATGGTATTG GGACGGTCTC ATGTTCCCAT
TCGACATCGA CATCGATATT CAGATGCCTT CAGCTGAATT GAACAGATTA TCCAAGCTCT
ATAACATGAC GCTTGTAGTC GAAGACATCG ATGAGGGCTA TGGTAAATAC TTGATAGACT
GTTCTACCTT CATTCACCAC CGTGACATGG CATACAAGGA TAACCACATC GATGCTCGTT
TTATAGATGT TGACACGGGT ACTTATATCG ACATCACTGG TGTGGGTAAG AACAACGAGA
ACCCTCCTCC GGAGTACGAC AGCTATATCA GAAGCAAGAA TGCTAAAGGA GAATCTGTAG
AGTTGTACAT GGACAGAAGA AAGCACTGGT TGAACTTTGA GAAGATCAGC CCCCTCAGAT
ACACACTGAT CAGTGGGGTT CCAGTGTATA TTCCAAATGA TGTGATGTCC ATGCTCAACA
CTGAGTATTC CCATGGTACT TCGGCTTTCC ACTTTGATGG CTACTACTAT GTTCCATGTC
TTAGATTATG GATTCAACAA GACAGAGTAG CCAAGATCTT CAACGAAAAT GACTTCAAGG
TCGGCGACAA AATCGACAGA GACAAGCTTC TTAACTTGGT GGTAAATATG AACGATAATG
ACAAAGCCAG ACTACTTGAA AGCGACGAAG AGTTACTCAT GGAGTATTAC TTGACTCATA
AGCACACTGG CTTGCACCAA TTAGAGAAGA AGTTTCTCTT GGATGCTGGT TTGCAGCATT
CCATTATAGA TTTACACGAT AACTATGATT ACCATATGTT GACGTCCAAC TTTAAGATGG
GCAAGCCGTT GAGAAAGTCT TTGTTTGATT TCGAGTACTT TGAGAGATTT GAACACGATG
AGTACGAACC AGCAAAGGGA GATGAGCCTC CTAAAAAAAT CATTAAGCCA AAGGTCAAAT
CACAGAGTTT GGCTGCTGGT AAGTTGGAGC CTATAAAGGT TGTTCCTAAG CCAGATCCTA
TAGCTGAACT CTTTAAGGAT CAAAAGCCGG CAACTCCAAA GGAACCCGAA CAACCAAAGG
TAGCTGAACA ACCAAAGGTA GCTGAACAAC CAAAGGAACA ACCAAAGGAG CAATCAAAGG
AACAGCCAAA GGGTAATGAA GAACAAAAGG TACCGCCTAA AGAAGAAAAT AAAGAACAAC
AGGTATAATG AGTCCATACA CCTGCTAATA ATCAAGGTCA ATTAGAACAT TTCGCATTTA
CGTCATTTTG CTATTCGTAG CATCCATGTT TCACTGTATA TTAGAATAAA CGTTAGAATA
T
 
Protein sequence
MRKYFSMVNE RVVKTAFLSV LLINVLYFVR VYSSEEIQLT RAALAHAAST VFSTDDQIEF 
PKDVPLEQIT DSKQRVSYLL HEVDENKDGN YWLAHTNVKH SSLKIKPSDF LPSDSPNWSN
RPELFFDPRF TLSIYLDELK HQLLSRNPKN EKSLDSVISM PFAWSDWVDL TMLNEELSKP
VDERRDCEWL QSQVNKPTKR PFFCVNLKDA TDEEIEETGI SKESLPGFLV KTSPMNKAPH
KQVMMQGKAH LYAYQQNPFT IIFLTKSGTY EAQIRDNRKR LVHTDMFENY LSRRGINPNH
LEDYGNEIVL NPHDEFSSLL TTVAPRPLNW DDDIHKMNAI TQQTEVNASR ELHLNPESFN
YKHEAVLKQL HDYQYRLDKL EEAFSNELRY SPEVFNEFSL DRHEFNHYSG LKTASETPIQ
EEPTYYKLAT LLKKNGNVDA GWHYEWRFFN GALRFIKDDT WTMNQLEIRE QIILDRLLRN
WFRFAEQKGI ISWIAHGPLL SWYWDGLMFP FDIDIDIQMP SAELNRLSKL YNMTLVVEDI
DEGYGKYLID CSTFIHHRDM AYKDNHIDAR FIDVDTGTYI DITGVGKNNE NPPPEYDSYI
RSKNAKGESV ELYMDRRKHW LNFEKISPLR YTSISGVPVY IPNDVMSMLN TEYSHGTSAF
HFDGYYYVPC LRLWIQQDRV AKIFNENDFK VGDKIDRDKL LNLVVNMNDN DKARLLESDE
ELLMEYYLTH KHTGLHQLEK KFLLDAGLQH SIIDLHDNYD YHMLTSNFKM GKPLRKSLFD
FEYFERFEHD EYEPAKGDEP PKKIIKPKVK SQSLAAGKLE PIKVVPKPDP IAELFKDQKP
ATPKEPEQPK VAEQPKVAEQ PKEQPKEQSK EQPKGNEEQK VPPKEENKEQ QV