Gene PICST_32169 details


Gene Information

Locus tag: PICST_32169
Symbol:
ID: 4839116
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 850294
End bp: 853455
Gene Length: 3162 bp
Protein Length: 1053 aa
Translation table: 12
GC content: 40%
IMG OID: 640390431
Product: predicted protein
Protein accession: XP_001384830
Protein GI: 150865563
COG category:
COG ID:
TIGRFAM ID:
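
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, that the gene is unspliced (as the record states), and that the reported gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values copied from the record above.
length = gene_length_bp(850294, 853455)
protein = expected_protein_length(length)
print(length, protein)  # 3162 1053
```

Both results agree with the Gene Length (3162 bp) and Protein Length (1053 aa) fields.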


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.390072
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACCGT CAGGAGATTC CAATAGTAAT GGAGTTGTCT CTGGTGGTTT AGCTTTATCG 
CTGGGTGGAC AGAATTCGCT ACAATTTCTG ATTTCGCACA TCCTCGGCAC ATCGGCTCGA
GCACCTCATC ATCTCTCAGT GCAAGACAAC TATGTAGCCT ATGCTGCCAG TGGAGGTGTT
GTCGTCCGTC AGTTGGACTT GGAAAATAAT AACGCCGTTA TCTCGGAGCG GTTCTTTTGC
GCCAACTCCA GTTCAGGCAA TGAAAACACT GCAAACAGCA TCCTGCCATC TGGTCCAGAT
GCATATCTCA ACATGGCACT TGAAATGGAA AGTAGTCACA ATTTGCATCA AAATCTCCAT
CAGAGTTCCA AGGATGTTCA GCCTGTAAGA GATAGATATG GCTATTCTAT CGCAACTGAA
CCTATCGTTG TTGGAGGTAG CAACAATATC GGAAATATTG CAGAATTAAC CCAAAGTGTT
CACGACATCG ACTTATCTTC GCCATCCAAG TTGAAAGACA GAGTCAGGTC CATAAACTGT
ATGTGCATAT CACCTAACAA ACGTTTGCTA GCCATCGGTG AAACGGGTTA TCAGCCACGG
ATACTTCTCT TCTCATTGGC ACCAGACCTG AGCTCAAATC CTGTTGCACT TATATATGAA
CACTCTTTTG GTATTAAATC GCTCTGTTTT CTGGCAGATC TGCGCTATCT TTGCTCTCTT
GGTCTTGTGA ATGATGGCTG TATAAATGTC TGGAGGATTT CTAACTCTGA TGTGCAGTTA
GCAGCTAACA ATAGGTGCTC TTCTGTGGTG AACAGATTGT TCTGGCATGA GGACTACATC
ATCACCTTGG GATTACGTTT CATAAAAGTT TGGAGGTTTT CGAGTAAAGA AGACAACGAC
AGAATCCTGG ACAAACCTCT AGCACTAAAA GGTAAAAGTG TCCTTTTGGG ATCGCTAATA
AGCTCGAATT TCACAGACAT ATCTGCCTTG AACAACGACG AATTGCTCAT AATAACAAAC
AACAACCAAT TGCTCTTGCT CAAGTTAAAT AGTGAACTCA AACTCATTTC ATTAGAAACG
CCCCCGTTTG ATTTCGATAC CTTATTGGTA GATTATGAGC TTGAAAAGAT ATGGTTTGGA
TCCAATTCCA AGCTAGAATC GTATTCTATA AACGATTTGA AACCTAGCTC TGTTTCAACT
CCTCTGACGC CTTCTTCCAG AGTAAATTCT GTATTTGGAG CACAAACTAA CGAAAACGTG
CGGACAGTTC CTATTCTTAG ACTCTTCAAT CTCAGTTCCA ACTACATCAT ATACCTTTCA
CATCGTGAAG AAATTGTATT GTACAATAAG TTCAAGTGTG ACATCGAAAG TAGAGTAGCC
AGCTCCTTGA TGAGCGAGCT AGCAGGTTTC AAGAACTGCC ATCTGGGAGA CCTATTGGTG
TATTCGCATT CTGGTATGAT TAAGAGGGTT ACAAAGGACT ATGAATTAGA GACCATTTTG
AAGTTTAACT TGCCGTCAAA CGAACTCATA TCAAATTCGC TTATGGCCGT AGACTCCAAC
AATGATTCAC TTTTGTTGGG AGACAAGTAT GGAACATTGT ATGTTGTCAA GATAACAGAA
GAAAAAGCAT CTGAAATTGT ATATCAAATC AAAGCACATT CATCTTCAAT TAACGACATA
GTATACTACG AATTTGGAGA CTTTCAGTTG ATAACAAGCA TAGCAAGAGA TCGTATGATA
CAATTCTTCT ACAAAAAACC AGGTACCAAC TGGGACATCT TGCAAACTAT ACCTATCCAC
AATGGCAATT TATTGAAAAT TCAGTATCAC AACAGCAGGA TATATGTCTG CTCATCTGAT
AGAACTATCT CTATTCACAA ACTTGAAGTT GTTGAGAGTG AATTGAGGGT TTTCCAAGAG
AAGATATTAT CTATGAAATG TAGTCCGATC ACTTTGAAAA TTGTAGACGA CGATTTGATC
GTGTCTACAA ACGACAAAAC TTTGTCAATA TATCTGGTAT CTCAGGGATT TGAGCTATCA
CGGACTTTAA AGCTTGTTAA CGGTAAAAAT AACGAGAGCT TACTTGTGGA GAACATCATT
GTATTTAAGA ATTTGCTCAT AACTTCCTCG ACAGACAAGT CTCTCAGAGT ATTCAATTAT
CATACTGGCA GACCAATGAG TGTAGCCTGG GGTCACCTGG ATGTGATATT AAGCTTGGAA
TTAAGTTCCA ATGAAGACTT GATTTCCATT GGGAAGGACG GTTGCTTGTT CACTTGGAAG
ATTAATGAAT CGACAGCAAC AAAGAATAAC ACATATAAGG AAGATACAAC ATATAAGGAG
GAAAGTAATG TGATTCCTAT GTATGCCAAT GTAACCAGAA AGATTCTTCC TATTTCTCCC
ATAAAGATCA ATGCACCCAA GATCGAAACA TGCACAAAAG AGGCGCCATC TCCACGTAAT
TCTATATCTC CCAGACTTAC AAACGCAACT TTGAAGAGAA TCGAAGCCCG TAGAGCTAGT
TCTCAGAGTC CCACTAGAGA TTCGGGTAGA TCAAAATCGG TTTCTACTGC GAAGCCATCC
TTGTCTATCA AACCTTTAGA AAAGGCACAC ACTATAAGCA CACTTTCCTC TACCGCGGGA
CCAGGACTTA CTTCTCCAAG AAGACCATTG TCTCCCATTA GACGCAGTCC ATCCAGAAAT
CTGTTAGATC ATTCACCAGT AAGGAGTCTG TTGGATCATT CACCCATGAA GCTTTCAAAG
CCACATATCT TATTTAGTCA TGATGAAAAA AGAACCGCAG ACCAACCTTT CGTGGATACA
GCTCTAGCCC AATTGCAGTT TATTGATTCT AAACTTCAAA GAGAAGTTAT AAGCAACAAC
GATAAGGCCA AATTGTTAAC CAAACTCGAC TCTATTTTTA GACAATTAGG TGGAGATAAG
GAGCTGACTA AAAGTAACGT TCGAGACAAA AGGACTGAAA TTAGCCAGAA TAGAGAAGCA
GATGAAAGAG AATTGTTGGA GTCATACAGC GATAAGCTTC TTCAACTTAT GGAATCTAAA
CTTGAGTCGA AGTATTCCAA GCAAGTTCCT CCACTTTTCA TTGGAGAAAA CCACTCTTCG
ATTTCAACAA CTTCCCAAGA CTCGCTGGAG GACATAGATT AA
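
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any computation. A minimal sketch of that cleanup, with a GC-content calculation that can be compared against the 40% listed in the record when run on the full sequence. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first row of the sequence is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above; paste all rows to check the whole gene.
first_row = "ATGCCACCGT CAGGAGATTC CAATAGTAAT GGAGTTGTCT CTGGTGGTTT AGCTTTATCG"
seq = clean_sequence(first_row)
print(len(seq), gc_content(seq))
```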
 
Protein sequence
MPPSGDSNSN GVVSGGLALS SGGQNSLQFS ISHILGTSAR APHHLSVQDN YVAYAASGGV 
VVRQLDLENN NAVISERFFC ANSSSGNENT ANSISPSGPD AYLNMALEME SSHNLHQNLH
QSSKDVQPVR DRYGYSIATE PIVVGGSNNI GNIAELTQSV HDIDLSSPSK LKDRVRSINC
MCISPNKRLL AIGETGYQPR ILLFSLAPDS SSNPVALIYE HSFGIKSLCF SADSRYLCSL
GLVNDGCINV WRISNSDVQL AANNRCSSVV NRLFWHEDYI ITLGLRFIKV WRFSSKEDND
RISDKPLALK GKSVLLGSLI SSNFTDISAL NNDELLIITN NNQLLLLKLN SELKLISLET
PPFDFDTLLV DYELEKIWFG SNSKLESYSI NDLKPSSVST PSTPSSRVNS VFGAQTNENV
RTVPILRLFN LSSNYIIYLS HREEIVLYNK FKCDIESRVA SSLMSELAGF KNCHSGDLLV
YSHSGMIKRV TKDYELETIL KFNLPSNELI SNSLMAVDSN NDSLLLGDKY GTLYVVKITE
EKASEIVYQI KAHSSSINDI VYYEFGDFQL ITSIARDRMI QFFYKKPGTN WDILQTIPIH
NGNLLKIQYH NSRIYVCSSD RTISIHKLEV VESELRVFQE KILSMKCSPI TLKIVDDDLI
VSTNDKTLSI YSVSQGFELS RTLKLVNGKN NESLLVENII VFKNLLITSS TDKSLRVFNY
HTGRPMSVAW GHSDVILSLE LSSNEDLISI GKDGCLFTWK INESTATKNN TYKEDTTYKE
ESNVIPMYAN VTRKILPISP IKINAPKIET CTKEAPSPRN SISPRLTNAT LKRIEARRAS
SQSPTRDSGR SKSVSTAKPS LSIKPLEKAH TISTLSSTAG PGLTSPRRPL SPIRRSPSRN
SLDHSPVRSS LDHSPMKLSK PHILFSHDEK RTADQPFVDT ALAQLQFIDS KLQREVISNN
DKAKLLTKLD SIFRQLGGDK ESTKSNVRDK RTEISQNREA DERELLESYS DKLLQLMESK
LESKYSKQVP PLFIGENHSS ISTTSQDSSE DID
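
The record specifies translation table 12, the alternative yeast nuclear code, in which CTG encodes serine rather than leucine. The sketch below shows the effect on the first two rows of the gene sequence. It builds the standard code from the usual 64-letter amino acid string and applies the single CTG reassignment. This is a plain-Python illustration with hypothetical function names, not the pipeline that produced the protein sequence above.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_codon_table(ctg_as_serine: bool) -> dict:
    """Standard codon table, optionally with the table 12 change CTG -> S."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(dna: str, table: dict) -> str:
    """Translate in frame 1 until the first stop codon or the end."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_two_rows = (
    "ATGCCACCGT CAGGAGATTC CAATAGTAAT GGAGTTGTCT CTGGTGGTTT AGCTTTATCG "
    "CTGGGTGGAC AGAATTCGCT ACAATTTCTG ATTTCGCACA TCCTCGGCAC ATCGGCTCGA"
)
with_table_12 = translate(first_two_rows, build_codon_table(True))
with_standard = translate(first_two_rows, build_codon_table(False))
print(with_table_12)  # MPPSGDSNSNGVVSGGLALSSGGQNSLQFSISHILGTSAR
print(with_standard)  # MPPSGDSNSNGVVSGGLALSLGGQNSLQFLISHILGTSAR
```

The table 12 result matches the start of the protein sequence listed above. The standard code gives L instead of S at the two in-frame CTG codons.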