Gene PICST_30659 details


Gene Information

Locus tag: PICST_30659
Symbol:
ID: 4838136
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 620139
End bp: 623759
Gene Length: 3621 bp
Protein Length: 1206 aa
Translation table: 12
GC content: 38%
IMG OID: 640389451
Product: predicted protein
Protein accession: XP_001383751
Protein GI: 150864780
COG category:
COG ID:
TIGRFAM ID:
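
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 620139
end_bp = 623759
gene_length_bp = 3621
protein_length_aa = 1206

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should contain a whole number of codons.
codon_count, leftover = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
expected_protein_length = codon_count - 1

print(span_bp, codon_count, leftover, expected_protein_length)
```

Under these assumptions the span, gene length, and protein length reported in the record agree with one another.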


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.498733
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGTTC AAAGCGACCG TGAGTTGCAA ATTGGATCAA AAGATCCAGA AGATTCGACT 
CGACGAGTCT CGTTACCCCC AGCATCTTAT GACGAAACCA CAAAGACTTT ATCCATACTG
CTCTTCCAAA AGATCAAACC GATTTCTATA GAGCTCTCAA ATATAGGATT GCTAGATACA
CCAGCTTTTA ATAGCCAATT CCGTCGATTG GCCGAGGTGT TGGAATCAGA ACATGAAGTT
CTAGACAGTC ATTATAAAGA AAATACTAAG AGTTCCCAAA ATTTAATACC TTATCATCTT
TCGCCAAACT TGGCTGATTA CATATTTTTC CCATTATCGA ATATCCTCAA GCAGCCAGCA
ATTGATGATC GTATTACTAC ATCAATTTTA AAAATAATAG GCTTTCTAGT CGAACATATA
TGGAGCTACA AGTTCGACGA GAAGTTTGTG GATGAATTGC TATCTATTGT GTTGTTCCTT
TCTGGTGCCA ATTCTAGCCC TTCTAGTGGA TCGAAACCAA TTGAGTTTAG AGCAGCAGCT
GTAGATTGCC TTGCTATGAT ATTGAAATCC GTACCAAACG ACTACTTTGT TGAAACAGCT
AATGTCAAGA GAATGTCATT GTTAGGCAAT TCAATTAGCT TCTTCTTGGA TAACATCACT
TGGGCCACAA ATCCAAGGTC TCAGGAAGAA AACAACTTGC TTATAAATAA TCTTGATATT
CTTGTTAATA CGGCTACCTT TAAGATGTCT TCAGACCAAT TGGCCCATGT TCTTCCTGGG
ATGGTGTCGA ACTTGACTAA CTTCTTTACC CAGTCTAAGA ATTTGCACTA TACCGTGATA
GTAAAAGTGG TAGAAGCCTT GAAAGTGATC ATAATCAAAG TGTTCAACGA CAAGACCCTT
AATGTTCTGT ACAAGGATGT TCGACAAATA GACAATTTGG AAGCAGTAAA TGAACTATGG
AGCCAAGAAA CCGATGAAGA TGTTCCTACT AACAGGATAA AGATCGAGAT ACCCTCCAAA
ACAGGAATTA GAAACACAGC ATGGCTCCAA GCAACTTCAA AACAACTCAA ACTTTCACTA
ATTGTTTTTT TCAAGACTCT CTTGGTGACC TCTAGCAACA ATAAGCTGAA AGTGCAAACC
AAGAACTTGT TAGCAGAGGC GATCTACAAA TTCTCTGATG CTATAATGGA AAACTCATTT
TTGTCTCTTT TCAATGAAGT CGTGCCACTT TCTCTCGATA TGCTTGCTAC AACAATATCT
CTTGTCGCGA CGAATGTTGA AGAAGAGAAT ACCAGCATAG CCAATGTTGT GAAACAAGTG
GGTATTTTGA TTTCAGCTAA TAGAGAAAAC TGTTCACTTT TTTTCAATCA GATCAAGTCT
AAGTTAGATG ACTTGATTAC GAACAAACTT ACTTCCGTGA TTCTCTCTGT TGATGATGAG
AAGATCAACT CATATTTAGT TTGTATCAAG CTTCATTTGC TGCTTTTGAA TAATCTTTCA
AAATCTGCTT ATAATGTTCA TGAAGCTGTA CTGAGTAACA AGAAGAATAT TATGGTTCTT
CTCAGCAAGA GTTTGAAAGA AAGTTATATT CAGAATAGTC AACAAAAAGA CAACACTAAC
GATCTCTTGA AGATGCTCAG TGGAAGTCAA ACAGAGGAAG GATTTTCTAA CAAACTAGAT
AACGTAGAAT TACCTCCAAA CATCGATTCC AAAAAGATTA CAAAAATACG ACCGGATCAA
AATAGAGAAA ATGTTTCCGA GCTGTACTCA TCCAATTTGA TGCTTTTGTC AAACAAATGG
AGTGAAGATT CTGTGACAAG AAGCGAACCT CAATACTACT TCTCTACTCT ATATTCCAAC
ACAATCGAAG AAAAGTTTGC TGGGCTTATT CAATTTATAG CTGCTCTAGC AGATGATGAT
GAAAATGAAC AGACAGGACA GTTAGAATTG GTGGAGTCTC TTTTTGATGA GGAGAGCGAA
AAGAACACTT TGGACAGAGG AATCGCATTA TGGGTAGCCA ACAACTTTTT CATTAACCAG
AAGTCGTCGA GGAATAAGAT CAATATGAAC GATTATATTA TTCTTGAAAA TGACTCCGAT
GAAGAAGATG AGTTGGAAGA AATGTCGTAT CTATTGGTCA GCAAGTCTCA AGACTTGATT
AATGATACCT CTTTTGAACT TGCCAATGCT GGAATTTCTG GGAACAGTTT GAAAGTTAAC
GAAATGTCGT ACTCGATAGC CTTAGATACC ATTGGAATTC TTTCAAGCAA ATTACCTTTG
GAAGAGTTTA GATCGAACTT CCTCAGAGAC TTCTTGTATT CTCTACTAGA GGCATTGACA
TTTCGATCAA ACACACTCAT TCAATCGCAT GCTCAGACAG CTGTCGGGTT TATTCTCAAG
AATTACTATA ACGACTCGTT AGAGTCGTTG ATACTTGATA ACCTGGACTA CTTGATAGAT
AACATCAGCT TAAGACTAAC TGTTCCAAGC AATTTTGTAC CAACTCTTCC CGGTATCTTA
TTGATCATTA TAAAAGTAGC TGGTCAGAGC CTCCTTGAGA TGAACCAACT CAATGACGTC
TTGTCAGAGA TGTTTGTAAT TATCGATTCG TACCACGGAT ACTCGATTAT TGTAGAAGGT
TTGTTTATGG TTTTTGAAGA AGTCACCAAG CAAGTGAAAG AAAGATACCT CTCACAAAAC
AGTCTTCAAA TAGAATTAGA TCCCGACCTC AATACTTCAA GCTATAAACC ATGGGGATTG
ACTAGTGTTA AGCAGGTGTT GAAATTATTG AGCGATTCAG CCAAATTGAC TGAACCATTT
GAGTCATATG ATTCGACAAA GGAATACTTC AAAAGAAAAC CAGATACTCC TTTTTCAGAG
CAAGCTGCAG ATTCTGACGA TGATGATGAC GATGAACAAG AACCTGACGT TGCAGAAGAA
GAAAAATGGC CATCACCTGT TCCTGAAAAT ACCTACTTTC TTGTGCAAAG GATATTCAAT
TACGGATTTG TAGTGTTGTC TCAGAAATCA ACCAGCATGA AGTTGCAGGT GTTGAAGACT
TTGAAACAGA CTTATCCGAT TTTGACTACC AACTACAAGT TGGTGTTGGT GATCTTAACT
AAGAACTGGC CCATTCTCTT GACGTTGATC AGTGGGTCCA GCAGCTTGTC TGTCTTCCAA
GATGTTCATG AAGGTGTCCT ACCAGAACTG GAAGCTCTAA TTGTTCCAGC TTTGGAGTTT
GTTATAGAGA TCGTGAAGGA AGATGGTGAT AGAGAGAATT TCCTAGGAAA TAAATTCATA
GAGTCCTGGG AGTTTCTCAT CAACCATTCG CCCATTTTCG GCCGTCTTCG CAAGAGAGAT
AGTTTCAGCT TGAACAGTAA GCTGAAACAG ATAAGTAGGA TCGAACAACA ATTAACTACG
ACGCGGTTGA ATCCAAAAGT CAATGATCTT CTTGTGACTT ACTTAATCAC AGGCCTCAAC
TGCTATGAGA GGACAGTATC TGACTTGGTG CGTTTGGATA TTGTTAGAGT GTGTTATAGA
ATGGGCATAC CATCCGACAT GAAGTTGAGC AGAGACGTCA GGAATACGTT GTGGATAGTG
AAGAACGAGG ACGACAACTA G
 
Protein sequence
MSVQSDRELQ IGSKDPEDST RRVSLPPASY DETTKTLSIS LFQKIKPISI ELSNIGLLDT 
PAFNSQFRRL AEVLESEHEV LDSHYKENTK SSQNLIPYHL SPNLADYIFF PLSNILKQPA
IDDRITTSIL KIIGFLVEHI WSYKFDEKFV DELLSIVLFL SGANSSPSSG SKPIEFRAAA
VDCLAMILKS VPNDYFVETA NVKRMSLLGN SISFFLDNIT WATNPRSQEE NNLLINNLDI
LVNTATFKMS SDQLAHVLPG MVSNLTNFFT QSKNLHYTVI VKVVEALKVI IIKVFNDKTL
NVSYKDVRQI DNLEAVNELW SQETDEDVPT NRIKIEIPSK TGIRNTAWLQ ATSKQLKLSL
IVFFKTLLVT SSNNKSKVQT KNLLAEAIYK FSDAIMENSF LSLFNEVVPL SLDMLATTIS
LVATNVEEEN TSIANVVKQV GILISANREN CSLFFNQIKS KLDDLITNKL TSVILSVDDE
KINSYLVCIK LHLSLLNNLS KSAYNVHEAV SSNKKNIMVL LSKSLKESYI QNSQQKDNTN
DLLKMLSGSQ TEEGFSNKLD NVELPPNIDS KKITKIRPDQ NRENVSESYS SNLMLLSNKW
SEDSVTRSEP QYYFSTLYSN TIEEKFAGLI QFIAALADDD ENEQTGQLEL VESLFDEESE
KNTLDRGIAL WVANNFFINQ KSSRNKINMN DYIILENDSD EEDELEEMSY LLVSKSQDLI
NDTSFELANA GISGNSLKVN EMSYSIALDT IGILSSKLPL EEFRSNFLRD FLYSLLEALT
FRSNTLIQSH AQTAVGFILK NYYNDSLESL ILDNSDYLID NISLRLTVPS NFVPTLPGIL
LIIIKVAGQS LLEMNQLNDV LSEMFVIIDS YHGYSIIVEG LFMVFEEVTK QVKERYLSQN
SLQIELDPDL NTSSYKPWGL TSVKQVLKLL SDSAKLTEPF ESYDSTKEYF KRKPDTPFSE
QAADSDDDDD DEQEPDVAEE EKWPSPVPEN TYFLVQRIFN YGFVVLSQKS TSMKLQVLKT
LKQTYPILTT NYKLVLVILT KNWPILLTLI SGSSSLSVFQ DVHEGVLPES EALIVPALEF
VIEIVKEDGD RENFLGNKFI ESWEFLINHS PIFGRLRKRD SFSLNSKSKQ ISRIEQQLTT
TRLNPKVNDL LVTYLITGLN CYERTVSDLV RLDIVRVCYR MGIPSDMKLS RDVRNTLWIV
KNEDDN
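
The record lists translation table 12 (the NCBI Alternative Yeast Nuclear Code), in which the codon CTG encodes serine instead of leucine. The sketch below illustrates the effect on the first 120 bases of the gene sequence above. The helper names `build_table` and `translate` are hypothetical, and the codon table is written out by hand in NCBI base order (T, C, A, G) rather than taken from any library.

```python
from itertools import product

# NCBI codon ordering: each position cycles through T, C, A, G.
BASES = "TCAG"
# Amino acids of the standard code (table 1) in that ordering; '*' marks stop codons.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(aas):
    """Map each of the 64 codons to its amino acid letter."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, aas))


STANDARD = build_table(STANDARD_AAS)
# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna, table):
    """Translate in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First two lines (120 bases) of the gene sequence in this record.
first_120 = (
    "ATGTCCGTTC AAAGCGACCG TGAGTTGCAA ATTGGATCAA AAGATCCAGA AGATTCGACT "
    "CGACGAGTCT CGTTACCCCC AGCATCTTAT GACGAAACCA CAAAGACTTT ATCCATACTG"
)

protein_12 = translate(first_120, TABLE_12)
protein_std = translate(first_120, STANDARD)
print(protein_12)
print(protein_std)
```

With table 12 the translated fragment matches the first 40 residues of the protein sequence listed above; with the standard code the final residue of the fragment would be L rather than S, because codon 40 is CTG.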