Gene PICST_59604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_59604 
Symbol 
ID4838582 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp172278 
End bp175460 
Gene Length3183 bp 
Protein Length1052 aa 
Translation table12 
GC content41% 
IMG OID640389897 
Productpredicted protein 
Protein accessionXP_001384332 
Protein GI150865209 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5656] Importin, protein involved in nuclear import 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.487702 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.194041 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCCA ACGTGTTGTT GGAGTGTTTC TCTGCGACGC TCCAGGCAAA CCAGGATGTC 
CGGATACAAG CGGAGGTCAA GCTTCGTGAA TTGAGTGCTA CACCTGGTTT TTTGGGTGCA
TGTTTAGATA TCATAGCTTC TAACGGTAGC TCGATCAATT CAGGAGTCCG TAAGGCTGTA
GCTGTGTACT TCAAAAACCG TGTAGTCAAG TTTTGGACCT CGGCTGATTC CAAGATTGAT
GCTGGTGAAA AGCCGGTGAT TAAAGACCGG ATCTTGCCAG TCATTGTAGT GTCTGACTAT
ATCACCAAAC AACAATTGAT ACCTGTCTTG AGAGTATTGA TTTCTCATGA GTTTCCCAAC
TGGTCTGGTC TTTTGGAACT GACGGGTTCG CTTTTGCAGC AAGTTCCTAC TGGTTCAAAC
GTGAAGGATG AGGACTTTTC ACAGTTGTAC ACTGGCTTAT TGTGTTTTGC TGAAATATCA
CGAAAGTTCA GATGGACCGA TAACAACGAC AGAAAAGCCG AGCTTTATCC TATAATCGAA
CTGGCTTTCC CTCACTTATT AAATATTGGC AATACCATCG TGGCTTCTGC CCAGAATATC
ACGGAGTTCC AGGCGGAAAT CGTAAAGTTG ATTCTTAAGA TCTACAAATT CGTCACTTAC
TACGACTTGC CTGCTCCCTT GCAAACCTCA GAAGCCGTCG AGCAATGGGG TCAATTCCAT
GAATCCGTCA TCAATATGCC GGTGCCACTG TATATCCGCG ACTCCAACCT TAGCGAACAG
GAAAAGTCCT TTCTCCAGTT CTCCAAGTGC TACAAATGGT CTATCGCTAA TATGTACCGT
TTATTTGTTC GCTATGCATC GGCTAGCTTG GGTAAAAAAT TCAAATATAC TGAATTTCAC
GAGTTGTACT TGAACCAACT CGTGCCTCCT TTGTTATCCT CATACCTTTC TATTATTGAA
CAGTGGTGCC AGGGTAAGAA GTGGTTGAGT TCTTCAGCCC TTTACTTTCT CTTAGAGTAC
TTGAGTCATT GCATTACCCA AAAGTCCACT TGGCAAATCA TCAAGCCCTT CTTCCAGAAT
CTCGTCTCAT ACTTAATCTA CCCATTGTTG TGCCCTAGCG ACAGCATCTT GGAGATATTC
GAATTGGATC CCCAGGAGTA TATCCATGTA GCCTTTGATA TATCCGAAGA GTTCAACAGC
CCCGATGTTG CTGCTTTGGG TTTGTTGGTG ACTCTTGTAC ACAAGAAAAA ATCTACTACG
TTGGAAACTA TAGTCTCTGT CATTCACCAA GAATTGAACC AATTACAACA CCAAGAGGAA
ACTCTAGAGG TTGCTAAAAA GAAGGAAGGT GCCTTGAGAA TGCTAGGAGG CATTTCATCT
TATCTTACAG CTGCTAAATC AGACTACAGA AGTCAAATGG AAGCTTTCTT GATTCACTTG
GTATTCCCTT CACTCACATC TAAGTTTGAA TTCTTGAGGG CCCGTGCACT TGAAGTGGTT
TCAAAGTTCG ATGACATTAA TTTGCAAGAG GAGCAGAGTA AGTCAATGTT GTACCAAGGT
GTTCTCAGAA ACTTTGACTC TTCGAGCAAC GCCAGTTTAC CTGTCAGTTT CCAAAGCGCT
TTGGCAATAC AAGCTTTCTT GCCCCAGCCG CAATTCAAGG AAATTTTGTC TGGAATTATA
ATACCGACCA TGTCAAGATT GTTGGAGTTA TCTAACGACA TCGACAATGA CGCTATTTCC
ATTGTAATGC AGGAATGTGT TGAAAATTTC TCGGAACAAT TGCAACCTTT CGGGGTTGAT
TTAATGAGCA AGTTAGTGGA ACAATTCATG AGGTTGGCTG TGGAAATTAA TGAAGCATCG
AACGTCGATG TGGATGATTT TGACGGAAAT TTTGAAGACC AGAGTGAAAA GGTCATGGCC
GCCATTGGTT TACTCAACAC CATGATCACT GTTTTATTGT CCTTCGAAAA CTCTACTGAA
GTATGCTTGA AGCTAGAGGA AGTCTTCTCA CCAGCCATTA CTTACGTGTT GACAAACAAG
ATTGATGATT TCCTCGCTGA AATTGGGGAG TTGATGGAAA ATTCAACATT CTTGTTACGT
TCCATCAGTC CTATTATGTG GAAGAATTTT GAGCTTTTGA GTGATTCGTT TGCTGATGGT
CTTGCTATAA TGTACCTTGA AGAACTAATG CAGTGCTTGC AAAATTTTTT GAACTACGGA
ACTGATGAAT TAATCAAGAA CCCAGCTCTT GTTCAAAAAT TTTTCAATAT TTATAAGATG
ATTTCAGAAG GTGAGGATAC CCAAATTGGA TACAACGATC TTGTGTTTGC TTGCGAATTA
TCGCAAACCT TCGTTTTATC TTTACAGCAA GTTTCTGTTC AATACATTCC TAGTTTTGTT
CGATCTGTCA TCACTATCTC TAACGAAGGG AACAAGGATA AACACCATAT CAAGAACAGT
GCATTTGATG TAAATGTCAA CAATGTTATT GCTGCTTGTT TGGTTTACGA TGCCCCAACT
ACTTTGAGCA TATTGCAAGA ATCTAACCAA GTGATTCCAT TCTTTGAACG TTGGTTCCAG
TTGATCCCTC AGTTGAAGCG TGTTTACGAT TTAAAATTGT CCATACTTGC ATTGTTGAGC
TTGTTGAACA ACGAAGAAAT TATCTCACTG TTACATTCTA CTACTCCAGC TATATTTGAC
CAAATGGGAT TAAAATTAGC TATATTAACG AGAGAATTGC CAAAGGCAGT CGAGAGTTTA
GAGAAGAGAA GGAAGAATTT CGACGAAAGT GATTTCGGAG GTGACAACTA TAGATACGGC
GACGACGAAT GGGAAAATGC CAGCTCTGAA GATTTAGACT ACATTCTCGA CCAGGGAGAA
GCAGCAGCAA ATGAAGCAGC CAATGAAGAA GAGGTCGAAG GAGGTAGACA CGAGTACTTA
AACTTTTTAC AGGAAGAGGA TAATAAATTG AAGAGTTCAG GTTACTTCGA TGAAGAAGAT
GAGCCAGTGA TAGAGGACCC ATTGGCCACA ACCCCCCTTG ATAGCGTCAA CGTTTTTGCG
TTGCTAAAGG ATTTTATGGT CAAGGTGGAA GCCAATAATG CTGCTCTTTT CAGCGGTATT
TTTGGAGGTC TTACTGAAAG TGACAAGATA CTCTTTAAAG ATATTTTTGA TATTGTTCAG
TAG
 
Protein sequence
MNANVLLECF SATLQANQDV RIQAEVKLRE LSATPGFLGA CLDIIASNGS SINSGVRKAV 
AVYFKNRVVK FWTSADSKID AGEKPVIKDR ILPVIVVSDY ITKQQLIPVL RVLISHEFPN
WSGLLESTGS LLQQDEDFSQ LYTGLLCFAE ISRKFRWTDN NDRKAELYPI IESAFPHLLN
IGNTIVASAQ NITEFQAEIV KLILKIYKFV TYYDLPAPLQ TSEAVEQWGQ FHESVINMPV
PSYIRDSNLS EQEKSFLQFS KCYKWSIANM YRLFVRYASA SLGKKFKYTE FHELYLNQLV
PPLLSSYLSI IEQWCQGKKW LSSSALYFLL EYLSHCITQK STWQIIKPFF QNLVSYLIYP
LLCPSDSILE IFELDPQEYI HVAFDISEEF NSPDVAALGL LVTLVHKKKS TTLETIVSVI
HQELNQLQHQ EETLEVAKKK EGALRMLGGI SSYLTAAKSD YRSQMEAFLI HLVFPSLTSK
FEFLRARALE VVSKFDDINL QEEQSKSMLY QGVLRNFDSS SNASLPVSFQ SALAIQAFLP
QPQFKEILSG IIIPTMSRLL ELSNDIDNDA ISIVMQECVE NFSEQLQPFG VDLMSKLVEQ
FMRLAVEINE ASNVDVDDFD GNFEDQSEKV MAAIGLLNTM ITVLLSFENS TEVCLKLEEV
FSPAITYVLT NKIDDFLAEI GELMENSTFL LRSISPIMWK NFELLSDSFA DGLAIMYLEE
LMQCLQNFLN YGTDELIKNP ALVQKFFNIY KMISEGEDTQ IGYNDLVFAC ELSQTFVLSL
QQVSVQYIPS FVRSVITISN EGNKDKHHIK NSAFDVNVNN VIAACLVYDA PTTLSILQES
NQVIPFFERW FQLIPQLKRV YDLKLSILAL LSLLNNEEII SSLHSTTPAI FDQMGLKLAI
LTRELPKAVE SLEKRRKNFD ESDFGGDNYR YGDDEWENAS SEDLDYILDQ GEAAANEAAN
EEEVEGGRHE YLNFLQEEDN KLKSSGYFDE EDEPVIEDPL ATTPLDSVNV FALLKDFMVK
VEANNAALFS GIFGGLTESD KILFKDIFDI VQ