Gene Information |
Locus tag | PICST_59604 |
Symbol | |
ID | 4838582 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 172278 |
End bp | 175460 |
Gene Length | 3183 bp |
Protein Length | 1052 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640389897 |
Product | predicted protein |
Protein accession | XP_001384332 |
Protein GI | 150865209 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5656] Importin, protein involved in nuclear import |
TIGRFAM ID | |
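The Gene Length field is consistent with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. The sketch below is only an illustration of that arithmetic; the function names `gene_length` and `max_protein_length` are hypothetical and are not part of the database. It also shows that an unspliced reading frame of this length would encode more residues than the listed Protein Length of 1052 aa, which fits the "Is gene spliced: Yes" flag. The 24 bp difference is an inference from the numbers in this record, not an annotated intron.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def max_protein_length(length_bp: int) -> int:
    """Residues encoded if the whole span were one unspliced ORF
    whose last codon is a stop codon (an assumption for illustration)."""
    return length_bp // 3 - 1


length = gene_length(172278, 175460)
print(length)                                    # 3183
print(max_protein_length(length))                # 1060
print((max_protein_length(length) - 1052) * 3)   # 24 bp not represented in the protein
```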
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.487702 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.194041 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGCCA ACGTGTTGTT GGAGTGTTTC TCTGCGACGC TCCAGGCAAA CCAGGATGTC CGGATACAAG CGGAGGTCAA GCTTCGTGAA TTGAGTGCTA CACCTGGTTT TTTGGGTGCA TGTTTAGATA TCATAGCTTC TAACGGTAGC TCGATCAATT CAGGAGTCCG TAAGGCTGTA GCTGTGTACT TCAAAAACCG TGTAGTCAAG TTTTGGACCT CGGCTGATTC CAAGATTGAT GCTGGTGAAA AGCCGGTGAT TAAAGACCGG ATCTTGCCAG TCATTGTAGT GTCTGACTAT ATCACCAAAC AACAATTGAT ACCTGTCTTG AGAGTATTGA TTTCTCATGA GTTTCCCAAC TGGTCTGGTC TTTTGGAACT GACGGGTTCG CTTTTGCAGC AAGTTCCTAC TGGTTCAAAC GTGAAGGATG AGGACTTTTC ACAGTTGTAC ACTGGCTTAT TGTGTTTTGC TGAAATATCA CGAAAGTTCA GATGGACCGA TAACAACGAC AGAAAAGCCG AGCTTTATCC TATAATCGAA CTGGCTTTCC CTCACTTATT AAATATTGGC AATACCATCG TGGCTTCTGC CCAGAATATC ACGGAGTTCC AGGCGGAAAT CGTAAAGTTG ATTCTTAAGA TCTACAAATT CGTCACTTAC TACGACTTGC CTGCTCCCTT GCAAACCTCA GAAGCCGTCG AGCAATGGGG TCAATTCCAT GAATCCGTCA TCAATATGCC GGTGCCACTG TATATCCGCG ACTCCAACCT TAGCGAACAG GAAAAGTCCT TTCTCCAGTT CTCCAAGTGC TACAAATGGT CTATCGCTAA TATGTACCGT TTATTTGTTC GCTATGCATC GGCTAGCTTG GGTAAAAAAT TCAAATATAC TGAATTTCAC GAGTTGTACT TGAACCAACT CGTGCCTCCT TTGTTATCCT CATACCTTTC TATTATTGAA CAGTGGTGCC AGGGTAAGAA GTGGTTGAGT TCTTCAGCCC TTTACTTTCT CTTAGAGTAC TTGAGTCATT GCATTACCCA AAAGTCCACT TGGCAAATCA TCAAGCCCTT CTTCCAGAAT CTCGTCTCAT ACTTAATCTA CCCATTGTTG TGCCCTAGCG ACAGCATCTT GGAGATATTC GAATTGGATC CCCAGGAGTA TATCCATGTA GCCTTTGATA TATCCGAAGA GTTCAACAGC CCCGATGTTG CTGCTTTGGG TTTGTTGGTG ACTCTTGTAC ACAAGAAAAA ATCTACTACG TTGGAAACTA TAGTCTCTGT CATTCACCAA GAATTGAACC AATTACAACA CCAAGAGGAA ACTCTAGAGG TTGCTAAAAA GAAGGAAGGT GCCTTGAGAA TGCTAGGAGG CATTTCATCT TATCTTACAG CTGCTAAATC AGACTACAGA AGTCAAATGG AAGCTTTCTT GATTCACTTG GTATTCCCTT CACTCACATC TAAGTTTGAA TTCTTGAGGG CCCGTGCACT TGAAGTGGTT TCAAAGTTCG ATGACATTAA TTTGCAAGAG GAGCAGAGTA AGTCAATGTT GTACCAAGGT GTTCTCAGAA ACTTTGACTC TTCGAGCAAC GCCAGTTTAC CTGTCAGTTT CCAAAGCGCT TTGGCAATAC AAGCTTTCTT GCCCCAGCCG CAATTCAAGG AAATTTTGTC TGGAATTATA ATACCGACCA TGTCAAGATT GTTGGAGTTA TCTAACGACA TCGACAATGA CGCTATTTCC ATTGTAATGC AGGAATGTGT TGAAAATTTC TCGGAACAAT TGCAACCTTT CGGGGTTGAT 
TTAATGAGCA AGTTAGTGGA ACAATTCATG AGGTTGGCTG TGGAAATTAA TGAAGCATCG AACGTCGATG TGGATGATTT TGACGGAAAT TTTGAAGACC AGAGTGAAAA GGTCATGGCC GCCATTGGTT TACTCAACAC CATGATCACT GTTTTATTGT CCTTCGAAAA CTCTACTGAA GTATGCTTGA AGCTAGAGGA AGTCTTCTCA CCAGCCATTA CTTACGTGTT GACAAACAAG ATTGATGATT TCCTCGCTGA AATTGGGGAG TTGATGGAAA ATTCAACATT CTTGTTACGT TCCATCAGTC CTATTATGTG GAAGAATTTT GAGCTTTTGA GTGATTCGTT TGCTGATGGT CTTGCTATAA TGTACCTTGA AGAACTAATG CAGTGCTTGC AAAATTTTTT GAACTACGGA ACTGATGAAT TAATCAAGAA CCCAGCTCTT GTTCAAAAAT TTTTCAATAT TTATAAGATG ATTTCAGAAG GTGAGGATAC CCAAATTGGA TACAACGATC TTGTGTTTGC TTGCGAATTA TCGCAAACCT TCGTTTTATC TTTACAGCAA GTTTCTGTTC AATACATTCC TAGTTTTGTT CGATCTGTCA TCACTATCTC TAACGAAGGG AACAAGGATA AACACCATAT CAAGAACAGT GCATTTGATG TAAATGTCAA CAATGTTATT GCTGCTTGTT TGGTTTACGA TGCCCCAACT ACTTTGAGCA TATTGCAAGA ATCTAACCAA GTGATTCCAT TCTTTGAACG TTGGTTCCAG TTGATCCCTC AGTTGAAGCG TGTTTACGAT TTAAAATTGT CCATACTTGC ATTGTTGAGC TTGTTGAACA ACGAAGAAAT TATCTCACTG TTACATTCTA CTACTCCAGC TATATTTGAC CAAATGGGAT TAAAATTAGC TATATTAACG AGAGAATTGC CAAAGGCAGT CGAGAGTTTA GAGAAGAGAA GGAAGAATTT CGACGAAAGT GATTTCGGAG GTGACAACTA TAGATACGGC GACGACGAAT GGGAAAATGC CAGCTCTGAA GATTTAGACT ACATTCTCGA CCAGGGAGAA GCAGCAGCAA ATGAAGCAGC CAATGAAGAA GAGGTCGAAG GAGGTAGACA CGAGTACTTA AACTTTTTAC AGGAAGAGGA TAATAAATTG AAGAGTTCAG GTTACTTCGA TGAAGAAGAT GAGCCAGTGA TAGAGGACCC ATTGGCCACA ACCCCCCTTG ATAGCGTCAA CGTTTTTGCG TTGCTAAAGG ATTTTATGGT CAAGGTGGAA GCCAATAATG CTGCTCTTTT CAGCGGTATT TTTGGAGGTC TTACTGAAAG TGACAAGATA CTCTTTAAAG ATATTTTTGA TATTGTTCAG TAG
|
Protein sequence | MNANVLLECF SATLQANQDV RIQAEVKLRE LSATPGFLGA CLDIIASNGS SINSGVRKAV AVYFKNRVVK FWTSADSKID AGEKPVIKDR ILPVIVVSDY ITKQQLIPVL RVLISHEFPN WSGLLESTGS LLQQDEDFSQ LYTGLLCFAE ISRKFRWTDN NDRKAELYPI IESAFPHLLN IGNTIVASAQ NITEFQAEIV KLILKIYKFV TYYDLPAPLQ TSEAVEQWGQ FHESVINMPV PSYIRDSNLS EQEKSFLQFS KCYKWSIANM YRLFVRYASA SLGKKFKYTE FHELYLNQLV PPLLSSYLSI IEQWCQGKKW LSSSALYFLL EYLSHCITQK STWQIIKPFF QNLVSYLIYP LLCPSDSILE IFELDPQEYI HVAFDISEEF NSPDVAALGL LVTLVHKKKS TTLETIVSVI HQELNQLQHQ EETLEVAKKK EGALRMLGGI SSYLTAAKSD YRSQMEAFLI HLVFPSLTSK FEFLRARALE VVSKFDDINL QEEQSKSMLY QGVLRNFDSS SNASLPVSFQ SALAIQAFLP QPQFKEILSG IIIPTMSRLL ELSNDIDNDA ISIVMQECVE NFSEQLQPFG VDLMSKLVEQ FMRLAVEINE ASNVDVDDFD GNFEDQSEKV MAAIGLLNTM ITVLLSFENS TEVCLKLEEV FSPAITYVLT NKIDDFLAEI GELMENSTFL LRSISPIMWK NFELLSDSFA DGLAIMYLEE LMQCLQNFLN YGTDELIKNP ALVQKFFNIY KMISEGEDTQ IGYNDLVFAC ELSQTFVLSL QQVSVQYIPS FVRSVITISN EGNKDKHHIK NSAFDVNVNN VIAACLVYDA PTTLSILQES NQVIPFFERW FQLIPQLKRV YDLKLSILAL LSLLNNEEII SSLHSTTPAI FDQMGLKLAI LTRELPKAVE SLEKRRKNFD ESDFGGDNYR YGDDEWENAS SEDLDYILDQ GEAAANEAAN EEEVEGGRHE YLNFLQEEDN KLKSSGYFDE EDEPVIEDPL ATTPLDSVNV FALLKDFMVK VEANNAALFS GIFGGLTESD KILFKDIFDI VQ
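The sequence fields are printed in blocks of ten characters separated by spaces, so they need to be cleaned before any calculation. The following is a minimal sketch, assuming plain string input, of how a GC percentage such as the one in the GC content field could be computed; `clean_sequence` and `gc_percent` are hypothetical helper names, and the sample is just the first two blocks of the Gene sequence field, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide string."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the Gene sequence field, used only as a sample.
sample = clean_sequence("ATGAATGCCA ACGTGTTGTT")
print(len(sample))         # 20
print(gc_percent(sample))  # 40.0
```

Note that Translation table 12 (the alternative yeast nuclear code) reads CTG as serine rather than leucine, so translating the gene sequence with the standard table would not reproduce the Protein sequence field.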