Gene Information |
Locus tag | PICST_28643 |
Symbol | |
ID | 4851406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 1732950 |
End bp | 1738745 |
Gene Length | 5796 bp |
Protein Length | 1931 aa |
Translation table | |
GC content | 40% |
IMG OID | 640393114 |
Product | predicted protein |
Protein accession | XP_001387561 |
Protein GI | 126274518 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
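The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 1732950
end_bp = 1738745
gene_length_bp = 5796
protein_length_aa = 1931

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS, so the span divides into whole codons,
# and the final codon is a stop codon that does not contribute an amino acid.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks hold for this record: the span is 5796 bp, which is 1932 codons, leaving 1931 residues once the stop codon is excluded.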
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.272891 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACGT CCTCGAGCAC CGAGGAGCTC AAGCGATTCT TGCTTCATCT ACCCTCTGAC TCTAACTATG TCTTTTCTCA AGATATTCGT ACCCAGATCA GACGGGCCTG CTTTCTTGCC ATCTCCAACA ATGGGCAACA TCTAAACCGC CTCTTTCCCA ACGCCTTCTC AAACAGAAAC GATCCAAATA GCGAATCGGA AGCGTTATAT TCACTGCAAC ACATGATCGA TGTCGTCGAA TCAGAATACA ACTGGCTCTT TGCCAACTAC TACAAAATCT CTCTCAAAAA TAGAGATCCC AAACACGACA CTCATAACCA CCACGCCTTC CATGCCAACC TGCCCTGTGC CCGTATCTTC CGCAAGGGTG AACCCATTTA TCGGTGTTTA ACTTGCGGTT TCGACGATAC CTGTGCTCTC TGCTCCCATT GCTATCAACC CGAATACCAC CAGGGTCACA AGGTCCACAT CACCATATGC CAGCGTGAAA ATGGAGGCGT TTGTGATTGC GGAGATCCAG AAGCTTGGGT CAAGGAGTAT ATCTGCCCTT ATGCGGCTCA TGATGATGAA AGCATTATCG TCAGAAACGA GAAAATGCCT GTAGACTTGG AACTGAGCTT TCTCCAGACG ATAGAAACCT TGCTAGATTA CGTTATCGAC GTTATGAGCC AGTCCGACCA GCAGTTCGAA GATCCGGTAG AAGCAACGGA AGCTAAAGTA GAACTTAATG CGATCAACAG TACTCTTGAC CCACTGAAAT ACGATTGTAG TACGGATAAC TTCACCGACG CCAACAATGA GCGGTTCTTT CTCATGGTAT ACAACGATCA AATCAGACAC TATAGAGATG CAGTCCAGCG TATCCATTTG GCCAGTAGGA AAGTAAAGCA ATTTGCCGTC ATGGTGACTG AAAAAGTGCA GAACTACGGC AAAGCCAAGG TGATAAGCTC CAAGAATATC AAGTTATTAC TCGAAAGACA AAAGATTTTG AGTGCTACGG GTCTTTCAAC ATGTATAAGA TCTCACAGAG ACGTTTTTAG GGAAGATATG TGTGATGAGA TCTTGATCTG GATCAACGAT TTGACCGAAA GTGAGATGTT CAAGATGAAC AACTCAGCAA AAAATCTTTT TTGTAGAGCT TTTTGTGAAA AGTGGAAAAG CGGTCTTTTA GTTGCCACTC ATGGAGACAA CTCAAATCAC CAATATAGAG TTGGAGCGCT AGATGCCTGT TCAAAAATCC CCAAATTGCC GTCTTCTAAT ACTGATAGAA AGACTCAACT ACATTGGTTC TTTCAACCTT CAAAATGGAA CTTACCTGAA GACATTTGCA GAGAATGCGA TTACAATTTA ACAGAAGAAG ACTACGAGCC ACATACTAGT CACTTGGGAT CTCGTCTTCA GTATCTTATT TATTTGGATG TTAGATTCTG GAAGTCTATC AGAATCTATT TGCATGATAT GTACTCGACT TCATTGATCA CAAATTTATG GTATAAGCAT ATCATTTCCT GTCAATATGT GGACATATAC CCTACCATTG CTGACATGTT CTTGACTATG GACAGAGAAC CAGAGCTTAA TGTTATGTGT ACGTTGTCTA CTCAGCTCTT CACGTGTCCT TCTAACTCCA CTTCGATTGT TCAGCATGGT GATGTTTCGC GAATTTTTGC GTCCATATAC GGTTTCTTAA CTATTGAGGA AATCAGGTCA CCTGAGTGTG TTGAAGTAAC TCAAGAGATA TCGATGAAAA GTTTGAAGAA CAGGATATGG GGCCAGATCT TCTTTGATAT TGGTTACATA 
TTGAGTAGAT CTCGCGATTC AAAGTATATT TTGACTAGCA ATATCATCCC CATGGCATGC GACATATTGG CATTATTTCA AGGTCGTCCG GTGATGAAAA GAGAAAAGAA GAACCATGTG GAATACGAAA GTCCCGATTA CACTGCATTT TTCCATGCCA TACTGGTCAT CTATCAGTTT GGAGAATACA TAGCCCATTC TTTGTCCAAT TTAGGAGATA TTGACCCTGA ATTGAGGACA TCCCTTAGTA AGAATGCTAT TAAATATGTC ATCAGCTTCT TGCTTAAGTT AGAAAACAAT GACTACCCTG GTCTTATTGA TGAATACGTT GACATTAACC TATCAATCGA CAAAAAAATC TCAAAGGAAC CCATAGGTGG AAATATCATC CAGTATTATA GAATTGACGA GGAAAAAGTC AGTTTCTTGC ATCCAATTCA TTCTTTCTTG AGTTGGTTGA TAGAACTCTC TGATTTCAAG TCTCCTTCGG AAATTGTTGA AGTGCTTAAC TCATCGACTG ATTTCTACAG CATAACCTCT AGTGTTGATA TTCCTAACCA TTTGACTTCT ATCTTTGACT ATCCCATTAG AACAATAGTA CTTATGTCAC AAATTAAGTC TGGTTTCTGG GTGAGAAACG GGTTCAGTGT CAGATCTCAA TTACAGCTTT ATAGAAACAC AGGATTGCGT GAAAGTGGGT ATATGAGAGA TCTTTTCCTA ACACAAGTAT TCATCAACTC AAATAGTCCT AACCTTGTTT GCTTCCTACT TTTCAGTCGC TGGTTATTAA TGGACGGATG GTTGATTGAC AGCAGAACGG TAAGAAATGA AGTCACTGAT CTAGATTTGC AAGTTTCAGC GGATCTGTCG GCATTGTGCT ACGACCTGAA GACCTTACCA TACATGTTAG AGGAATGCAT GAACTTCTTC ATTCATGTCT TGACTGAAGA TCTTTATTTG CGTGGTCTCA AAGATGAGGT TATGATCCAA ACAAGAATCA AGAAAGAGAT TGTTCACAAT TTGTGCTTCG GACCAATGAG TTACACGAAG TTGTGCTCTC AAATTCCGGA TCATATCTTG TCAGAGAAGA GATTTGATCT AATATTGGCT GAAATGACAA CATTTACAGC CCCAAATGGT GCAACTGACA TTGGTGTTTA TCATCTCAAA GATGAGTTTT ATGATCAGAT AAACCCATAT TATTTCAACT ATACCACAAA TACAAAGGAT GATGCAATTA AATTCGTGAA GGAAAGAATT CATAAGACCA CAAGAAAACC AATTACTGAA ATTGTTATCG ATCCTAAGTT GAGGGACCCT GGCGAATTGG GAATTTACAG ATATATTGGA AACTTTTCAG CCTCTGCTTA TTTTTCCGAT TTCTTAATTA GGACTTTGTC GTACATAAGC AAGGAGGGAA TTGAAGAGGT TGAAAGTTTG CTAGAAACTG CATTACACTT GATACACATC TGCGCCTTTG AAAATACAAT CGACATCAGT CAATACGGCA CATTCTATGA CAGATTTGTC AATATATCAG ATGCATACGG AACATCGATT GCAGTGTTAC TATACGAAAT TTTAGCAAAT GACCAGTTCA AAAATCACCA TTCAAAAATT CGTTGTATTT TGAAGGTCTT TGAGGACAAG TACCAGAATT TGGGCAAGAT CTTGAGTGAT CAAATTGTAG ATTATAATCC GTTGGTGATT GAGTTTCACA CAAAGATGGA TAATGATGAA AACGAATTCG AGAAGAAAAG ACGTATGGCG AAAGAAAGAC AAGCAAAATT AATGGCAAAA TTTAAGAAGC 
AACAGTCCCT GTTCCTCAAG AAGAACAACA TGGAAAGGAA TTACTGCAGT GATATTGAAA TGGAAGACTA CGAAGATGAA CATGGTTGGA GATTTCCAGA ACCCCATTGT TTGCTTTGTC AGAATGCGGC AGAAGACGCT GGCCCATTTG GTATCATTAC TTACATCTCC AAATCATCTG AGTTTCGCAC CATCCCTTTC AATGACAAAT TTTGGGTTTT GAAGGCATTT TCTGATAATG CTAGTTCAGA TGTGAACGAA AATGCAGGAG ACCCAGTAAT TGAAGAAGTC AAAAGTGAAA AGTGGCATAG GTTTATGGGC AAAATCAAGG AGAGCAATGT TATTGGCCCA GGATTCTCGC ACAATGATCA TGTTGAAAGC AAGTTGGTTT CGCTGAGCTG TGGGCATGGA ATGCACTTTC AATGCTACAT GAACTATTTG AATAGCAACA AGAGTCGCCT GAATCAAATC ACTAGAAATT CACCGGAAAA CGTTGAAAGA AAGGAATTTC TTTGTCCTTT ATGTAAGGCC ATAAATAACA TGTTCATTCC AATATTATGG ACCTCCAACA AGAGACTGTT GTCTCAATTT TTGAAACCTT TGCTGTTGCC AAATCCGTTC GATCACATTG ATCCAAAAAT TGTTCATAAT CAGGATTGGT ATAAAGAATT TACATTCATA TCCGATAAGG ACATTGAAGA CATGTCCATT TTGACTAAAG CATCCACGGA TATAATTTCT CTGTCATCCA ACAACGACTT CACCGGATCT CAGCACAGCT TCAGGGTTCT TTTGAGCAAT ATGTTCCAGA TTTTGTCACT TCTTACATTC CCTCAAGTTT TTAAGGCTGA CACAGTGTTT GTTCTTCTGA ACACGATTAA GTCAATTGAA ATATCATTGA GGGGAACATC CGCAAGGGGT GAGCTGATCA TATATCAATT GTCCAATAAC GCTTTGATCA ACTTGAGAAC ATTAAATGAA TTCCGTAATA CATGCGTTCT AATGAAGATC AAGAGCTGGA TTCATACTCC CAATCCAAAA GGAGACGCCT ATGCCAAGAT GTTGGCCAAT ATCTTCTCGT TATCAAATGG CTCTATTAAT TCATCCATTC TTGAGGCCGA CTTCTTCGAA TCGTTAGTGA ACATTCTTCC TTTGCCGTCT TCTGGTTTTT CATTTAACGC CATATTAAAC ACATGTTTCA CTGGTCACCT TATCCAGTGT CTCCATATAT TGACTAGAGA AATTGCCTCT CATGATTTCT ATAAAAGTCA GGATTATTCA GTTTTGGATA TTCCTGTAAT AGCCGATGTT GATATTAACA AGTCTAAAGT TGCTTTGTTA GCTTTCCATA AGTTGAAATT GTCTGAGGAT TTCCATGGAG ATGACGCATC CATTGAAAAT GACGAAAAAT TTGGACAGGT GATATACTCA ATGTTAGTCA AGGCAAGTAC ATCATTCTTG AGAAGGGCAG CTATTTTTGC ATACGTTCAA TGTGCCAACG TAGAAAAACT TGATGTTTCC TCCGTAGAAG ATCTAAGAGT TGAAGCTGAT AGACTCTGTT CTTTCTTGAA TATCAAGACA ATCGGCGAGT ATCTAGAATT GTTTATTTTG CCAAATAAGT CATACGAAGG TTGTGTCTTT CAAGGGTTTG TTGATTATGC AAGGAATTTG AACAAACCTG GATTTGAAAA CTCAGAGACA AGAAAGGGAT TAGAATATCC CGGAATGATT AGATTGACAG AACTTCCAGA AAGATTGGAC CACTTTTTCA CAAGATATTA CTATCTGGAT AAGTACAACA ATCCACACAT 
GACAATTGAA GATCCTGCCA TTTGTCTCTT CTGTGGTGCC GTTGTCGATG CTCAAAAGCC AGCCATTGGT TGCAAGGAGG GCCAGTGTAC GACACACTAC TTAAAAGAAT GTGCACATGA CGTTGGAATC TTCCTATTGC CCAAAGAGCG AAGCATGTTG TTGTTGCACA AGAATGGAGG CACTTTCTAC AATGCTCCTT TCTTGGACCA ACATGGAGAG TTAGCAGGAG AGTCAAAAAA GGCAAAGACA CTCCACTTGA TGAAAGCAAG ATATGACGAG TTTATTAAGA ACGTTTGGTT GCTGCACAAT ATCCAAAACA TTATCGCCAG AAACTTAGAA CGTGTTCTTG ACGCTGGAGG CTGGGATACT CTATAG
|
Protein sequence | MATSSSTEEL KRFLLHLPSD SNYVFSQDIR TQIRRACFLA ISNNGQHLNR LFPNAFSNRN DPNSESEALY SLQHMIDVVE SEYNWLFANY YKISLKNRDP KHDTHNHHAF HANLPCARIF RKGEPIYRCL TCGFDDTCAL CSHCYQPEYH QGHKVHITIC QRENGGVCDC GDPEAWVKEY ICPYAAHDDE SIIVRNEKMP VDLELSFLQT IETLLDYVID VMSQSDQQFE DPVEATEAKV ELNAINSTLD PLKYDCSTDN FTDANNERFF LMVYNDQIRH YRDAVQRIHL ASRKVKQFAV MVTEKVQNYG KAKVISSKNI KLLLERQKIL SATGLSTCIR SHRDVFREDM CDEILIWIND LTESEMFKMN NSAKNLFCRA FCEKWKSGLL VATHGDNSNH QYRVGALDAC SKIPKLPSSN TDRKTQLHWF FQPSKWNLPE DICRECDYNL TEEDYEPHTS HLGSRLQYLI YLDVRFWKSI RIYLHDMYST SLITNLWYKH IISCQYVDIY PTIADMFLTM DREPELNVMC TLSTQLFTCP SNSTSIVQHG DVSRIFASIY GFLTIEEIRS PECVEVTQEI SMKSLKNRIW GQIFFDIGYI LSRSRDSKYI LTSNIIPMAC DILALFQGRP VMKREKKNHV EYESPDYTAF FHAILVIYQF GEYIAHSLSN LGDIDPELRT SLSKNAIKYV ISFLLKLENN DYPGLIDEYV DINLSIDKKI SKEPIGGNII QYYRIDEEKV SFLHPIHSFL SWLIELSDFK SPSEIVEVLN SSTDFYSITS SVDIPNHLTS IFDYPIRTIV LMSQIKSGFW VRNGFSVRSQ LQLYRNTGLR ESGYMRDLFL TQVFINSNSP NLVCFLLFSR WLLMDGWLID SRTVRNEVTD LDLQVSADLS ALCYDLKTLP YMLEECMNFF IHVLTEDLYL RGLKDEVMIQ TRIKKEIVHN LCFGPMSYTK LCSQIPDHIL SEKRFDLILA EMTTFTAPNG ATDIGVYHLK DEFYDQINPY YFNYTTNTKD DAIKFVKERI HKTTRKPITE IVIDPKLRDP GELGIYRYIG NFSASAYFSD FLIRTLSYIS KEGIEEVESL LETALHLIHI CAFENTIDIS QYGTFYDRFV NISDAYGTSI AVLLYEILAN DQFKNHHSKI RCILKVFEDK YQNLGKILSD QIVDYNPLVI EFHTKMDNDE NEFEKKRRMA KERQAKLMAK FKKQQSLFLK KNNMERNYCS DIEMEDYEDE HGWRFPEPHC LLCQNAAEDA GPFGIITYIS KSSEFRTIPF NDKFWVLKAF SDNASSDVNE NAGDPVIEEV KSEKWHRFMG KIKESNVIGP GFSHNDHVES KLVSLSCGHG MHFQCYMNYL NSNKSRLNQI TRNSPENVER KEFLCPLCKA INNMFIPILW TSNKRLLSQF LKPLLLPNPF DHIDPKIVHN QDWYKEFTFI SDKDIEDMSI LTKASTDIIS LSSNNDFTGS QHSFRVLLSN MFQILSLLTF PQVFKADTVF VLLNTIKSIE ISLRGTSARG ELIIYQLSNN ALINLRTLNE FRNTCVLMKI KSWIHTPNPK GDAYAKMLAN IFSLSNGSIN SSILEADFFE SLVNILPLPS SGFSFNAILN TCFTGHLIQC LHILTREIAS HDFYKSQDYS VLDIPVIADV DINKSKVALL AFHKLKLSED FHGDDASIEN DEKFGQVIYS MLVKASTSFL RRAAIFAYVQ CANVEKLDVS SVEDLRVEAD RLCSFLNIKT IGEYLELFIL PNKSYEGCVF QGFVDYARNL NKPGFENSET RKGLEYPGMI RLTELPERLD HFFTRYYYLD 
KYNNPHMTIE DPAICLFCGA VVDAQKPAIG CKEGQCTTHY LKECAHDVGI FLLPKERSML LLHKNGGTFY NAPFLDQHGE LAGESKKAKT LHLMKARYDE FIKNVWLLHN IQNIIARNLE RVLDAGGWDT L
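The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any downstream use. The following is a minimal sketch of that cleanup, plus a GC-fraction calculation of the kind that could be compared against the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demo string is only the first three blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above, used as a short demo only.
demo = "ATGGCCACGT CCTCGAGCAC CGAGGAGCTC"

print(clean_sequence(demo))
print(len(clean_sequence(demo)), round(gc_fraction(demo), 3))
```

Running the same functions on the complete gene sequence (pasted in with its spaces and line breaks intact) should give a length that agrees with the Gene Length field, provided the sequence was copied without loss.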