Gene Information |
Locus tag | PICST_53206 |
Symbol | |
ID | 4852003 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 3396942 |
End bp | 3400919 |
Gene Length | 3978 bp |
Protein Length | 1105 aa |
Translation table | |
GC content | 43% |
IMG OID | 640393711 |
Product | predicted protein |
Protein accession | XP_001387222 |
Protein GI | 126276278 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTCCA ATGCCAATTC GCAGCATCTG ACCTTGGCAA AGTTTGCTTT CAACATCTAC GGATCGTTGA ATCACGCCAC TCCCAGCTAC GAAGGACTGG CATCCCCCTC TCTGTCGTAC AGTTCTTCAC GATCGAAGAC CAATGAGCCT TACAACTACA GATTCAACAA ATTGGTGTAC AATTGTGAGC GTGAAGTTAC CACACTATCG CAACTTAACT ATCCACTTTC GTTGTCGAAT ACTTTCCAAT CGGACCAGAA CTTGTCACAT CATGTAATCA TCGGTGGTAG AAACTACTTG AAGTTGCTAG CCTTGAACGA AGACCAGTCG CGAATTGTCC AAGACATCAA TGTTCTTGAT CAGAGCTCGA TATACACCCA CAATTCACGT GTGCCGTCCA CAAACAAGTT GAACAACATC AACACCGTCA AGGCGCAGTC AGACACCATA GGCTGTGGGT TATCCAATGG ACTTATCACT GTGTATAAGG TAGGCTCCAA CGGTAAGTGC AGACTCATCC ACAAGTTTTC GGACCACAAA CGTTGCATAA ATTCTCTAGA TTATGTGGGT ATTAGGAATT TGTATGATGC ACCGACCCAG ATGATATCTG GATCGCAGGA TGGTTCTATA AAGCTTTGGG ACATGCGACT GTCTTCTCCC AGACCTATGC TTACGATATC TTCAGGCAGC CATCTGGATC CCATTCGTTC ATGCCAATAC TCTCCCCATT CACAGGGCCG TAACAAACTT GTAGTTCTCT CTGTCCACGA CTCGGGAGCG TTGTGTAAAT TCGATTTACG ACTGTTTGGC TCCAACACCG CTGCCAACAC CAGTAGTAAT GGCCATGGCC CAGAGAGAAA ATGGAATATC CATACTGGTC CAGCATTGTC ATTACACATC CATCCCGAAA AGGAGTACGT TGTTACAGGA GGTAGAGACC AGAAGATTTG TGTGTTCAAC TACGGCGATT CTCAGATATC CAATAGAATT ACACCAGACG AGATGATTAA CACTTATGGT CCTGTAATGA AGGTACGATG GTGTCTATAT CCCGATGCAT CAACATCACA GTTTGGTGAA CCTCTCGATA CGTTCCAGCA ATCTAACGAC TTCAACAGGT TTGAAGATAA GCTTTCGTAC GATGAACGTG AGGCCATGTA CAGCTATCCT TCCTCCCTGC GTAGCAGTTC TCTATATAGC TACGACTTGG CTTGTCTGTA CTTGAACGAT GACTCTACTG TAGCCATTTA CAATCTCAAC AGGAAGTTCA TTCCTAAGGA GGTGATCACC ACGTCATCTA ATAAGCCTAT TCAGAACTTC ATCTGGGCTA ACAACCCAGG ATCGTCGCGT AAGATCTGGA CCATAACAAA GTCTAACGTG TTTTCCAGTT ATGATCTAGA TATGCACGAC AGCCTTCTCG AGTCTGAAAT TTCCAAACCT TTGGACGAAC TCGCTAATGT CACTGTAGAC TGGAACAATG GATTTGGCGA TCTCTGCTTA GCCAACCAGG AAAAATATGA GTTTGAAATC ACAGAAGTAG AATCACAGGC AAGCGACAAC GACATGGGTG ACATAGATAC AGAGTATTCG TCCAGATATG AAAGAAGCAA CTCTAACAGT GTCATTGATG AAAGTGATAC CAGAAGCATT GATCACCATT CACCTGAAAA CGATGCTGCA ACTATAGCAG CTGGTGGCTC AGGAAAGGCG TTTGTTGGCT CTGTACCCAT TGCATCAAAA ATCCACGGCT CTTTATCAGG CTCATTGATA GGGTCGTCTT CGGCAGAAAA GCCTCCTCTT 
TTCAGATCAA GCACTCATTA CTCGATGCAC ATGGCCAAAT CGCCATCTCC AGTACCTCGT AGAGGTTCCA CATCATTTGC TGCTCATTCA GAATCCCAGC CAAATCTTTC TAATATGCAA GGTTTATCGA TGTCAAGGCC AAAACTTACA CGTAATCTTT CGCAGGCTAC CGAAGACTCG AGTATATCTA TCGGTTCGGC CCCACAGTCA AATATCCACT TAAAGCTGAA ACGTTCATTT CAAGTTAGCT ATGCTTCACC ATATTTGGTG CCCGTGTCGT TACCGTTGTC TCTCAACGAC GAAAACGTGT TCGAGATTCT CTCAAATAAC TACTTGATAT CCATTCCAGA TGGGTTTACT TTAGTAGACG TATGTTTATT GAATGCCAGT GTTGCAGCCA GTGTACAACG GTTTCGTGAA TGTCAAATCT GGCGAGTATT GGCTGTAAGC TTAGAGGAGG ATTATGTTCA AATTGACAAC ACTACATTCT TGAGTGATCC AGAATTGGAG CATAATGAAA CTAACCAAGA TGAAAAGATC GACGATCAGA AAGATGCTAA ATCCATACTG TCAGATTTGG GCAACTTTGT AGGGTCATAT AATTCGAATT CAACCCTGAC CACTAACTAC GGAGGGTTGG GTAGTCTCAG TGCTAAAGAT ACCAGTCAGG AATCTATTGG CAGAGAGATA CGTTCAATAG TATCGTCTTC TGTAGAATCG GATGTCAAGA TCCCTCCTCC CAACCCTCCA TTAGCAAAGG CAAACAATTC CAACAATCTC ATGGATATGA TCAACCGAAG CAGAGTGAAC AGTATGAACC ATCTTCAAAG CATAAGCCCA TCTGGGTCCC ATACATTTAT CCGCAATGCT ATCCATGAAA ACAAAAGCAA CGAAAACGCA ATTGTAGACG ATGACGAAAC TGACGCAGCA GTACATGGAA CTGAAGGTTC ATCACATATT GCTCATCATA GAGAAAGACG TTCATCTTCG AAGAAAACTA AACCTACTCT TAGACATCAT AGAAGTTCTC AAATACTGTA TGAAGTCGAT AAAGAAACTC AAAGTTCTCC AATAGCCATA GCATCACCCT CAAAAATTGG CATTGGCTCC GGCGATAACT CGCCCAATTC TGCATTTTTA CATCGCCATG CTGATTCGTT CTCATCTTCG TTTGCAGGTT CAAAGTTGGC AGGACGAATC GGAACTTCTC ACGTTTCTGA AGATTTGGAC AACGAGAATC TTAACATTCT CAACAATGCT GTTCTCAACT CGAGTCCAAA CTCAGCTATG ACGACTCCTC ATCCGCAATC AAATAGTCCC AACTATTCAA ACTTTTTTTC ACTGTCGCAC CAGTCAGCTC AACACCATTC CATGGGTACT GGTTCTGGTT CAACTTCAGT ACCTTCTCGT CGTAATTCTG CTATTCCAGC ATATGGATTC CATAGACCTA AGTTGTCGTC TACATTCATG TCTCCTATTT ATGATGAATT TGCTGAAAAA CAGGAACAAC CGCGTAGCTT GAAGGCCGAG TCCTTGTTGA ACAACGATGT CTCTGATAGA ACTACAACGA AGTCTGAGCT TACTAAGGCA ATTAAAGAAG AAGTAGATAA TTCCAGTGGG GCCCCGCTAA AGAAAGCTTG GAAGTCGCTG AGCTTATTGG AAAAAGCATT AGCGCATGCT TCGAACGAGG GGGATATTAT TCTTTGCTCT ACACTTTCAC TCTTGTTTTA CGACTCGTTC AAGCAGGTAA TCCCCCAATC GTCCTGTTTG GATTGGTTAG GGCTCTACAT TGAAATCCTA CAAAGAAAAA 
GGTTGTTTGT CAATGCTATT CACGTTGTCA ATAATGCTCC TGATGACGTT AGAAGCAAGT TGAAAAACTT GACCTCTGGC GATGTGGATC TCCGTTTCTT CTGTTGCTGG TGCCAGAAAC TCTTGGTTAA TGAAAAGTCC AAGGAGAAAT TGAAAAATGA TGTCAATGCA GACTTCGGCT ATTGGTACTG TGACGAGTGT AGTCAGAAGC AACTGAATTG TATCTATTGC AACGAGCCTT GTAAGGGATT GACGGTAGTT GTTAGTCTTA AGTGTGGCCA CAGAGGACAT TTTGGATGTT TGAGAGAATG GTTCATTGAG GACGAGAATA ATGAATGTCC GGGTGGCTGT GATTACAGTG TAGTATAG
|
Protein sequence | MSSNANSQHL TLAKFAFNIY GSLNHATPSY EGLASPSLSY SSSRSKTNEP YNYRFNKLVY NCEREVTTLS QLNYPLSLSN TFQSDQNLSH HVIIGGRNYL KLLALNEDQS RIVQDINVLD QSSIYTHNSR VPSTNKLNNI NTVKAQSDTI GCGLSNGLIT VYKVGSNGKC RLIHKFSDHK RCINSLDYVG IRNLYDAPTQ MISGSQDGSI KLWDMRLSSP RPMLTISSGS HLDPIRSCQY SPHSQGRNKL VVLSVHDSGA LCKFDLRLSN GHGPERKWNI HTGPALSLHI HPEKEYVVTG GRDQKICVFN YGDSQISNRI TPDEMINTYG PVMKVRWCLY PDASTSQFGE PLDTFQQSND FNSSSLYSYD LACLYLNDDS TVAIYNLNRK FIPKEVITTS SNKPIQNFIW ANNPGSSRKI WTITKSNVFS SYDLDMHDSL LESEISKPLD ELANVTVDWN NGFGDLCLAN QEKYEFEITE VESQASDNDM GDIDTEYSSR YERSNSNSVI DESSLIGSSS AEKPPLFRSS THYSMHMAKS PSPVPRRGST SFAAHSESQP NLSNMQGLSM SRPKLTRNLS QATEDSSISI GSAPQSNIHL KLKRSFQVSY ASPYLVPVSL PLSLNDENVF EILSNNYLIS IPDGFTLVDV CLLNASVAAS VQRFRECQIW RVLAVSLEED YVQIDNTTFL SDPELEHNET NQDEKIDDQK DAKSILSDLG NFVGSYNSNS TLTTNYGGLG SLSAKDTTKA NNSNNLMDMI NRSRVNSMNH LQSISPSGSH TFIRNAIHEN KSNENAIVDD DETDAAVHGT EGRIGTSHVS EDLDNENLNI LNNAVLNSSP NSAMTTPHPQ SNSPNYSNFF SLPKLSSTFM SPIYDEFAEK QEQPRSLKAE SLLNNDVSDR TTTKSELTKA IKEEVDNSSG APLKKAWKSL SLLEKALAHA SNEGDIILCS TLSLLFYDSF KQVIPQSSCL DWLGLYIEIL QRKRLFVNAI HVVNNAPDDV RSKLKNLTSG DVDLRFFCCW CQKLLVNEKS KEKLKNDVNA DFGYWYCDEC SQKQLNCIYC NEPCKGLTVV VSLKCGHRGH FGCLREWFIE DENNECPGGC DYSVV
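The Start bp, End bp, and Gene Length fields in this record agree if the coordinates are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. The following minimal sketch checks the arithmetic; the function name `inclusive_length` is hypothetical and used only for illustration.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints count, so add 1 to the difference.
    return end_bp - start_bp + 1


# Coordinates copied from the Start bp and End bp fields of this record.
gene_length = inclusive_length(3396942, 3400919)
print(gene_length)  # 3978, matching the Gene Length field
```

Strand does not affect the result: a gene on the minus strand spans the same number of base pairs on the replicon.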