Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_66755 |
Symbol | |
ID | 4837381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 65926 |
End bp | 70096 |
Gene Length | 4171 bp |
Protein Length | 983 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640388696 |
Product | hypothetical protein |
Protein accession | XP_001382243 |
Protein GI | 150863688 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.234519 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ACGGCTTTGT GGGCACAAGT GTACGTACAA GATTTGATGG TGAAGGAGCT GGTTGTCAAA TTTTCTACTT TTTAAAGTGG CATATCTGGC AAATCCTAAT TATTCTCTGT ATATGAGAGT GGAAGTGAGA ATTGGAATTT TATTACTCAA TTAGTAAAAT CGACTATACT CCAAGACAAT AAAACAAGTA ATTTCTTTGT TTATGTCTAG ATATTTACAG ACATAACAAT AGTACAAATT GCTGTTAGTG TAATCGCAAA CACAAGAGGT TAAAATTTGC AGTTCTTCAA ATGCTATATA GCACTACTAC TGAATTCTCC ACTGCCCTCA TTCCTTAGCA CTGGATTCCA AGACACTTTC CATATCTCCT TTTACAATAT GTCGATTACT AACGCGATGC AGCTCGGCGA CTTGATTAAG AACAGGGCTT TCCACCCCGA CGGTATGTTC ACTTTGTATT TTATCTTAAT TTTACTCTAT GACTATATTC AATACTAACC ATTTCAGCAT TGGTCCAACG GGTAAGAACG AATATTCCAC GGACTCTTCC ATACTATGAC AATGTTCACA TCGAGAGTCC AAGTTTCAGC GACCCCATTG CGAAAGACTT GAAGAAGTTG TATCCACACA GGCCAGTAGT TGTAGCTGAG GGCAGACCCA AGAGTCATGA CTTTGAGTCA AATCCATACC AGGCGCAACT GTATGGTAAT GGAAGTCCCA ATGATACCTC CAACGACCCA GATGATGTTG AGGCAGAAGA AAGTATTGAT TTATACTCGC TGTCGTTGCT AGAGCTTCTC AATCTTTTGC TTGAAGAGAC TCATTTGGAC GACAGCAACA GGATATCAAT AGAAAACTCG CTCTTGGAGT TGACTTCCTA TCTTCCAAAA CCATTGCAGT CTTGGACTTG GAAACACGCC TCGTCTCAGG CTTCATCTTT GTCTAACGAA AAGCAGCATC AGTTAAAATC TGCTAACAAT TTCAACCAAA GTGATCCTAT CTATCCAAAT GGGACGCTTT CCAAAATAGT GATCGACAAT TACACCAGTT TGTTGCGTTT CTCATCTACT TATTTCAATT TATCTTTGGC TTCTCACATT ACCAGATACT TGGTTGAATT GATCTACAGC TTACAATATT GGGAGGTGTA TCATTTGCTC TATATGCTCC CAAACTTAGA GTATTTCTTG AAATTGATTG ACTTCGAAGT AGTAGACACC GTATATGGAC CTCTTGTTAG GCCCCCGGAA AACTATTTGA AGGATTCCAT GTTATCCGGT TTGCAGTATC CTTTTCCATA TCCATTCTAC AACTATTCAT ACCATTCATT TGCCCAAAAT GTGGACTCCA AGAAGTACAA CAGAATCAAT ACTATACCTT ATGAAGATAT AACGCTAAAA GATGTACTGC CACCAAAGAA ACGCAGAGGA AGACCAAAGT CTATATCTGA GCCTATGTTA AAGAAGAAAC CAGGCAGAAA GAGAAAGAAC CCATTGACAG AAGATATACT AGTAGCTCAG TCGCAGACAA CACAGAAGGA TTTCCCTAGT AACACGGAGA GTGAAATGAG TGGCACAGAA TCAGAACATG AAGGCTTGGG AGACTATGAC GATATCCCGG AAGAAGAGGA AGTTATTATT CTTGACGATG AATATGATGA AGAAGGTTTC GATGAAGACA TGGAAGATGA CTATGTGGAA GACGGTGCAA TGAGGGCAGC AGCGTTTTCA CCCGAAGCTA GCTCAAGAGT GGCTTCTACA CAACCTGACT TTGATAGACA GGATAAGGTA CCACAATATA ATGCTGGCGA GAGAGAAGAT CATCGTTTGA GTTTAAGCTC TCCACCAAAA TCATCACCAC CAATGACTCA ATCCTCACAA TATACATCTC AGGATATTTC TATGAGACTT CCTCCACCTC CACAGGGCCT TTCTCCAACC GTTCCTTACC ACCCTCCACC TCCAATAGGA AATGAAACTC AACAATTAAA ACCCCCCAGA CTTCCAAGTG TTCATGAACT ACCGTTTCTT CCACAGTTCC AGCAGCCGGT ACCTCATTAC CAGCCACTTA CTGGACAGAA CAGAACTCCA ACAATTGTTC AGGTACGACC TGTTCTGTTC CCAACGGCAT CCCAACTTTC ACCAGTGGTG CAGGAACCAC CATTACCAGG AATTCAGAAT TATCAACCAA GGGAATCTCC ATCTTCATCT ATACCAACTC CACAAGTAGT ACATTCGCAT ATAATGCCAC AAAGACGATC AATATCACAT ATACTGCCAA CAATCCTGGA TAGGCATTTA GAAGCTTCAC CCCATTCAAC CCATACTAGA TATATAGGTC AAGACGGTTT TGTAAAGATT CAGAGCGATT CTTCATTGCA AAATCCACAA GCATTATATG TAGAATCGCA ATTATATCCT CAAACGCATC GCCCAGGCAT TCCTTCGTAT CTGGTTCCAC CTCCTCAAGG AACGCAGCCA TCCCCACCTT TACAGCAACA AACGATTGAT AGCCCTCAAG ATAGACAACC GCTGAAGCTT CTTCTGCAAA TCCAAGCACT TAAGCTGCCG CAAAGAGGTC ATGAAAGCGA ACACATAGAG CCACAGGAAT TGTCACGATT AGCTGACGAC TCTGCAAGAA AACTCAGTTT CCAACTCAAC TCAGATCGCC CAAGGGAAAA GGGACCCGAT ACGCATGAAA AAGAGAGAGT AGAAAAGGAA AGGGTAGAAA GGGAAAGAAC AGAAAGGGAA AAAACAGAAA AGGAAAAAAT AGAGAGTGAA AGAATAGAAA GTGAAAGGGT AGAAAGTGAA AGAATTGAAA AGGAAAGGAT TGCAAAATCA GTTATCAACG ACAAACCATC AGATGTGAGT AGCTCCGAGG ATGAAAGAGA AACCTCAATG GAAGCTGGCG AATATCATCC AGATGATCAT GAAGGAAGTT ATGTGGACTA TGAAGGGGAA CCAATGCAAG GATCATCAGA AGGTATGGAA GATGGGAATG GTATATTCCA ATACCAAAGT CGATTAAGAT TCATAGCTCA ATCACCAAAG AGTAGTGCAA AATTGGAAGG AGGAGAGATT CCAATGCTAG ATGATAAGAA GAAGAATAAG TCTGGAGTAA TTCACCAGTG CCACTTAATC GACCCTGGCA CTCTTCGTAA ATGTTTAAAG ATTTTCTATG GTAAGAATGA ATTATTGAGA CACCAGGAGT TTGTTCATGC TACAAAGAAG AAAATCTACA AATGTATCTA CTGTTCTAGA AACGGAGCCA AAGTTCAGAG TTATCCAAGG CATGACTCGT TGGCAAGACA TATAAGAAGG AAGCACGGTG TTACCGGAAA GGAAAACAAA ATGGCTGTGA ACTATGCCAA AGAGAACGTA GAAGTCATCG ATGATCCAAA ACTGCTAGTA ACAAAGCAAC ATACCTTTGT AGAGCCATTG CCGCATCCTC AATTTCTCAA TCCAGACTTT ACAATCAAAT CTAGCTATGC TGGATTTTTA CTGTTCAGCA CGAAAGAGGT TCCCAACCCT AAACTACCAG TTATTCATAC TCACCACTAT CCAAAGAATA GGATTGTCGA CGTTGTAGAG TTTCCACCAA TAGATCAGCC AGCAGGAAGT CTGTCAAAAG AGGTACTATC ACCTACAGTT TCAGCAAAGG CACCCAACGT TTTCTTGTCG AAGATATCTC CCAAGTCAGT AAAGTCACAG GAAACATCTC CAGTATTCAA GGTATCTCCA TTGAAGTCAA TTACTTCTCC AAAGGATTTG CCTTCTGTTG CTACTAGTAC TAGTCCTCTT CAATCAGTGG CTCCACCTCC AGTTGCTCAT CTTTCAAGTG GTGCGTCTCC TCTAATCGGT CACAACACCG TAGGAGGTCC ATCTCCAGCC AGAGGAGACC CATCTGCTGG ACCTCCAATT GCACATAAAC CTGGAGTGCT TATTGGCCAA AACTCTCCTC CCAAAAAAGG AACAGTGTTA CCCCTGATTA AAGAGCTTGA CAACAGCCAC AGTTTTGGGG TTCACAGTTA CCCACGTTCC AGCAGCGATA ACATCCACAA AAGCATGAAT TTGTCTTACT TGAATGGCCC ACCGCCTGCG GAAGATAGAA AGTAGACTAT GTAGCTATAT TGTATATCTC ACGACGAATA ACGAAGAAAA TAGAAAAAAT G
|
Protein sequence | MLPNLEYFLK LIDFEVVDTV YGPLVRPPEN YLKDSMLSGL QYPFPYPFYN YSYHSFAQNV DSKKYNRINT IPYEDITLKD VSPPKKRRGR PKSISEPMLK KKPGRKRKNP LTEDILVAQS QTTQKDFPSN TESEMSGTES EHEGLGDYDD IPEEEEVIIL DDEYDEEGFD EDMEDDYVED GAMRAAAFSP EASSRVASTQ PDFDRQDKVP QYNAGEREDH RLSLSSPPKS SPPMTQSSQY TSQDISMRLP PPPQGLSPTV PYHPPPPIGN ETQQLKPPRL PSVHELPFLP QFQQPVPHYQ PLTGQNRTPT IVQVRPVSFP TASQLSPVVQ EPPLPGIQNY QPRESPSSSI PTPQVVHSHI MPQRRSISHI SPTISDRHLE ASPHSTHTRY IGQDGFVKIQ SDSSLQNPQA LYVESQLYPQ THRPGIPSYS VPPPQGTQPS PPLQQQTIDS PQDRQPSKLL SQIQALKSPQ RGHESEHIEP QELSRLADDS ARKLSFQLNS DRPREKGPDT HEKERVEKER VERERTEREK TEKEKIESER IESERVESER IEKERIAKSV INDKPSDVSS SEDERETSME AGEYHPDDHE GSYVDYEGEP MQGSSEGMED GNGIFQYQSR LRFIAQSPKS SAKLEGGEIP MLDDKKKNKS GVIHQCHLID PGTLRKCLKI FYGKNELLRH QEFVHATKKK IYKCIYCSRN GAKVQSYPRH DSLARHIRRK HGVTGKENKM AVNYAKENVE VIDDPKSLVT KQHTFVEPLP HPQFLNPDFT IKSSYAGFLS FSTKEVPNPK LPVIHTHHYP KNRIVDVVEF PPIDQPAGSS SKEVLSPTVS AKAPNVFLSK ISPKSVKSQE TSPVFKVSPL KSITSPKDLP SVATSTSPLQ SVAPPPVAHL SSGASPLIGH NTVGGPSPAR GDPSAGPPIA HKPGVLIGQN SPPKKGTVLP SIKELDNSHS FGVHSYPRSS SDNIHKSMNL SYLNGPPPAE DRK
|
| |