Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31037 |
Symbol | |
ID | 4838217 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 1581063 |
End bp | 1583810 |
Gene Length | 2748 bp |
Protein Length | 891 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640389532 |
Product | predicted protein |
Protein accession | XP_001383926 |
Protein GI | 126134803 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00727] small oligopeptide transporter, OPT family [TIGR00728] oligopeptide transporters, OPT superfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.629932 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCAGG TCGAAAAGAA GGAAGAACTT AATAATATCG TTAGCATTCG AAGTGTTACT TCTCACTTGC AGTTAGAGGA TCATGAAGTC GACTTGAGAG CAATTACTTC TAACCCAGTC TCTATTGGTG AGATTGGTGT TTCGCTTACA GATGAGCAGA AGCATTTCAT CTTGAAGAGA CTTCATCTCA ACGGTCTAGA GTCCTTTGAA CAATTACCTC CTCAGGCTGC CTTTTATATC GACAAGATTG AGAAAATGGG AGAAGATGAA GCATTGACAA TCGTGAAAGA AGCTCTTGTT GAGCATCATG ACGACGCCAA TATCCCAGTC GAAGACATAG AATTGTGGAC TAACCTTGTT GAAATTGGAA ATTCTAAGAC TTCTGGTACA AAGGAAAAGT TGGCTTCTAC TTTTGATGAG AAGAATGACA GAGATGGTGA GTCCTCTTCT CAGGAAGTGT CTGAAGGTGG TAAATACGAA GATTTCACCC ACAATGTGGT CGATTGGGAC TTGCAAGTTA GATTGGAGGC CGTTTTGGTT GCTTACCACT CACCTTACCC TCAAGTAAGA GCGGTTACCG ATCCTTATGA CGATCATACT ATTCCAGTCG AAACTTTCAG GGTCTATCTT CTTGGAATTA TTTGGACTGC CATTGGTGCA GTAATTAATC AGTTCTTTGC TGAAAGGCAA CCTGGTATTT ATTTGGACCC AACTGTTGTT CAAGTGTTGC TTTATCCTAG TGGTATGTTG TTGGAATATA TCTTGCCAAA GTTCAAGTTC AAGATTTGGA AATACAGTAT TGACCTCAAC CCAGGTCCTT GGAATTACAA GGAGCAGATG TTAGCAACAC TCTTCTATTC TGTTGCAGGA GGTGGTGCCA GTTACGTTTC ATACAACATT CATGTTCAAA AGATGAAGGT GTTCTACGAT AACAAATGGG TTAATTTTGG TTATGAAACC TTATTAATTT TGTCTAATAA CTTCTTGGGA TTTGGGTTCG CTGGTGTTTT TAGAAGATTT GCTGTATATC CAACTGAAGC TATCTGGCCA ACAGTGTTAC CTACTCTCGC CTTGAACAGA GCTTTGATGG TGCCAGAAAA GAAAGAGATT ATTAACGGTT GGAAGATTCC AAGATACACA TACTTTTTCA TTCTCTTTGC GGCATCCTTT GTCTATTTTT GGGTTCCTGA TTACCTTTTC TATGCCTTGT CTGTCTTCAA CTGGATGACA TGGATTAAGC CTTACAATTT CAATTTAGCT GCCATCACTG GAAGTAACTT TGGTTTGGGA TTAAACCCAA TTCCTACCTT CGACTGGAAT ATGATTAGTT TTAACGCACC ATTGATTTTT CCATTCTACA CCCAACTTAA CACTTATATC GGTGCTTTAA TAGGGTTTTT TGCAATCGTT GGTGTATACT GGACCAATTA CAAATGGACA GGCTTCCTCC CAATCAACTC TTCGTCCATA TTCACAAATA CTGGTGACTA CTATGCTGTT ACGGAGATCC TCAATGAGAA AAGTTTGCTT GATGAAAAAA AGTATCAGGA GTACGGACCA CCATTCTACT CCGCAGGAAA CTTAGTTCTT TATGGTGCCT TCTTTGCTAT CTACCCTTTT TCTATTGTTT ATGAAATCGG TACCAGATAT AAACAAACTT GGAGAGCACT CAAAAGTCTT TACCAAAGTT TCAGAAACTT TAAGAGATCT ACATATGAAG GTTACACTGA CCCACACTCC ACTATGATGA GAGCTTATAA GGAAGTCCCA GATTGGGTAT TCTTGGTAGT TTTAGTGATT TCTCTTGTGT TGGCTATTAT TTGTGTCGAA ATCTACCCTG CTGAAACTCC TGTTTGGGGT ATTTTCTTTG CCTTGGGTAT CAACTTTGTT TTTTTAATTC CAATAACTGC AGTTTACTCC AGAACTGGTT TTAGTTTTGG ACTCAATGTC TTGGTTGAAT TGATTGTTGG TTATGCTCTT CCCGGTAACG GTCTTGCTTT GAACTTTATC AAGGCGTTTG GTTACAACAT TGACGGTCAG GCACAAAACT ATATCACTGA CCAAAAGATG GCTCACTACT CCAAGGTTCC TCCAAGAGCT TTGTTCAGAG TTCAAATTAT TGGTGTCTTT ATTGCCTCGT TCGTTCAACT TGGTATAATT AATTTCGTTA TTGACAATAT TAAAGACTAT TGTGAGCCAT ACAACACACA GAGATTCACT TGTCCAAACT CTAGAGTCTT TTACAGTGCT TCTATTCTTT GGGGTGTTAT CGGACCCAAG AAGGTCTTCA ACGGATTATA CCCAATTTTG CAATACTGTT TCTTAATTGG ATTTTTGTTG GCTATTCCTG CAATCGCGTT CAAAAAGTAC GCTCCAATTA AGTACACCAA GTACTTTGAA CCTACAGTAG TAATCGGTGG TATGTTGAAT TATGCTCCTT ACAATCTTTC CTACTTGACA GGTTCATTTT ATGCTTCCTT TGCCTTTATG TATTACATTA AGAACAAGTA CCAGGCATGG TGGTATAAAT ACAACTATCT CACAACTGCA GGATTGACTG CTGGTGTGGC TTTTTCTTCT ATCATCATTT TCTTTGCCGT TGAGTACCAC GACAAGAGTA TTTCATGGTG GGGTAACAAC GTTATATATG GTGGTATCGA AGGTGGTTTG GGTCAACAGT CAAGATTGAA TGCTACCGCA GAAGCTCCAG ATGGTTATTT CGGTCCAAGA AGAGGAAACT TCCCATAA
|
Protein sequence | MSQVEKKEEL NNIVSIRSVT SHLQLEDHEV DLRAITSNPV SIGEIGVSLT DEQKHFILKR LHLNGLESFE QLPPQAAFYI DKIEKMGEDE ALTIVKEALV EHHDDANIPV EDIELWTNLV EIGNSKTSVS EGGKYEDFTH NVVDWDLQVR LEAVLVAYHS PYPQVRAVTD PYDDHTIPVE TFRVYLLGII WTAIGAVINQ FFAERQPGIY LDPTVVQVLL YPSGMLLEYI LPKFKFKIWK YSIDLNPGPW NYKEQMLATL FYSVAGGGAS YVSYNIHVQK MKVFYDNKWV NFGYETLLIL SNNFLGFGFA GVFRRFAVYP TEAIWPTVLP TLALNRALMV PEKKEIINGW KIPRYTYFFI LFAASFVYFW VPDYLFYALS VFNWMTWIKP YNFNLAAITG SNFGLGLNPI PTFDWNMISF NAPLIFPFYT QLNTYIGALI GFFAIVGVYW TNYKWTGFLP INSSSIFTNT GDYYAVTEIL NEKSLLDEKK YQEYGPPFYS AGNLVLYGAF FAIYPFSIVY EIGTRYKQTW RALKSLYQSF RNFKRSTYEG YTDPHSTMMR AYKEVPDWVF LVVLVISLVL AIICVEIYPA ETPVWGIFFA LGINFVFLIP ITAVYSRTGF SFGLNVLVEL IVGYALPGNG LALNFIKAFG YNIDGQAQNY ITDQKMAHYS KVPPRALFRV QIIGVFIASF VQLGIINFVI DNIKDYCEPY NTQRFTCPNS RVFYSASILW GVIGPKKVFN GLYPILQYCF LIGFLLAIPA IAFKKYAPIK YTKYFEPTVV IGGMLNYAPY NLSYLTGSFY ASFAFMYYIK NKYQAWWYKY NYLTTAGLTA GVAFSSIIIF FAVEYHDKSI SWWGNNVIYG GIEGGLGQQS RLNATAEAPD GYFGPRRGNF P
|
| |