Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_50085 |
Symbol | |
ID | 4840436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 563841 |
End bp | 566576 |
Gene Length | 2736 bp |
Protein Length | 911 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640391751 |
Product | predicted protein |
Protein accession | XP_001386124 |
Protein GI | 126139203 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00727] small oligopeptide transporter, OPT family [TIGR00728] oligopeptide transporters, OPT superfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.100109 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACAG AAGAAAAGAA TACTACAGAT AAAAAGGAAA AACTTGACAA TATCACCAGC ATTCGAAGTG TCGGAAGTTA TTTGCAACTT GAGGATCATG AAATTGATTT GAAAGCCGTG ACTTCGAATC CAATTTCACT TGGCGAAGTT GGTGCCTCGC TTACAGATGA TCAGAAATTG CTTATTTTGA GGAGAGTCCA TCTTGATAAG TTAACTTCAT TTGAAGAATT GCCTCCCCAA GCTGCTTTCT ATATACAAAA GGTTGAACAC TTGACCACCA CTGAAGCCCT CACAATCTTG AACCAAGCAG TGATTGATCT TGATGCAGAT GCTAATTTCC CAACCAAGGA CTACAATTTG TTGACAAGCT TGGTGCAGAG TTCCGAATCG AAACAATTTA ATGCCAAAGA AAAGTTGGCG TCTGCCTTGG ATGGAAGCTC ATCAGAACAA TCATTAGAAA AGGATTATGA TACCCATTCA ATAGTTGATT GGGACTTGCA AGTTAGATTG GAAGCTGTTT TGATTGCTTA CCACTCCCCA TACCCTCAAG TGAGAGCAGT CACTGATCCG TATGATGACC CAACCATTCC TGTTGAAACC ATTAGAGTTT ATATTCTCGG CATAATTTGG ACTGCCATTG GAGCTGTTAT CAATCAATTC TTTTCAGACA GATTGCCAAG TATATCTTTA GATCCTGCAG TTGTTCAAGT GTTTCTCTAT CCTTGCGGTA TCCTATTGGA ATATATCTTA CCAAAGAAGA AGATCAAGAT TTGGAGGTAT ACAATTGATC TCAACCCTGG ACCATGGAAC TACAAAGAGC AGATGTTAGC GACTGTATTC TATTCCGTAT CCTGTCCTAT TGGTACGAGT TACGTTTCTT CCAACATTAC TGTTCAAAAG ATGGAAATGT TCTACAACAA TAAATGGGTT GATTTTGGAT ATCAAGTTTT GTTGATATTA TCGAATAACT TTTTGGGGTT TGGATTCGCG GGTATCTTTA GAAGATTTGC TGTGTACCCC CCCGAAGCCA TCTGGCCCAG TGTGTTGCCA ACTCTTGCTC TAAATAGAGC ATTGATGGTC CCTGAAAAAA AAGAAATCAT TAACGGATGG AGAATATCTA AATATAATTT CTTTTTCATT ACATTTGCTG CCAGTTTTGT TTACTTCTGG ATTCCTACCT ATCTTTTCGC TGCTTTGTCA ACTTTCAATT GGATGACTTG GATCAAACCT TATAATTTCA ACTTGGCCGC CATAACTGGA ACTAATTTTG GATTGGGATT GAATCCAATC CCTACTTTTG ACTGGAATGT GATTAATACT AATTCACCTT TGGTTCTTCC ATTTTTCACC CAAATTAACA ACTATATCGG AGTCTTGATT GGCTTTATTG CGATTGTAGG TGTCTACTGG TCCAACTACA AATGGACAGG CTTCCTACCA ATCAATTCAA ATGCTGTTTT CACTAATACT GGTGAACCGT ATGCTGTCAC AGAAGTAGTG GACGGAAATA GTTTGCTTGA TAATGAGAAG TACCAACAAT ACAGTCCACC ATTTTATACT GCTGGAAACT TAGTGGTTTA TGGTGCCTTC TTTGCTATCT ACCCGTTTTC AATTGTCTAT GAAATTGGGA GTAGATACAA ACAGACATGG AAAGCACTCA AGAGTGTTTA CAGTAGTGTT AGAGATTTCA AGAGAGGCGC ATACGAAGGA TTTGATGATC CTCACTCTAA GATGATGACT GCTTATAAGG AGGTTCCCGA TTGGCCCTTC TTTGTGGTCT TGGTAATATC TCTCGTTTTA GCAATCATCT GTGTAAAGAT TTATCCTGCT GAAACTCCTG TCTGGGGGTT GTTCTTTGCT TTGGGAATCA ACTTTGTCTT TTTGATTCCA ATCACTGCCA TCTACTCCAG AACTGGTATC GGGTTCGGTC TTAATGTCTT AGTTGAATTG ATCGTTGGGT ATGCTATTCC TGGGAATGGT CTTGCGTTGA ACTTCATCAA GGCATTTGGT TACAATATTG ATGGTCAAGC TCAAACCTAT ATTACTGATC AGAAGATGGC CCACTATGCA AAAATTCCTC CTAGAGCACT TTTTAGAGTT CAGATTCTTG GAGTTTTCAT CGCGTCCTTT GTTCAACTTG GTATCTTGAA TTTTGTTCTT ACTAATATTG ATAATTATTG TGATCCTCAC AATAAGCAAA AGTTTACTTG TGCAGGGTCT AGGACTTTTT ACAGCGCTTC TATTCTTTGG GGTGTTATTG GTCCGAAGAA GGTGTTCAAT GGTCTTTACC CTATCTTACA ATACTGCTTC TTAATTGGAT TCTTAATTGC AATCCCAGCT GTCGTATTTA AGTTTTATGC TCCAAGAAAG TACACTAAGT CTTTCGAACC TTCAGTTGTA ATTTTAGGAG TTATGAGCTT TGCTCCTACC AACCTCACAT ATTATACCGG AGGCTTATAT GCATCTATTG CTTTTATGTA CTATGTGAAG ACTAGATATG AGGCATGGTG GCAGAAGTAC AACTACCTCT TGTCTGCTGC TTTGACGGCT GGTGTTGCTT TTTCCGCTAT CATCATTTTC TTTGCTGTGC AATACCATGA TAAGAGTATT AACTGGTGGG GTAACATTGT TCCTTACGAG GGGATCGATG GTGGCTACGG ACAGCAATCC AGGTTAAACG TTACAGAGCT AGCACCAGAT GGATATTTTG GTCCAAGAAT TGGAAATTTC CCTTGA
|
Protein sequence | MSTEEKNTTD KKEKLDNITS IRSVGSYLQL EDHEIDLKAV TSNPISLGEV GASLTDDQKL LILRRVHLDK LTSFEELPPQ AAFYIQKVEH LTTTEALTIL NQAVIDLDAD ANFPTKDYNL LTSLVQSSES KQFNAKEKLA SALDGSSSEQ SLEKDYDTHS IVDWDLQVRL EAVLIAYHSP YPQVRAVTDP YDDPTIPVET IRVYILGIIW TAIGAVINQF FSDRLPSISL DPAVVQVFLY PCGILLEYIL PKKKIKIWRY TIDLNPGPWN YKEQMLATVF YSVSCPIGTS YVSSNITVQK MEMFYNNKWV DFGYQVLLIL SNNFLGFGFA GIFRRFAVYP PEAIWPSVLP TLALNRALMV PEKKEIINGW RISKYNFFFI TFAASFVYFW IPTYLFAALS TFNWMTWIKP YNFNLAAITG TNFGLGLNPI PTFDWNVINT NSPLVLPFFT QINNYIGVLI GFIAIVGVYW SNYKWTGFLP INSNAVFTNT GEPYAVTEVV DGNSLLDNEK YQQYSPPFYT AGNLVVYGAF FAIYPFSIVY EIGSRYKQTW KALKSVYSSV RDFKRGAYEG FDDPHSKMMT AYKEVPDWPF FVVLVISLVL AIICVKIYPA ETPVWGLFFA LGINFVFLIP ITAIYSRTGI GFGLNVLVEL IVGYAIPGNG LALNFIKAFG YNIDGQAQTY ITDQKMAHYA KIPPRALFRV QILGVFIASF VQLGILNFVL TNIDNYCDPH NKQKFTCAGS RTFYSASILW GVIGPKKVFN GLYPILQYCF LIGFLIAIPA VVFKFYAPRK YTKSFEPSVV ILGVMSFAPT NLTYYTGGLY ASIAFMYYVK TRYEAWWQKY NYLLSAALTA GVAFSAIIIF FAVQYHDKSI NWWGNIVPYE GIDGGYGQQS RLNVTELAPD GYFGPRIGNF P
|
| |