Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_63741 |
Symbol | |
ID | 4840490 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 542909 |
End bp | 545650 |
Gene Length | 2742 bp |
Protein Length | 913 aa |
Translation table | 12 |
GC content | 38% |
IMG OID | 640391805 |
Product | predicted protein |
Protein accession | XP_001386300 |
Protein GI | 150866635 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00727] small oligopeptide transporter, OPT family [TIGR00728] oligopeptide transporters, OPT superfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.294425 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAAGG TAGAAAAGAA GGAAAAGCTT GACAATATAG TCAGTATTGG CAGTGTTACA TCTCACTTGC AATTAGAAGA TCATGAAGTG AATTTAAGAG CAATTACTTC TAATCCAGTC TCAATTAATG AGATTGGTGT TTCACTTTCA AATGAGCAGA AGCATTTTAT CTTGAAGCGA CTTCATTTGG ATGGCTTGCA ATCACTTGAC GAGTTGCCAC CACAAGTCAT CTTTTATTTG GATAAAATTG AGAAATTGGG AAATGATGAA GCCTTGAGTA TTTTGAAAGA GACCCTTGTT GAACATCATG ACGACACCAA TATTCCAACC CACGATTTAG AATTATGGAC TAAACTAGTT GACATTGGAG CTGAGAATTC CGGTACGAGA GTAAAGTTAG ATTCTGCTCT CAACGAGAAG AATAATGAGA GTGGTGAGCT GTCTTCTCAG GAAGTTTTTG AAGGTGGTAA ATACGAATCT AGTATTCACC AAGTAGTTGA CTGGGACTTG CAACTTAGGT TGGAGGCAGT TTTGATTGCT TACCACTCTC CTTACCCTGA AGTGAGATCT GTTACAGATC CATTTGACGA CCCCACGATC CCAGTTGAAA CCATCAGAGT TTACATTCTA GGTATCGTTT GGACTGCCAT TGGTGCAGTT ATTGATCAAT TCTTTTCTGA AAGACAGCCA GCAATAGCAT TGTATCCAGC TGTAGTTCAA GTTTTCTTGT ATCCTTGTGG TTTATTGTTG GAATATGTCT TGCCGAAATA CAAGTTTAAA ATATGGAAGT ACACTATCGA CCTCAACCCA GGACCATGGA ACTATAAAGA ACAGATGTTA GCATCGTTGT TCTATTCTGT TACTGGGGGA GCGACAAGTT ATGTTTCTTA TAATATTCAT GTTCAAAAGA TGAAAATGTT CTACGATAAT CAATGGGTTG ATTTCGGATA CCAAACATTG TTGATTTTAT CAAACAACTT CTTGGGTTTT GGATTCGCTG GTATCTTCAG AAGATTTGCA GTTTATCCAG TTGAAGCAAT CTGGCCAACA GTATTACCCA CTCTTGCCTT AAATAGAGCT TTGATGGTGC CAGAAAAGAA AGAAATTATA AACGGTTGGA AGATTGCAAA GTACACATTC TTTTTCATTG CATTCGCAGC TTCGTTCGTT TACTTTTGGA TTCCTAATTA CCTATTTGGT GCATTGTCTA CTTTCAACTG GATGACGTGG ATCGCTCCTT TCAATTTTAA CTTAGTCGCC ATCACTGGAA CTTTTTTTGG TTTAGGACTA AACCCAATAC CCAGCTTTGA CTGGAATGTT CTCAGCTTGA ATGCTCCATT GATATATCCC TTTTATTCAC AATTGAATAA CTACATTGGT AACATTTTAG GGTTTTTGGC TGTCGCTGGG GTCTACTGGA CAAACAGCAA GTGGTCAGGA TACCTTCCAA TCAATTCACC ATCTCTCTAC ACAAATACAG GAGAGATATA TCGAGTTACT TCTGTTGTCA ATGAAAATAG TTTATTCGAC ATTGAGAAGT ATCAAGAATA TGGACCACCT TTCTACACTG CTGGAAACTT GGTTGCTTAT GGCTCCTACT TTGTTCTTTA TCCCTTTTCT GTTGTTTATG AAATCGGTAC TAGATACAAA CAAACTTGGA GAGCGTTCAA GAGTCTCTAT TTGAGTTTTA GAAACTTCAA GAAATCTACG TACGAAGGCT ATAATGACCC ACACTCCACC ATGATGTCAG CTTATAAGGA AGTTCCAGAT TGGGTATTTT TGGTTGTTTT GGTGATTTCT CTTGTGTTGG CTATTATTTG CGTTGAAATA TACCCTGCTG AAACTCCAGT TTGGGGTTTA TTCTTTTCTT TGGGTATTAA TTTCGTTTTC TTGATTCCAA TTACTGCTGT TTACTCCAGA ACTGGTTGGG GCTTCGGACT TAACGTCTTG GTTGAATTGA TTGTAGGTTA TGCTCTTCCG GGTAACGGTC TTGCATTGAA CTTCATTAAA GCATTTGGAA CCAACATTGA TTCTCAAGCT CAGAACTATA TCACCAACCA AAAGATGGCT CATTATTCCA AGGTTCCTCC AAGAGCTCTA TTCAGAGTTC AAATCATTGG TGTTTTCATT GCCTCATTTG TTCAACTAGG TATTATCAAT TTCTTAATTA ATAATATTGA AGATTACTGT GATCCACATA ACAAGCAAAA ATTTACTTGT CCACATTCAA AAACATTCTA TAACGCCTCT ATTATTTGGG GAATTATTGG ACCTAAGAAA GTTTTCAATG GGTTGTACCC AATCTTACAA TATTGTTTCT TGATTGGATT TTTGTTAGCC ATTCTTGCAC TTGCCTTCAA GAAGTTTGCT CCTTTGAAGT ACACCAAATT CTTCGAACCA ACATTAGTAA TCGGTGGTTT CATTAACTAC GGTGCTTACA ACCTTTCATA TTATACTGGT TCTTTTTACC TTTCCGTTGT CTTTATGTAC TACATTAGGA ACAAGTACGA AGCATGGTGG CAGAAATACA ACTATCTCTT GTCAGCTGCC TTGACTGCCG GTGTTGCTTT TTCTTCTATC ATCATTTTCT TTGCAGTTCA ATACCATGAC AAGAGCATTT TGTGGTGGGG AAACAGTGTT ATGTTCGGTG GTATAGAAGG CGGGTACGGA CAACAGTCGA TATTGAATGT CACCGAGGCT CCAGGGGGAT ACTTTGGTCC AAGAATTGGA AATTTCCCAT AA
|
Protein sequence | MSKVEKKEKL DNIVSIGSVT SHLQLEDHEV NLRAITSNPV SINEIGVSLS NEQKHFILKR LHLDGLQSLD ELPPQVIFYL DKIEKLGNDE ALSILKETLV EHHDDTNIPT HDLELWTKLV DIGAENSGTR VKLDSALNEK NNESGESSSQ EVFEGGKYES SIHQVVDWDL QLRLEAVLIA YHSPYPEVRS VTDPFDDPTI PVETIRVYIL GIVWTAIGAV IDQFFSERQP AIALYPAVVQ VFLYPCGLLL EYVLPKYKFK IWKYTIDLNP GPWNYKEQML ASLFYSVTGG ATSYVSYNIH VQKMKMFYDN QWVDFGYQTL LILSNNFLGF GFAGIFRRFA VYPVEAIWPT VLPTLALNRA LMVPEKKEII NGWKIAKYTF FFIAFAASFV YFWIPNYLFG ALSTFNWMTW IAPFNFNLVA ITGTFFGLGL NPIPSFDWNV LSLNAPLIYP FYSQLNNYIG NILGFLAVAG VYWTNSKWSG YLPINSPSLY TNTGEIYRVT SVVNENSLFD IEKYQEYGPP FYTAGNLVAY GSYFVLYPFS VVYEIGTRYK QTWRAFKSLY LSFRNFKKST YEGYNDPHST MMSAYKEVPD WVFLVVLVIS LVLAIICVEI YPAETPVWGL FFSLGINFVF LIPITAVYSR TGWGFGLNVL VELIVGYALP GNGLALNFIK AFGTNIDSQA QNYITNQKMA HYSKVPPRAL FRVQIIGVFI ASFVQLGIIN FLINNIEDYC DPHNKQKFTC PHSKTFYNAS IIWGIIGPKK VFNGLYPILQ YCFLIGFLLA ILALAFKKFA PLKYTKFFEP TLVIGGFINY GAYNLSYYTG SFYLSVVFMY YIRNKYEAWW QKYNYLLSAA LTAGVAFSSI IIFFAVQYHD KSILWWGNSV MFGGIEGGYG QQSILNVTEA PGGYFGPRIG NFP
|
| |