Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_50925 |
Symbol | |
ID | 4841145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 731194 |
End bp | 734100 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640392460 |
Product | predicted protein |
Protein accession | XP_001386548 |
Protein GI | 150866825 |
COG category | [R] General function prediction only |
COG ID | [COG3568] Metal-dependent hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.766697 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAGA AAAAAATCTC GGCCTCGGAA TCTTCGCCGA TCGCGTCCAC GGCTGCTAAG AAGTCTGAAA TTTTCAGATT CAATGCTTCC TATATAGCCA TTCTACACAC CATTACGGCG TACTCAGCCT TCATTGCTGC ATTGATCGTG GGAACATGGC TTCATTACCA TAAGATTGTC CAGAACTCGT CGTTCGGCTA TCCTGACGAG TGGTTCCCTT CTGTCTCGGC TACAATCGGC GATAGATACC CAGAAAGGTC GATTTTCCAG GTTCTCATCG CAGTTACAGC TGGCCCTCGA TTTCTTCTTT TGATCTTTGA TTTCATCAAC CTTTACAGGA AAGGATCCGT GCTACCATAC ATTGGCTTTT TCTCGGGTCT TCTTCGTACC TTCACCTGCG GTGGATGGGT GTACATCACA TCAACAGACG ATCACGACTG GCACGATATC TTCATGATCA GCTACATCGT CTTGACCATT CCATGGACCG TTTGCATCAC TATATTGTCT AAGCCAGGCT CATTTGAACG CCAAGGACGT TTCTATACCC TGACGTCGTT TTTCGCTATG CTCGTGCCTC TTGTCTACTG GTTCATTCAA CATAAAGTCC ATGTGAGACC TGGGGCCTAT TCCATTTATG CCTACTTCGA ATGGGCATTG ATTTTGCTCG ATGTAGGTTT TGATGCTTGG TCTATTGTAG ACTTCAATAA TATTGACATT GCCATCTCTG GTCAAGGCAT TGAGCTCACT GGTGCTAAAC CCAAGACTGC CGTAGACAAG AAGAAACAAG ACGAGTTGCT CCACGATGAG TTCACCGATT TCGAGTTCTA TATCTCCAGC ATTAATGCGT TTGTCTTCTG GTCAGTAGTC ACTTCTTTAT TTTTGTGTGT TTGGTACTTT CCCTTGTGGC ATATGGGTAT CTCTGGCTAT GAGGCTGCGG TTTTCTCTCT CTTTTTGTCT CCCGCCATCT TGATTATCCC CTTTGTAAGA AACTTCATTT CCAACTACCC ACTGCTCACG AAAACCTTAA CGGTGTTGTT GGGTATTGGT GCTTACAAAG TTCACGAACC TGAAAACAGA CTCCTCACCA TCACTGCTGG TACTTCATTT GCCATAGTAT CTTTAGTGGT AGACGTCTGG TCACTCTCTA ACCAACCAAA GAAGTTTAAC TCCTACATTA TCTCCATCTT GATTGGTTTG TTGGGCACTT CAGTATTCAA GTTCTTGTGC TTTTCCAACA ACCCAATTTG GCCCATTATG CACAAGGAAA ACGGTGGTTA CAACGAAGTG GGAATCTTTG TAGGTTTGAT AGGTGCATTC TTTACTCCGC AGCTTCATCT GATAAACTCT ACAACTCATA ATGTGAAGAG AGCTGGTGGA TCGTTCTTCC TTGCTGCTTT GGGAATTGGT GGTTACTTCT TCTCCATCGA ATCTTACTTG AGTGACACAT CTACGTTGGC ACTCTGGACT TGGGAAGGAT ACCCAGTAAA GGGACCTCTT CCCGTTACTG GTGCTCTTTT CAATATTGGA GCGGTTGTTT TAGGTGTTCT CGCGGCCATT TCAATCCCAT CTAAGTTTTT CAGCAACAAG TTCTACAACT GGTTCATTGG TGGCGGTAGT GCATTTGTTC TCTACTATTA CAAGGGCTGG ACTGGGTATG CTGGAGCTAC TGTATATTCA TTTTACTTGA CTTCTATTGC TCCATTGATT TGGCAGTCTT CTATTGGCTA TAACCCAAGC TTGTTGTTCT TTGTAGGCTT CTTCGTTCAC GTCTTCCTCG GCTTGGCTAG TACGTGGATT GTTGCATATG CTTTCGTTCC AGGTGGACCA TTATTGAGAG AGAGAACTGA TATCATCTTG GGAACATCCT TTGTTCTGAT CTTAGCTGGA ATTTGGAATT ACAACTTGAG GCGTCGTAGT GGTGAAGCTG TCAAGATTGA CTTCCATGGT AAGAAATTGT TTAGACAGGC TTGGACAATC TTGACTGTTT TGTTAGCTAT TTCTATTTCT GCTTTCTTCC AAAGATATGC TGTTGAATCA TTCAAGCCAT ATAATGCCGA GTCTAAATCG TTTACCACTG GTATCTGGTG TGTTCATTTT GGTTTGGATA ACGACATGTG GTCCAGTGAA GTCAGAATGA GAGATTTGAT CAGAGATGCT GAGATCGATA TCATTGGTTT ACTTGAAACT GACACCCAAA GATTGATTGG GGGCAATCGA GACTTCACCC AAAGGATTGC TGAAGATTTG GGAATGTATG CCGACTACGG CCCCGGACCC AACAAGCACA CCTGGGGAGC TGCCTTACTT TCGAAGTTCC CAATCATCGA GTCTAGTCAT CATTTGTTGC CATCTCCTGT AGGCGAACTT GCTCCGGCTA TCCATGCCAC TTTGGATATT TATGGTGAGT TAGTTGACGT AGTTGTTTTC CATTCTGGTC AGGAAGAGGA CGTGGAAGAT CGTCGACTTC AAAGTTTGGG TATTCAAGAA ATCATGGGTA ACTCCACAAG GCCTTTGGTG TTATTGAGTT ACTTGGTGAC CACTCCTTTG GAAGGTAATT ATAATACCTA TGTCAGTGAG AAATCGAGAA TGTACGATAT CGACAACTCT GACTGGGATA GATGGTGTGA ATACATTTTG TTCAGAGAAT TGAAGAAGGT TGCCTACGCT AGAATTTCTC GATCCACCAT TACTGACACT GAATTGCAAG TGGCTAAATT CAAATTGTTA ACCAACGAAG AAAAGCAAGA AATTGACGAA CCTTTCTTGT ACGGTAACAA CTACGTCAAT GAAGATGAGA TTGACCAAAA TTTGCGTATG CCTCAAATTT TCAGAGGTGA TGGTGTTAGA GGTCACAGAT ATCATGTCTT TGACGAGCCG CGTTACTTTG CCCAAGAAAA GAATTGA
|
Protein sequence | MSEKKISASE SSPIASTAAK KSEIFRFNAS YIAILHTITA YSAFIAALIV GTWLHYHKIV QNSSFGYPDE WFPSVSATIG DRYPERSIFQ VLIAVTAGPR FLLLIFDFIN LYRKGSVLPY IGFFSGLLRT FTCGGWVYIT STDDHDWHDI FMISYIVLTI PWTVCITILS KPGSFERQGR FYTSTSFFAM LVPLVYWFIQ HKVHVRPGAY SIYAYFEWAL ILLDVGFDAW SIVDFNNIDI AISGQGIELT GAKPKTAVDK KKQDELLHDE FTDFEFYISS INAFVFWSVV TSLFLCVWYF PLWHMGISGY EAAVFSLFLS PAILIIPFVR NFISNYPSLT KTLTVLLGIG AYKVHEPENR LLTITAGTSF AIVSLVVDVW SLSNQPKKFN SYIISILIGL LGTSVFKFLC FSNNPIWPIM HKENGGYNEV GIFVGLIGAF FTPQLHSINS TTHNVKRAGG SFFLAALGIG GYFFSIESYL SDTSTLALWT WEGYPVKGPL PVTGALFNIG AVVLGVLAAI SIPSKFFSNK FYNWFIGGGS AFVLYYYKGW TGYAGATVYS FYLTSIAPLI WQSSIGYNPS LLFFVGFFVH VFLGLASTWI VAYAFVPGGP LLRERTDIIL GTSFVSILAG IWNYNLRRRS GEAVKIDFHG KKLFRQAWTI LTVLLAISIS AFFQRYAVES FKPYNAESKS FTTGIWCVHF GLDNDMWSSE VRMRDLIRDA EIDIIGLLET DTQRLIGGNR DFTQRIAEDL GMYADYGPGP NKHTWGAALL SKFPIIESSH HLLPSPVGEL APAIHATLDI YGELVDVVVF HSGQEEDVED RRLQSLGIQE IMGNSTRPLV LLSYLVTTPL EGNYNTYVSE KSRMYDIDNS DWDRWCEYIL FRELKKVAYA RISRSTITDT ELQVAKFKLL TNEEKQEIDE PFLYGNNYVN EDEIDQNLRM PQIFRGDGVR GHRYHVFDEP RYFAQEKN
|
| |