Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_68531 |
Symbol | |
ID | 4841150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 755469 |
End bp | 758850 |
Gene Length | 3382 bp |
Protein Length | 1114 aa |
Translation table | 12 |
GC content | 37% |
IMG OID | 640392465 |
Product | predicted protein |
Protein accession | XP_001386553 |
Protein GI | 150866828 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.134601 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0417383 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGGA ACGGTTTAGA TGACCGTGTG AATAGATTGT ACTCGTTGAG CAGACGATGG TATCCAGAAT TGAAGAAAGA CTTCTCCAAT ACACTCATCA ACAATGTACA CTCTGAGCTC TACGATCTGG AGAATCTGTA CTTGCCCCAG ACTTCATGTC TTGAAGTATT GTTAGAAACA AAATACTTCG AGAAAGTGCT ATGGGCGAAC TTTAATGAAG ATGTCACAAC GACCCACATT GAACAAATTT TGCATTTGGA CTGGCTCTCG ACCTATTTTG AATATGAGAA GTTTCGTCAC AATTCAATCT CTAGTATGTT GAGGAGCGAA GACAACATCA ACTTTATCAT TAGAATCTTG GAAATTACCT TCAAACTCAC AGCATCTACC AATTATAGAC TCAATTCCCT CGTATTGCGG TTTTTCAGTA TCTCCAAACC CCTTCCTCTG TGTTTGGATG AGATAACAGA TATAATCATA TGGAATAATG TACCTAGTCT AACGTCGAAT CTCGCCAGCC CCTACAAAGA ACGCTTAAAA GAAGCAGTTG TTAAATTTGA AAAGATAGAA GATGCTTCTC AAAGAAGAAT ATCAACGCTC ACAAACAAAT GGCTCTTTAA TTTATTGCAT GATACAGCTA GACAACACAT ACTAATATCC ACCAAATCGT CCATTCCTCC ATTTTACTTC GAGTATCTAA ATGAACTTTT AAATTTCTTA GCTTTTCTCG TTTTTAACTA TCCGGAGAAA GCAGATGAAT TTATTTTACA GTCCAACATA GTCTCTATTG TTTCATTCAA TTCGTCCCTC AAGAATCTGC TAGAACAAGT GAAGCAGAAA TTTCTTTCCC ATTTTGCTAC AAAGCAGAGT AGCTTCGAGA TACTACAACA TTTAATTTAC AATAGAACTA AGGTTGTTGT AGATGTTGAT ATTGCTGTTC CACATAACAT AGCTATAGAT GAGATTTCAA AGCATTTACA GAATTTTAGT ATTCTAGATT TAGCATCATT GGCTAGTGAT TTCAGATCAA TTAAGGATCC ATTGGTTCTC TTTGGGTCTA CAGAGTTGGA TGATAAGATG AAACTTAGTA TTTTAGTTCA AACAGTATGT GATGGAATTG CTCAACCTAA TGAATCTATC TCAAAATTTA TTTCTAAGTT GGATGAATAC GACTTGATGG ACAATCCAAA GTTTTCGTAC CCTCGAGGAT GTATTTCTTC CATTCTCAGT CCATACAGAT ATCCATTTGC GAATAAAAAT AATTCGTCTT TCACAATTAA GAATCTTCAG GAAGAGTTTC AAATTTATCT TCATCAACAT ATTTCAGGAG TATTGGAAAG GCTACGGATT GATCCAAAAA GTGGCATTCA AGGCAAGAGC AAATATTTCC ACAAAGTTGA ATCACTTGAG AATGTTAATG GAAGTACATT TTCTATGAAA ACCAAATCAA TTGTCCCTGC ATCAATACAG TTTATCGTCT TGGTAGAGAT GTTAAAACCA GTACAGTATT CTAAACAGAA GCGTATGAAG GAATTTGGAG TTAATGTGAT TAGAATCGTA CAAATCAGTT CGAATTCTGT GGATGGAAAA GATGCCACTT TCAAGTTCAC TTTTAATGAA GAAATTCAGT CATTTTCAAG TAGAATAAAT CATTTTATTT CTGTTCCGTT TTCGGTGCCG GGAAGTTCAC TCCTTGCACT TTCAGACAGT AAAGCTGTTC CAATACAAAA AAAGAATGAA TCAATATCTC CAGTAGACTA CCAAACTCCA TTGGGAATCT CCATTTTGTC TGAAACAAAG AGTGAAAATG GTAAAGACCG ATCGCATGAG AACGAATTAT GGGAAGTTGA AAGAAATGGA GACATATTGA GCAGTTATCA AAGAGATATT ATGGCAAATA TTATTAAAGG ACATGGCAAA GCGTTCAGGT TTGAGAAAGG GTGTGGCATG AAAAAGTTGA TTTCACTAAT CTTGACTCTG AATCTTAGTA ATTCCCGATG TCTTGTCATA GTACCCAGCA GAAATTACTC AAGACAAATT CCTGCTTCTA TTTTAGAAAA TCGAGTCTCA TACTTGAGAT ACGGTAGCGA ACAGGACATT CGGTCTCTCA GCAATTTTGT CAAGGAAACA TATAAAGCGA CGATCGGTTT AATAGGCGAG AAATGCGAAG TGGAAGAGAA TGAGCTGGTA TCCATTGCTG GTATTTACAA GTTCGAACAA CGTCTTAAAT TGGAGTGGTC TAAGATTGCG AACTCTTTAT CTCTAGGAGT AGAGACAAAA AATGAATTGA AGAGCGCCTT CGCTTTCTTT AGTGGCGCTG ATTCACCAGT GCCACAATCG GTGAACTTCA AGATGGTATT CAAGGCATAT AAGAAACGCC GATATGCCTT AGCACTTGCT GCTACATTAA TTCCGATTAT TGAATTGCAT AGCAAGGGGA GAAACGATGA CGTATGGAAC CTTTTGTTTC GCAAGTTCAG TTCTATCATT GCTTATGAAG ACTATTTGAA CTTATTACAA AACAATTCTC ATCATAATAT GAACAATTTC GATAATATAA TTGTCGTCAA CGGATGGCCG GGAGCAGCAT TAGTGGCGGG TTTGAAAAAT ACCCATACAA GGAAAGTAGT TGAAATTGGT GCTAGACATG TGCTCAGTTC GACTAAATTA GGCGAGCAGG AACCAGTATT GTTTCAGTGG AGGCCAGAAT TTATACCAAT CTCTAAACAG AATCTCAAAC TCGTTAACAA GAAGATTAGA AATTTCAACC CTGGTTTGAA GCATGTCTTT CAAGTTATAT ATACCGAGGA TCAAGTCGAT AGCATGGAGT ATAGTGTATT GTTGTATCAA TACTTAAGAC TTCTCGGATA TCCGTCATCA AAGATATGCA TTTCTGTCGG TTCTCTTTTG CATAGAGCAC TCTTAGAGGA AGTCTTGAGT AAGCATTGTA CAAAGATTAG CAAAGATAAG TCTATCAAGT CAGCCAATGA AAGTAGTGAT GATCCAAAAG ATTTCCAATT CGGATGGCCC GACATCTTAA TTTATGATGA TCCAGATTAC TACTTTGATA CTTATGAATA TGGTATCATT TCAGCAGTCA GTCAGACTGC AAACAGGTTG GAAGTCATTA GTTTACCAGG AAGATTTGGC AATTACATTA TTGGACTGCA AATATATTCA CATGAGTTCG AGCTTCCTGA GGTGGCTGAT CTTGAAGTTG TTGTAGGAGA GAACTACAAT ACTGAAGTGC GGAAGCAGTC GCAGCTGTAT CCTATCGAAA GCAAAGACCA TTTTGAACAG TACATTCACA ACATGACAAA GGTAAGGCTC GGGCACAAGA AGTAGCATAC ATAATAGAAC TAATAGAATA AAATACCATT TT
|
Protein sequence | MSRNGLDDRV NRLYSLSRRW YPELKKDFSN TLINNVHSEL YDSENSYLPQ TSCLEVLLET KYFEKVLWAN FNEDVTTTHI EQILHLDWLS TYFEYEKFRH NSISSMLRSE DNINFIIRIL EITFKLTAST NYRLNSLVLR FFSISKPLPS CLDEITDIII WNNVPSLTSN LASPYKERLK EAVVKFEKIE DASQRRISTL TNKWLFNLLH DTARQHILIS TKSSIPPFYF EYLNELLNFL AFLVFNYPEK ADEFILQSNI VSIVSFNSSL KNSLEQVKQK FLSHFATKQS SFEILQHLIY NRTKVVVDVD IAVPHNIAID EISKHLQNFS ILDLASLASD FRSIKDPLVL FGSTELDDKM KLSILVQTVC DGIAQPNESI SKFISKLDEY DLMDNPKFSY PRGCISSILS PYRYPFANKN NSSFTIKNLQ EEFQIYLHQH ISGVLERLRI DPKSGIQGKS KYFHKVESLE NVNGSTFSMK TKSIVPASIQ FIVLVEMLKP VQYSKQKRMK EFGVNVIRIV QISSNSVDGK DATFKFTFNE EIQSFSSRIN HFISVPFSVP GSSLLALSDS KAVPIQKKNE SISPVDYQTP LGISILSETK SENGKDRSHE NELWEVERNG DILSSYQRDI MANIIKGHGK AFRFEKGCGM KKLISLILTS NLSNSRCLVI VPSRNYSRQI PASILENRVS YLRYGSEQDI RSLSNFVKET YKATIGLIGE KCEVEENESV SIAGIYKFEQ RLKLEWSKIA NSLSLGVETK NELKSAFAFF SGADSPVPQS VNFKMVFKAY KKRRYALALA ATLIPIIELH SKGRNDDVWN LLFRKFSSII AYEDYLNLLQ NNSHHNMNNF DNIIVVNGWP GAALVAGLKN THTRKVVEIG ARHVLSSTKL GEQEPVLFQW RPEFIPISKQ NLKLVNKKIR NFNPGLKHVF QVIYTEDQVD SMEYSVLLYQ YLRLLGYPSS KICISVGSLL HRALLEEVLS KHCTKISKDK SIKSANESSD DPKDFQFGWP DILIYDDPDY YFDTYEYGII SAVSQTANRL EVISLPGRFG NYIIGSQIYS HEFELPEVAD LEVVVGENYN TEVRKQSQSY PIESKDHFEQ YIHNMTKVRL GHKK
|
| |