Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_45331 |
Symbol | |
ID | 4838982 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 355617 |
End bp | 361421 |
Gene Length | 5805 bp |
Protein Length | 1848 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640390297 |
Product | predicted protein |
Protein accession | XP_001384368 |
Protein GI | 150865234 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.871421 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.583957 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCCAC AGTGGATGCC TCAAAATATC CAAAAGCGTC TTCTACTCTA TGTTCTCCAG CAGCTTTCGC TCTTTTCGGA AATCGACCTT CCCAACCTAG AAGAAGTCTC TCTCAACAAT ATAGTGCTAA GAGATATCTC TATAGATCCG GAGAAAGTAG GCAAGCTACC AGGCTGTAAT CTTCGTTTTG GCCAGGTAGG TACATTGGAG TTAAACACAG TTACAGGAGG AACCATTATT GGTGGTGGCG GGGGTGTTAA TGTCGACGCT CGTGATGTAG AAGTAGTGAT ATCTCCTGAT TTCGATATCA ACGAAGAAGT GCGAAAAGAA GTGCAATTTC TGTTAGCACA AAGTACTGCT GATTTGGCCA AAACGATTAT AAAAGATAGC GACAGCGCTG CTGAAGATGA ATCCGATGAT ACTGACGAAG AGATCGTCGT TGAACCCAAG AAGTCGCGTT CGAGCAGTTC CTCTTCTTTC TCTGGCTCTA CTTCTAAACC ATCTGCTTTA AGTGCTGTCA TGTCTCGCGC TGTGGAAATG GCGCTTCTGC GTTTACAAAT CAAGATCACC AACATGAAGA TCAAATTGGT TTCTGAGATG ACGGACTTGC TTTTTGAAGT TGACGAGGTT CTAATCAACA CCGTGAATGG AACGAGAGTA GTGAAGATCA CTGGCGTAAG GCTGATGACT TTGAAGCCAA ATGTAAACCC GGGTGAATTG GTTGAGAAAG TAGTTCAAAG TCCTCAGAAA GATGATACTA GTGATAATGA AGAAGATGAC AGTAACTATG AAGACGATAA CAATGATTAC GGCGACGAAT CGCTCATGGA TAGTATGGTT TTCACGCATG AAGAGGCAAG TTCCATCTAT TTGAGCGCTA CCTCTCAGTC TTTTCCACGA CCGACTAGCT ATAATGTAGA TGAAGGAGAA GTTCATGTTG GAAATGAGTC TGTATCTTCA GATCCTCCAG CAATCTTCCA TATGGACTAT TGTGATGTTG AGTTTGATGG CCTCAGTAAT GTTTCCAATT TGAAGATCGA TATAGGAACT ATCAAAGTAG CGACAACTCC ATTGGCTCCA ACCATCATTT CCATACTCAA TGGCATTACC AGAAGCTTGA AGATAAAAAA TCATCAGAAA TGGACCCAAC AAGCTCTCAA ACGCCAACAG AACTCGAGGT TTCCTCAATA TGCTGAGACA ACTGATGAGT TGACAGATGA CGAAGCTCTG TCTAAAGATG GAGAAAATAC TGATCCTTTC TTTAATAAGC TTCATATAAG AGATATTATT ATTAGTACCA CTTCAGCGTT GAGTCGTGAT GGAGTCTTTG CTTCCCCCGA CAATTCTATC AATTTTGTAT TACACAATCT GAACATCAAG CAGAAAAATG ACATGCTCAT TTATGGGGGA GTTGAGACTT TCAGAATAGA ACAAGTGAAG GAAGGAGTAA CCACCGATAT ATTTACCTTT GAGTCACCAA CGGCAGCTAC TCATGCAGAA CAATCTCAAT CACAACCAGA TTCTGAAGGA GTATCTTTTG CTCATGGGTC TCCACCTCCT CCAATACGTC CAATGTCTCC TCTGAGTATT CTGTCTATGT CTTCTTCTGG TTCTAAAAGT TCAACTCTAA AAGCCGATCT TCGTTTTGAG ATATTTAAGA AGCTTGAGGA GAATGTAGTC ACCATAGAGA CAACAGCGTT ACTCTCTAAG ACTGCTTTGT TGACTTTGGA CTTGAACAGC TCTTTGATTC TTTCCAATTT CATTACGGCT ATGAATTCAA TCCATTCTAA TTTCAAAGTA TTGATGGCAA CGATAGAAAA TCTCAGCAAG CAGCAATCTC CAAAGAAGCA AAAGACACAT ACAAACGCTG CAGAAGCAAT GACAACAAAA ACTCAATTTA TATTACAAAC ATCTCCAATT ATCATGAGCG TTAAATTCAC ACAAGATTTG TTAGTCAAAG CTATCATTTT TCCCATCTCA TATAATTTAC AACAGAATCA GCTAAGTATT TCCAAGATTT TGATCAATAC AACTATCAGA AATGAAAGGG AATCCACTAC TATAACAATA TCGAACATAG TACTATTAAC AAAATTGCAC GAGTTCAAGT CGTTTATTAA GAGAATTGCT AATCCAAGCA ATGCCAATCC TATACCTCGT GAAGTACAAG TAACAGCTTC GTCTAACCTA TTTATATCAA AGATCATGGT TAATATAGCT TTGAAAGAAT TGAAATTTGT GATATCAAAT ATAGTATCAT TTTATGATTC ATTTGCTTCA CTTTCAGCAA AACAATCAAA TTCCTTGGAA AACTCGGTTC TTGACTTTGT AAGAGACAGA AGCCATAAAT TAGAGATTTC GCTGATACTT CAACCTCCTG GTCAAAGCAG AAGAAGAATT GGACCTGGTT TTGCCAGTCC ACATCTCAGT AATCCTACAT TCGTAAACAT TAGCCGGAAC AATATTGCTT CTTTCCGCTG TTCAATCAAA GAAGTGGAGT TGAATCTCGT TCAGGTGTTG CCTAAGTTTG GAGGACTTAC GCTAAGATTA AAGGATATTT TGTTATACGA ACAAAAGAAC GATATCAACG GCTCTATTCT CTCTTTTGAT ATTGTGAGAG TGGATGACGG ACAGTTACAA AAGTTTGTCT ATGAGTTCCA GGAATTGCCC TTGGAATCAA TTCGCTTACC ACTAATAATG ATTCATTGCA AGAATACTGA AAAAATAAGT ACTGTGGATG TGGTTATTCG CAATGTTCTA GTCGAGTATT ATACACAATG GTTATTGTTG CTCGATGATT TTGAAGCAAA CGAAGAGAAA TTGGCAGAAT TGGTAGTCGA GAAAGTCAAA CCAGTAAATC CATCTTCATC CCAGAATCGA TTTGATATTC GTTTTTCTGT ATTTGACTGT ATTATTGGCT TGAATCCAGG AAGATTGCCT TGCAAGAGTT ATCTTGTTGT CGGTAGAGGA ACCTCTGATT TCACATTTGG AGTGAACCAA TTCTACATCA AAAGTTCTTT CAGAGATGTT TCCGCTCTCT TGATTGATGA TGTCAAGAAC AAGGAAAAGA TGCCATTGAA GGATAACTCA ACTTCTAGAT CAAGGAAGTC TCTTCCAACG AGTAGCTACA CATCCCCGTT GACTTTTTTT CTGAATCTAG GTTATATTAT GATAGGAGGT CTTAATGTTG TTCACATCGG AATTACATTC AACACTAATA TTGAGGAGGT TATGAAGAGG AATGAAAAGT TAGGAATCAG TGATAACTTA TCGTTGATTG ATCTCAAGAT AAATTCCGAT GAACACCATT TGGAATTGTG CGCTGATTCA ACACATGTTT TATTGCAATT GATTAACGAT TTGAAACTAC CTTTGAATTT CAAGGACGAC GAGAAAATGA AAGTAACTGT GGATAGTCTG ATTAATCTAA TGGACGATCT TGACGAGAAT CAGTTTCAAC TCAAGAAGAG AAATATTTCC GTTGCTCCTG GAGTTGAAAC TTCTTCTCTG TCGGCCGGTT CAGAAAACGA AAGTTACAAT GATTTAGAAA TTGTGGATGC CTATTATGAT GATGCACAGG TTTCGTCAAG TTCTGGGCAG ACTGCTTCTC AAACTTCAGA TGCTACGTCT ATTGTGAACT CAGATTTAGA GACTATTGCA TTTGACGAGG ACCATTTTTC CAAAGCAAGA AACAAAATTC CTAAAGGTTC TAAAGTTGAC CCTTTCAAGT TGAATATCAA TCTCTCAAAG ACAAAGATCT ATTTGTATGA TGGATATGAC TGGAAAGATA CGAGGAAAGC AGTTCGAGGA GCTGTGAAAA GAGTTGAAGC GCAGGCCATG AAAGAGAGGC TCAAGAAGTT AAAGAAACAG ACTGATAAAG AGTCTGAATT GAACAGAAGC GATCAAAAAA TCGATAAACC AAATTCTCCT GTTGAAGAAG CTGAGTTCGA AGAAAGCGAC CAAGGCAACG AAGAAGATAG CGATTACGAT GAAGAGGATG AATTCATTGA AGAGACTCTA TTTTCCTCGA TTCATGTTGG AATTCCTAGA GATGCTACAG ACGCTAATTT GACAGATAGG ATCAATAAGC GAGTACAGAG TAGTCTTCAG GAAACAGATA TGACACCTGA AGAAGCTCAG AAGGCTCAGA TTAATGTCGA ATTGGGTAAA AACTACAAAA ATTTGAAGTT GCGTAGATCT AGAGTTCACA AGATAATGGC TGATTTCACC AACATTGAAG TTAATGTGCT GGTGTACTCT ACAAGAGATC CAAGAAAAGA TCCCACTGAT GAAAATCTAC CATACGAATT GTTGAACGAT GTAGAAGTTA GACTTGGGAC AGCTGATGTT TATGATAACG TAGCTACTTC GACATGGAAC AAACTTCTCA CCTATATGAA CACTTCTGGC GAAGGAGAAA TTGGAAAAAG TATGTTGAAA TTGGCTATAA CTAACGTCAG GCCATCTCCC AAGTTGGTAT CCAGCGAAGC TATAATGAAA GTGCAAGTAT TGCCTGTAAG ACTTCATATT GATCAGGATA CTTTGGATTT TCTAATGAGA TTCCTTGAAT TTAAAGACTC TCGCTTCTCC TTGCCCTTGG ATGAAATTGT TTACATCCAA AAATTTCAAA TCAGCCCAGT CATGTTGAAG TTGGATTATA AACCCAAACG AGTTGATTAT GTTGGAATCA GATCTGGTAA CTCCGCTGAG TTTATGAATT TCTTCATTTT GGATGGCTCC ACTATCAATC TTGCTGAAGC CACAGTATAT GGCTTATTAG GAATGCCTAG TTTAGGCAAA GCCTTAGGTG AAGTTTGGGG TCCTCAAATT CAGCAAACAC AAATTGCTGG TATTTTAGCT GGTTTGGCTC CAATTAGATC GATAGTTAAT ATAGGAGGTG GGGTCAAAGA CCTTATTGCA ATTCCTATAA GTGAGTACAA GAAGGATGGA AGACTCTTTA GAAGTATTCA GAAGGGCACT CAGAAGTTTG CGAAAACTAC AGGCTATGAA ATCTTGAATT TGGGTGTAAA GCTAGCTTCT GGAACGCAGG TTATATTAGA ACAAGGTGAA CAACTCTTAG GAGGAGAGGG CTCTGGTGCC AGACTACCAG CGTCAAGAGG AAGTAATTCC CAGAAATGTA ATGTGAACAA GCGTCTGTCC TATAAAGTAG GTAGCGATGG TGAATTCTCT GATTACGGTG ATGTAGAGAG AGAACAAGCT GCTAAGGTCG ATTTCAACAA ACTACTTGCA AATTCGCAAG TATTGAATCA AAGTGTTCGC GTAGACAGAG ACCAGTATGC CAACAAAAAG TTCTACTCGT ACATTGATAT TGACGAGGAT GACGATGAAC TTGTCACCGG TATCGATAAA GAACTACTTA GTAAATCCAT TTTCTTGTTA CCAAGAGATG ACAACAGCAA GAAAGAAGGC GAGGAGGACG AAGCTGATGA TAGCAGTGCA GATGAAGAAG GAGAGAAATT GATTAGTTTA TACTCGAATC AGCCCGAAAA CATACAAGAA GGCATGAAAT TGGCATACAA GTCATTTGGC AACAATTTGA AGATCACCAA GAGGCAATTG ATCAACTTGA AGAACGAATT GAACGAGTCA GAGAACATAC AAGACTCGTT GAAGTCGATT CTCAAATCGT CGCCCATAAT ATTCATTCGG CCAATCATAG GAAGCACGGA GGCATTATCC AAGGCATTGA TGGGATTGGG TAACGAAATA GACTCGAAAC GGATAGTAGA GTCTCGGGAT AAGTATAGAT ACATCAAACG AGCCAAGGAT GAAGATGTCT TGTGA
|
Protein sequence | MSPQWMPQNI QKRLLLYVLQ QLSLFSEIDL PNLEEVSLNN IVLRDISIDP EKVGKLPGCN LRFGQVGTLE LNTVTGGTII GGGGGVNVDA RDVEVVISPD FDINEEVRKE VQFSLAQSTA DLAKTIIKDS DSAAEDESDD TDEEIVVEPK KSRSSSSSSF SGSTSKPSAL SAVMSRAVEM ALSRLQIKIT NMKIKLVSEM TDLLFEVDEV LINTVNGTRV VKITGVRSMT LKPNVNPGEL VEKVVQSPQK DDTSDNEEDD SNYEDDNNDY GDESLMDSMV FTHEEASSIY LSATSQSFPR PTSYNVDEGE VHVGNESVSS DPPAIFHMDY CDVEFDGLSN VSNLKIDIGT IKVATTPLAP TIISILNGIT RSLKIKNHQK WTQQALKRQQ NSRFPQYAET TDELTDDEAS SKDGENTDPF FNKLHIRDII ISTTSALSRD GVFASPDNSI NFVLHNSNIK QKNDMLIYGG VETFRIEQVK EGVTTDIFTF ESPTAATHAE QSQSQPDSEG VSFAHGSPPP PIRPMSPSSI SSMSSSGSKS STLKADLRFE IFKKLEENVV TIETTALLSK TALLTLDLNS SLILSNFITA MNSIHSNFKV LMATIENLSK QQSPKKQKTH TNAAEAMTTK TQFILQTSPI IMSVKFTQDL LVKAIIFPIS YNLQQNQLSI SKILINTTIR NERESTTITI SNIVLLTKLH EFKSFIKRIA NPSNANPIPR EVQVTASSNL FISKIMVNIA LKELKFVISN IVSFYDSFAS LSAKQSNSLE NSVLDFVRDR SHKLEISSIL QPPGQSRRRI GPGFASPHLS NPTFVNISRN NIASFRCSIK EVELNLVQVL PKFGGLTLRL KDILLYEQKN DINGSILSFD IVRVDDGQLQ KFVYEFQELP LESIRLPLIM IHCKNTEKIS TVDVVIRNVL VEYYTQWLLL LDDFEANEEK LAELVVEKVK PVNPSSSQNR FDIRFSVFDC IIGLNPGRLP CKSYLVVGRG TSDFTFGVNQ FYIKSSFRDV SALLIDDVKN KEKMPLKDNS TSRSRKSLPT SSYTSPLTFF SNLGYIMIGG LNVVHIGITF NTNIEEVMKR NEKLGISDNL SLIDLKINSD EHHLELCADS THVLLQLIND LKLPLNFKDD EKMKVTVDSS INLMDDLDEN QFQLKKRNIS VAPGVETSSS SAGSENENLE TIAFDEDHFS KARNKIPKGS KVDPFKLNIN LSKTKIYLYD GYDWKDTRKA VRGAVKRVEA QAMKERLKKL KKQTDKDDYD EEDEFIEETL FSSIHVGIPR DATDANLTDR INKRVQSSLQ ETDMTPEEAQ KAQINVELGK NYKNLKLRRS RVHKIMADFT NIEVNVSVYS TRDPRKDPTD ENLPYELLND VEVRLGTADV YDNVATSTWN KLLTYMNTSG EGEIGKSMLK LAITNVRPSP KLVSSEAIMK VQVLPVRLHI DQDTLDFLMR FLEFKDSRFS LPLDEIVYIQ KFQISPVMLK LDYKPKRVDY VGIRSGNSAE FMNFFILDGS TINLAEATVY GLLGMPSLGK ALGEVWGPQI QQTQIAGILA GLAPIRSIVN IGGGVKDLIA IPISEYKKDG RLFRSIQKGT QKFAKTTGYE ILNLGVKLAS GTQVILEQGE QLLGGEGSGA RLPASRGSNS QKCNVNKRSS YKVAAKVDFN KLLANSQVLN QSVRVDRDQY ANKKFYSYID IDEDDDELVT GIDKELLSKS IFLLPRDDNS KKEGEEDEAD DSSADEEGEK LISLYSNQPE NIQEGMKLAY KSFGNNLKIT KRQLINLKNE LNESENIQDS LKSILKSSPI IFIRPIIGST EALSKALMGL GNEIDSKRIV ESRDKYRYIK RAKDEDVL
|
| |