Gene Information |
Locus tag | PICST_67546 |
Symbol | |
ID | 4838459 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 692796 |
End bp | 695912 |
Gene Length | 3117 bp |
Protein Length | 811 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640389774 |
Product | predicted protein |
Protein accession | XP_001384429 |
Protein GI | 150865280 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
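The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state its convention), and the variable and function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
# Names are hypothetical; they are not part of the database record.
START_BP = 692796
END_BP = 695912
PROTEIN_LENGTH_AA = 811


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)

print(gene_len)            # 3117, matching the Gene Length field
print(cds_len)             # 2436
print(gene_len - cds_len)  # 681
```

The 681 bp not accounted for by the 811 aa protein is what one would expect for a gene marked as spliced, although the record itself does not list exon boundaries, so this reading is an inference.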
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.430026 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.64637 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CAGGAGATCT GGATTGGGAT TCGCCATTGA ATAGGTTTCA CTTACAAGTT TTGAGTGAAG AAAGGAGAAT TTGCCGTCAT CTCGCTAGAC TGCTATCTGA ATCTTATCTA TTAGGCAATA TTAGATATTT GAAAAAATAA CAGAAGAAGG AAAGTGAATC CAGATTTTGT TGAAGTTTTT AAAGAAGAGC AGTAGAATTC CTGAGGTTTA TACTGGAGAC TTTATTGCAG CACCATTAGT TTGCTTTCTC CAGATACACA AAACTACAGA GCCATATTCA TCCATATCAA CATACTGTTT TCTATATTAT CAGCATTCAG ATTGGAACAA CACAGTTTTT TTCAGGTTTG TTTATTCAAT ATCACAGATA ATAAAGACTT TGACATTTCA TCTTTGTCAC TGTTGACTTC AGATATAGCT ATTACAGCAT CGTTTTGCAT TCATATTCAT ATTCCAGTTT CGTTTCTTTT GTTTAATCTC ATCGCACCAC AATGCCTAAG AAACTGACTT CATCTAAAAC GGCCTCTTTG TTTGATATCC GGTTGAAGAA CCTCGACCAC GATGTGTTGG TACTCAAGGG AAATGCACAG GATGCGGCAT CAGCATTGTT GGCTGGAAAA ATCATGTTGT CTCTCAATGA ACCGTTATCG ATCAAGAAAT TCACCTTGAG AATGTATGCC AATCTCAGAC TCAGTTGGCA GGACTCATAT AAGACAACAA AAGGGGAATT TACCAAACCA ACCAAGTTTT CCAAGAAAGT CTACGAATAT GTGTGGGACA ATGTAGAAAT CAACCTGTAC CTCACCAACT TGTACGACAA CTCGTCGCAA GCAACCCCAT CTGTAGGTAT CAGCAGAAAC CAGTCTGCCT CTTCGTTGAA AAACTTGGGT TCTTCTTTTC TCCTGAGGTC TTCTTCCAAT TTGCAATTGA CAGGTATGGC GTTGTCAAAT TCGGCATCGT CTACAAACTT GTCTGCTTCC CCCAAACCTG GAAACCATAT CCTAGTTGCA GGTAACTATG AATTTCCATT CAGTGCTGTA CTTCCCGGAG ACATGCCTGA ATCAGTAGAA GGTTTGCCTG GTGCTTCGTT GGTGTATCGT CTTGAAGCAA CAATCGACAG AGGTAAGTTT CACGCACCTA TGGTGGCCAA AAGACACATA AGAGTTATTA GGACATTGAC CACTGACGCT GCTGAATTGA CAGAAACTGT AGCTGTAGAC AATACTTGGC CCAAGAAGGT GGAATACTCG TTAAATGTTC CCTGTAAAGC TATAGCTATT GGTTCAGGTA CTCCCATTAG TTTCATGTTG GTGCCATTGT TGAAAGGGTT GAGATTAGGA GACATTTCCA TCAAGTTGGT AGAGTACTAC TCGTATTTCG GGTATCTTCC TCCAGCATAC AATGCCGAAC GTATTGTATG TGAGAAGTCG ATTCCACGAC CTCTGGAAAA CGACCCAAAC TTCCAGATGG ATAAATGGGA AGTCGACACC TTCTTGCGTG TACCACTCAG CTTGTCGAAA TGTACTCAGG ATTGCGATAT TTTCTCGAAC TTGAAGGTCA GACACAAGAT CAAGTTTGTC ATAGGCTTGG TAAATCCAGA CGGACATGTA TCCGAGCTTA GAGCTTCGTT ACCCATTCTG TTGTTCATTT CTCCCTTTGT GACAATCAGA GCTAGCCACG ATGATCCCGA GGAAGTTGCT TCTCATCCTC CAAATGAGCA TGGCGTGCCT CCGGAGGAAG AAGAATTATT CACCAACGAT CTTCATAGTG CCAGCAACAC CAGCTTGAGC GACATGGCAG 
AGGCACAGGC TGAAAGAGGC GAATTGAGAA GATCCAACAA CAACTCGGCT ACCAACTTCA ACGGCTTCAT GGCTCCTCCA GTGTACGAGA AACATATTTA TGATCGCTTA TGGTCAGATG TGTCTCCTGT AGAATCTCCA GTAAATTCCG GTGCTTCCAC TCCTAGAGTA TTTGAACGTC CGGCTTCCGA AGTAAATCTG CATTTTGCTA TGTCTCCATT AGACAGTGTT CAGTTGAACG AGAACTTGCG ACAGTTGAGT TTGCAACGAC AAGTTCAGGA CTTTTCTGAA CCCGGATCTC CTATTTTTTC TTCACTTCCC GGTACTCCTC AAGAAAACAG AGCCATTTTC AATCTTGACG GGGAGGCGGA CCACAGCGAT TACTTCTCCA GAAGTTTGGG CTCTGGCTCG TACAGACCAG CAATGGCTAG AACAGGCTCG TCTTATAACC AATTAATTTC TTCGAGTGTA GGATCACCTG TTCACATTTC CAGAATTAAT TCTGACGTAA ACTTGAGCAC AGATACTTTA TCTAAGGTTC CATCCTATAA TGAAGCCTTG AAGGGAGATG CAGAGGAAAT AGTACTTGCA CCAATGTATG AGCCACCATT GCCTGGTTCC CAGATTAATC TTGCTGAAGT GAACAAGAGG TTTGAAGAAA TGAGACCCAG TCCTGTGCCA CAGCAGACTA CATTTAAAAA CAGATCGTTA TTGTCGCGTG GGTCTAGTAG CGCAAACTTG CGCAATTTGT CTTCCGGCAA CTCGTCACCT TCAAACTCTC GTAACGTCTC TTCTTCCAAC CTAGCTGGAT TGTCCAGATC CTCATCTAAG AGAAGTCTTG TTGGTAGTGG AAGCAACAGT AATGGAAGCA GTGGTGTTAG TTTGAATGCT CATCTGCCGA AGCCGTCTAC ATCTCCTATA GGTATTTCTG CTCTACACTC TGGCTCTGCT GTGTTTTCAA TGACACCATT GCCGCATCCT TCAACTTCTC CAGTGCATAA CAGCACTTTT AAAACACAAG ACTTGCCTTC TCATTTGATG CTCCATACAC AGTCGACGTC CAGTTTGAAA ACCAAGCCTT CTTCTAGTGC AGCTGACAGA AGTGCAGCTG CCAATGGAGC ACCCATGAAG TCCGCTTCCA GTTTGAGTTT ACACAACTTA CAATTTTTGA ACAGAAAAAA GGAAAAGAAA GAAAAATGAT GATAGAAATA AAGAACAAGA AGACGAATAT TCATATTGGA AAAGACATAC ACGGTAGATA TGATACTATA TAAACAAAGT AAAGAATGGA GTAAACTATT AATAGACAAA TTAAAAC
|
Protein sequence | MPKKSTSSKT ASLFDIRLKN LDHDVLVLKG NAQDAASALL AGKIMLSLNE PLSIKKFTLR MYANLRLSWQ DSYKTTKGEF TKPTKFSKKV YEYVWDNVEI NSYLTNLYDN SSQATPSVGI SRNQSASSLK NLGSSFLSRS SSNLQLTGMA LSNSASSTNL SASPKPGNHI LVAGNYEFPF SAVLPGDMPE SVEGLPGASL VYRLEATIDR GKFHAPMVAK RHIRVIRTLT TDAAELTETV AVDNTWPKKV EYSLNVPCKA IAIGSGTPIS FMLVPLLKGL RLGDISIKLV EYYSYFGYLP PAYNAERIVC EKSIPRPSEN DPNFQMDKWE VDTFLRVPLS LSKCTQDCDI FSNLKVRHKI KFVIGLVNPD GHVSELRASL PISLFISPFV TIRASHDDPE EVASHPPNEH GVPPEEEELF TNDLHSASNT SLSDMAEAQA ERGELRRSNN NSATNFNGFM APPVYEKHIY DRLWSDVSPV ESPVNSGAST PRVFERPASE VNSHFAMSPL DSVQLNENLR QLSLQRQVQD FSEPGSPIFS SLPGTPQENR AIFNLDGEAD HSDYFSRSLG SGSYRPAMAR TGSSYNQLIS SSVGSPVHIS RINSDVNLST DTLSKVPSYN EALKGDAEEI VLAPMYEPPL PGSQINLAEV NKRFEEMRPS PVPQQTTFKN RSLLSRGSSS ANLRNLSSGN SSPSNSRNVS SSNLAGLSRS SSKRSLVGSG SNSNGSSGVS LNAHSPKPST SPIVHNSTFK TQDLPSHLML HTQSTSSLKT KPSSSAADRS AAANGAPMKS ASSLSLHNLQ FLNRKKEKKE K
|
| |
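The record lists translation table 12, the NCBI alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates this on the first 30 coding nucleotides of the gene sequence above, starting at the ATG that precedes CCTAAG. It is a minimal illustration rather than a full translation pipeline: the codon table is built from the NCBI compact amino acid string for table 12, the helper names are hypothetical, and intron removal is not attempted.

```python
# NCBI codon ordering: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"

# Amino acids for NCBI translation table 12 in that ordering.
# It differs from the standard code only at CTG (index 19): S instead of L.
TABLE_12_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE_12 = {
    b1 + b2 + b3: TABLE_12_AAS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_table_12(dna: str) -> str:
    """Translate DNA in frame 0 with table 12, ignoring whitespace and any
    trailing partial codon. Stop codons are emitted as '*'."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE_12[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 coding nucleotides copied from the gene sequence above.
cds_start = "ATGCCTAAGAAACTGACTTCATCTAAAACG"

print(translate_table_12(cds_start))  # MPKKSTSSKT
```

The output matches the first ten residues of the protein sequence in the record. Under the standard genetic code the fifth residue would be L instead of S, which is why the translation table field matters when re-deriving the protein from this sequence.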