Gene Information |
Locus tag | PICST_41332 |
Symbol | |
ID | 4836749 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 1092694 |
End bp | 1095879 |
Gene Length | 3186 bp |
Protein Length | 951 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640388064 |
Product | predicted protein |
Protein accession | XP_001382988 |
Protein GI | 150864247 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
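The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (which reproduces the reported Gene Length) and that the gene span includes the stop codon. The helper names `span_length` and `coding_length` are illustrative and not part of the record. Because the gene is flagged as spliced, the leftover bases are presumably intronic, but the record itself does not say so.

```python
# Values copied from the record above.
START_BP = 1092694
END_BP = 1095879
PROTEIN_LENGTH_AA = 951


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


gene_len = span_length(START_BP, END_BP)      # should match "Gene Length"
cds_len = coding_length(PROTEIN_LENGTH_AA)    # coding bases implied by "Protein Length"
non_coding = gene_len - cds_len               # bases in the span not accounted for by the CDS

print(gene_len, cds_len, non_coding)
```

A non-zero difference is what one would expect for a record with "Is gene spliced | Yes"; an unspliced gene of the same protein length would give zero under these assumptions.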
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACTC ATCTTCTCGC AGTTCCCTCA AAGCGATCCG AGGAAGTCAA TTGGTTGAAA CCCTTGAGCA ACTACTTGCT TTCTACTTAT GGTTCCACCA GTGAATACAC CGAAGACCTC ACAGCCTTCA ACAAGTTGAG ACAGGACATT CGTGGAGTGA ATGCTGATAA CACGGGAATC AACCTATACT ACAAGTACTA CAGCCAGCTT GAGCTACTCG ACTTGAGGGT GCCCTTCAAT GTAGTTAACG CAAGTAAGAA GATCAACTTC ACGTGGCACG ATGCCTTCCA ACCCTCATTG GTTAATAAGC AAGGAGCCTT GCCGTTTGAG AAGGCCAACG TTCTCTTCAA CCTAGGAGCT CTTTTGTCAG AGTATGCCAA AGTAAGATAT GAAGAGTCTC AACGCAGTGT TGCTGGAACG GAGGAGGCTT CTACGAAGGA GGCTATCCAG CTCTTTCAAC AGGCTGCTGG AATCTACCAG TTTTTGAACG AAAACTTTCT CCATGCTCCC TCCCTAGACT TGAACCAGTC TACAGTTAAG TTTTTGGTCA AATTGATGTT GGCTCAGGCG CAGGAAGTTT TTGTTTTGAC GGTTATAACC GGCGATCTTG AAGGAAAGAA GAACTCTTTG GTATCAAAAT TGTGTCGTAG TGCCTCCGTT CATTACGAGG AATGCCACAA TATGACTTCG TACATATCCA GTTTGGGAAG CAACTTCGAC GATTTCGCAG TGGTGGACTC GGAAGACTTA GAAGAAGATT TTTTGGATAA ACCGGATGAC TCTGACGAGA CTACAGAAGC CAGCACTTCC CATGTTCCTG CCAAGCTTGA TGCTTCATGG ATTGCTACCG TCACCTTGAA AATGCACTAT TACAAGTCGT TGTCGTACTA CTACAATGCC ATGAACTTGG AGGCCGGAAA AAAATATGGT GATGCATTGG CTTACTATAC GAAATCGCAG GATATTCTTC ACGAGATCAA CAGCACCTTG TTGAGAAATA TCTCCAAAGC TGGCTCTAAT GAAGCATACG AGATTTTAGA CAACTACAAG TACCAAAAAG ATGCTGTAGG AATCAAATTA ACTGATTTGA CAAAGGATAA TGACTTAATC TACCATGAAA TAATACCATC TTTGGTGACC TTGCCAGACA TCAAGCCATT GGATAGTACA AAGGTCATCC CTATAACTCA GAATACGACG TTCCAGGAGA TAAATGACCA CAACTATAAC AATTTCATGA GCAATGTTGT TCCCGTGAAT ATCCACGAAT TGTCCAGTTT TTACTCTGAA GAAAAGTCAC AGTTCCTTAG GAACGAGTTG GATGCTGTTG ATGTTTCGAA TGAGGAGATT TCGTCTGTTT TGGAATACTT GAAATTGCCT AAGGCCTTGG TAACTATCAA GGAATTGATA AATAGCACAG AAAACTCTGA CACAGACTCA AGTGGTAGTT CTATCGACCC CAAAATTGAA GCCATCGCCA ATGAAATCTC GTCGGAGTAT GCCAATGATC AATTGAATAG GCAAAAAATT TCCCAACTTA GGAAGGAAAT CTATGAAAAT ATTTCACAGA GCGAAGAAAT AGCTTCCAAG CAGGTTTCAG AGTCGTTGAC TAGTTTTAAG ATGGATCTTG TGAAGATCAA GAAATCGCTA TATGATGCCA CTAATTCGGA TAACCAACTT TTCGGCTTAA TCAATGACGA CTCTCAAAGT TTGTATGCTC TTTTGGGAAA GGGTTCGAAT TCCGAAGAGC TCAAGAATCT CTTCAAGACT TCTTCTGATA AGTCACAAGC TTCGAAGCCG 
GACATCAGTT TGCTAGACAT GATAGATACA GAAGTCAAGT CACCTAAAGA CCAGATTCTC TCGCAAATCA AAGTCTTGGA AGATATATTG CACGACTTAA ATGTGATCAA AGCCAACAAG ACCAAGTTAG TTGAGACGTT GAAGAAGGAA ATTCATAATG ACGATATTTC AGACATTTTG ATTTTGAATA GCAAGATGAA GTCTACAAAT GAAATCAAAA CACTTATATT CCCAGAAGAG TTGAAGAAAT TCCAGCCATA CAACGAAGAG TTAGATAAGT TGATCCAGAA GGAAAAGTCC TTTGTCAATG ATTTGAGGAC CGAATGGGGC AAACTTTCTT CTGATCCTGA GATCAAAAAT ATCCAATCAT CGAAGGCATC CAAAGATCAA TTGGTAGCTA GTCAAAGTGC AAGAATCACC TCTTTCTACA ACGACTCCTG GAAAAAGTAT TCTTTGGGTT TGAAGAGAGG TTCTCAATTC TATGCTGGTT TACTAGATTC GGCCATTAAT TTGAAAGGAA ATATCCAAAA CGAAGCTGAC CGAGCTGCTA TTAAACCAAG GTCATCGTTG ACTAGTAGTT TCGATGGTTT AAGTTTGAAC CAACAGCCAC CTCTACCACC CCAACAACAT TATCAGCAAC AACCTCAGCA GCAAACCCCG TCAGCAGCTC CGGGTCAGTA TGAGTATTTT GATCGTTACT CAGCACCTCA GCGACAGAAT ACACAGCCTT TGGGTTCACC TTCTGTCAGC CAGTATTCGA TGCCATCGCA AACCGCACTT AGCCGTCAGA ATTCTCAACC TATGGTTACA CCTCCATTTG CTAACCAATA TGGCCAACCA TCGTCTCAGA ATCAGAACCA ACATCAACAG CCTCAACATC CTCAACAAAG ACCACAGTAT GGTCAACCTA CAAACTCGTA TAATCAACAA AACCAATATT ACAACACTCC TCCAAACGTC TCTCTGCCGG CTCCAGGCTC ATCCTACGAT ACAGCAGCAC AAGCTCCGCA GCGTAGTTCA ACTGGAGGAA GTTTTGCTGG TTATTCTGAA GCTTCTACTG GTGGATACCA TAGAGCTCCT CCCGTTCCAC CAAAGAATAT CGACCAGCAA CCACTGGGTG GACGTTCTGC GCCTCCTTTG CCTCCACAGA TTCCACACTA TGGCCAGCCA TCACAATTCC ATTCATATGG TCAGCCGCAG CCTGGCAGCG ACAATGGCCA GGGAAGACCT CCACAGGCTA ACCAAGCTTA TCAACAAGCT TACCAACAGC AGCCTTCTCA GGGCAACGGA GGAAACGACC CCAACAACCC TAACGGTAGC AACTTGATCT ACGACCAGCC TTCGAAGTAC CTGCCAAATA TGTACAATTT CTTTTCCAAC AATTAG
|
Protein sequence | MKTHLLAVPS KRSEEVNWLK PLSNYLLSTY GSTSEYTEDL TAFNKLRQDI RGVNADNTGI NLYYKYYSQL ELLDLRVPFN VVNASKKINF TWHDAFQPSL VNKQGALPFE KANVLFNLGA LLSEYAKVRY EESQRSVAGT EEASTKEAIQ LFQQAAGIYQ FLNENFLHAP SLDLNQSTVK FLVKLMLAQA QEVFVLTVIT GDLEGKKNSL VSKLCRSASV HYEECHNMTS YISSLGSNFD DFAVVDSEDL EEDFLDKPDD SDETTEASTS HVPAKLDASW IATVTLKMHY YKSLSYYYNA MNLEAGKKYG DALAYYTKSQ DILHEINSTL LRNISKAGSN EAYEILDNYK YQKDAVGIKL TDLTKDNDLI YHEIIPSLVT LPDIKPLDST KVIPITQNTT FQEINDHNYN NFMSNVVPVN IHELSSFYSE EKSQFLRNEL DAVDVSNEEI SSVLEYLKLP KALVTIKELI NSTENSDTDS SGSSIDPKIE AIANEISSEY ANDQLNRQKI SQLRKEIYEN ISQSEEIASK QVSESLTSFK MDLVKIKKSL YDATNSDNQL FGLINDDSQS LYALLGKGSN SEELKNLFKT SSDKSQASKP DISLLDMIDT EVKSPKDQIL SQIKVLEDIL HDLNVIKANK TKLVETLKKE IHNDDISDIL ILNSKMKSTN EIKTLIFPEE LKKFQPYNEE LDKLIQKEKS FVNDLRTEWG KLSSDPEIKN IQSSKASKDQ LVASQSARIT SFYNDSWKKY SLGLKRGSQF YAGLLDSAIN LKGNIQNEAD RAAIKPRSSL TSSFDGLSLN QQPPLPPQQH YQQQPQQQTP SAAPGQYEYF DRYSAPQRQN TQPLAAQAPQ RSSTGGSFAG YSEASTGGYH RAPPVPPKNI DQQPSGGRSA PPLPPQIPHY GQPSQFHSYA YQQQPSQGNG GNDPNNPNGS NLIYDQPSKY SPNMYNFFSN N
|
| |
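The record lists translation table 12, the NCBI "Alternative Yeast Nuclear Code", in which the codon CTG (CUG) is read as serine rather than leucine. The sketch below illustrates the effect on the last 54 bases of the gene sequence above, which contain one CTG codon. It builds the standard codon table from the usual TCAG ordering and applies that single reassignment. The names `codon_table` and `translate` are illustrative helpers, not part of any database API.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1) in TCAG codon order.
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg_as_serine: bool) -> dict:
    """Return a codon -> amino acid map; optionally reassign CTG to Ser (table 12)."""
    table = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), STANDARD_AA)}
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(dna: str, table: dict) -> str:
    """Translate in frame 0, ignoring spaces; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    return "".join(table[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# Final 18 codons of the gene sequence in the record, in reading frame.
TAIL = "GAC CAG CCT TCG AAG TAC CTG CCA AAT ATG TAC AAT TTC TTT TCC AAC AAT TAG"

with_table_12 = translate(TAIL, codon_table(ctg_as_serine=True))
with_table_1 = translate(TAIL, codon_table(ctg_as_serine=False))

print(with_table_12)  # DQPSKYSPNMYNFFSNN*
print(with_table_1)   # DQPSKYLPNMYNFFSNN*
```

Only the table 12 result agrees with the end of the listed protein sequence (…DQPSKY SPNMYNFFSN N), so a tool that defaults to the standard code would mistranslate this gene at every CTG codon.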