Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_41120 |
Symbol | |
ID | 4837176 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 2091234 |
End bp | 2094425 |
Gene Length | 3192 bp |
Protein Length | 1039 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640388491 |
Product | predicted protein |
Protein accession | XP_001382629 |
Protein GI | 150863968 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.599383 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGACTC TCACTGCCTC TAATGTATGG ACGCAGGACG AGATCGAGCA TTTCCATACG TGCATAGTTC ACAACGCTAC ACGGACTGGG ATGGGCCACC TTATCGAGCA AAAGAGCATA GCTGAGCTCA ACAATGCTGT AACCCAGCTT ATGTCACGTT TCTACTGGAC GAAGGGCGAG ATGACCTTTC TTGAAAACCA CTTGGGAGCA GGCGTAGCAG AACTTTGTCA CGAGATCCCA CTTAGAGCTA AACATGGGAT CGTCCAGATG GTTTGGAAAG TGAAAAAGAG TCGTAATCAG GTAGCACTGG CCAAAAAAGC CAAACCCAAC CCCTCCACCA AGAAGTCCGT TAGGGACTTG GAAGTCACTA TAGCCGAAAA AAAAGTTGTG AATTTGACTG ACTCTTCTAT GGAGGACACG ACAGAAGATG ACAATTTAGC GTATTGTATT CAACACGATT TGTCCAGAGA AACACTCGAG GCTCTCTTCC CCGACTGGTC GTTGTCCGAG ACACTAGACA GAATCGTATT CCAGGCCGGT CCATTGGATT CCATAGCTCT TACAACTGGT GAAAAACAGA TTATCAAAGA CTCCATAAAG AAACAAAAGT CGTTGGAATC AGTACAGTTC AACTTCCCGT GCCGTTCCAA AGACTACCTA GTAGAAAAGT TTAAGGAATT TGAATTTGTC TCTTCGCGTA AGACCAAGTT CAAGTCGTTG TCAGAGAGAT TGCTTTACGA AGCAAAATGG GTTCTTTATT CTAGTGGGGA AACTTTTACA ACTACTAGAC GGTCCCGTAA GCGGGCTCTT GAAGAGTCCT TTGAAACCAT GGAAAAGGAA GCCCTAGTGT CGATCTACAC CAAGCCTGCT CCTAAGCAGG AAATGACTCC CGAAGAGTTG GAAGAAAGAG AACGTCGTAG AGAAGCCTTG AAGGAACTGA GAAGACTCGC GAATGAAAGA AAAGCTCTTC TCAGAAAACA AAGGAAGGAA GACTTGGAGA GACGTAAAGC GGCTGGTTTA ATCAAAGCAA AACCAAAATC TGACATTACC CATTCGATTA AAGATCTCTT GGCTGGATCC GAGCACTTTC AATCTGTTAT TGGCGATAAA AAGAAAGTTG AGGAGGGACA AAAAAGAAAA CGTATACAAG CAGTTCACTA CCGTCCTGAA ATCGAACCCA AGAAGCCTTC TAAACTAAAG GTCAGACACA GACAAGCCGA AAAAAGCAAA ATCAAACTAG CCTTAAAATT GAAAAAACAA CAAGAACACA ACAAGAGAAA ACCCAAAAAA GAACCAAAGA AAAAGAAGAA GACTGTCGAA GAAGAGTCCT TAGCTACACC GGAAACGGAG GAATTCATTA AAGAAGAGAT CGACGAAGAC GAAGAGGAAG AGGAAGAAGA GGACACCTAT AGTCCATTCG ATCCTGTTGA TCTCAATTCA GATTCGTTTG TGCCTCTTCA TGGAAGACAA TTCTACGCTG AAGAAATATA CGTAGATAAG CCCCATGTCC CTGAGTTGAA GTTTGTTGAA GTTACAGAAG AGTCTGAAAA TCTGCTCATC AGCAGTTCTA CTTTGACAGA AACAAAAAGA ATTATGACCA CAAACGACGA TGACATAGTC TACGAAGACT GCTTGGCAGC AGATATCATC CGTTCTCACA TCAAGAACTA CCGTGATTTA CCTATATCGT TCCCACCTTT GCTTGACCCT TCTAGCCTGG AAAGGAAGAT ATATCCTACG AATAAGGTCA GAATCAGATT CTTATTGTAT CCTCAACACT GTGAACTGTT TATTCTTGCA GCTCCTAAAA CCAACGAACT AGATCCTGTC TATGAGATTA TCAAATTATT CATGATCCAT TACGCCTTGT TCTTTTCGCA TTCCTCCGAG ATCAGAAGAA TCATCACCGA AGAGTATTGT CAGAAAATTG AGCATTCTAT CGAAGAGAAT GACTTTTCAG ATTTCATGTT CGTCGTTGAT AAATGGAACG CTTTAATGCT TAAATTATCA CCCAATGAAG AGGCAGTCCA ATCTATTGTC CAGAGTGGAA AAGAGGACAT AAATGCTGGT CTTAGATCGT ATCTTAGCGA ACAAGAGATT CGCGTACCAA CTAGTGAAGA CTTGAAGTTG CAGGCTTTTC TTGAAGCTGT CATTCTTGAA GATCTTAGTC CTACATTCCA ACTAATCAAA GAAGAACCGA GTGATGCCGA GAAACAGGAA TTTGTTCCAA AACATCTTGG TGACGTTGAA GCCCCAAAGA ATGTGAGTGA CGAATTGAAG GATATGAAAC CTGATGATTA TAATTTGATT TTCTTCACTA GGCTCAAAGA GAAGACTGAG ATTTCTAGAT TTGCGACTCA ACAAATTCTT TTGCGGATTT ATTCGAGAAT CGTTTCGACG GATTCACGAA AGTTGAGATC ATACAAGGCA TTCACAGCTG AAGTATATGG GGAACTTTTA CCATCATTTA CCAGTGAAGT TCTTGAAAAG GTTAATTTAC TTCCAAACCA GAAGTTCTAC GATTTAGGCT CTGGTGTCGG GAATACTACA TTTCAAGCAG CCTTAGAATT TGGTGCTTCT ATGAGCGGAG GATGTGAGCT TATGGAACAT GCTTCAAAGT TAACTAAACT TCAAGAGGGG TTATTGCAAA AGCATATGGC CGTGCTTGGT TTGAAGAAGT TAAACTTCAA CTTTGCCTTG CTGCAAAGTT TTGTTGATAA CGATCCCGTA AGAGATGCTG CTCTGGACTG TGAAGTGCTC ATCATTAACA ACTACCTCTT TGACGGCAAT TTGAATGCAG AAGTTGGGAG ACTTTTATGT GGTCTCAAGC CAGGAACCAA GATCATTAGT TTGAGAAACT TTATCAGTCC TAGATACAGA GCAACTGGGG ATACTATTTT TGATTTCTTC AAAGTTGAAA AGCACGAAAT GAGTGATTTC TTGTCAGTCA GTTGGACGGC AAACAAGGTT CCATACTACA TTTCCACTGT CCAGGATAGA ATCTTGCCAG AATACTTAGG CAAAGATGAA TCTCCTGATA GTGATAGATC CACACCTACA CTCAAATCAG AAAATGGCAG TTCCGAAAAC CTCGCTGGTA GCCTAACCCC TTTTACTGCA ACACCTGAAC CTGATATGTT CAAGGACCTA TCTTGTGTAT TAGGAGATGA AGACGACATT CTTCTCCACT AA
|
Protein sequence | VKTLTASNVW TQDEIEHFHT CIVHNATRTG MGHLIEQKSI AELNNAVTQL MSRFYWTKGE MTFLENHLGA GVAELCHEIP LRAKHGIVQM KSVRDLEVTI AEKKVVNLTD SSMEDTTEDD NLAYCIQHDL SRETLEALFP DWSLSETLDR IVFQAGPLDS IALTTGEKQI IKDSIKKQKS LESVQFNFPC RSKDYLVEKF KEFEFVSSRK TKFKSLSERL LYEAKWVLYS SGETFTTTRR SRKRALEESF ETMEKEALVS IYTKPAPKQE MTPEELEERE RRREALKESR RLANERKALL RKQRKEDLER RKAAGLIKAK PKSDITHSIK DLLAGSEHFQ SVIGDKKKVE EGQKRKRIQA VHYRPEIEPK KPSKLKVRHR QAEKSKIKLA LKLKKQQEHN KRKPKKEPKK KKKTVEEESL ATPETEEFIK EEIDEDEEEE EEEDTYSPFD PVDLNSDSFV PLHGRQFYAE EIYVDKPHVP ELKFVEVTEE SENSLISSST LTETKRIMTT NDDDIVYEDC LAADIIRSHI KNYRDLPISF PPLLDPSSSE RKIYPTNKVR IRFLLYPQHC ESFILAAPKT NELDPVYEII KLFMIHYALF FSHSSEIRRI ITEEYCQKIE HSIEENDFSD FMFVVDKWNA LMLKLSPNEE AVQSIVQSGK EDINAGLRSY LSEQEIRVPT SEDLKLQAFL EAVILEDLSP TFQLIKEEPS DAEKQEFVPK HLGDVEAPKN VSDELKDMKP DDYNLIFFTR LKEKTEISRF ATQQILLRIY SRIVSTDSRK LRSYKAFTAE VYGELLPSFT SEVLEKVNLL PNQKFYDLGS GVGNTTFQAA LEFGASMSGG CELMEHASKL TKLQEGLLQK HMAVLGLKKL NFNFALSQSF VDNDPVRDAA SDCEVLIINN YLFDGNLNAE VGRLLCGLKP GTKIISLRNF ISPRYRATGD TIFDFFKVEK HEMSDFLSVS WTANKVPYYI STVQDRILPE YLGKDESPDS DRSTPTLKSE NGSSENLAGS LTPFTATPEP DMFKDLSCVL GDEDDILLH
|
| |