Gene Information |
Locus tag | PICST_64420 |
Symbol | |
ID | 4841097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 248922 |
End bp | 251363 |
Gene Length | 2442 bp |
Protein Length | 813 aa |
Translation table | 12 |
GC content | 37% |
IMG OID | 640392412 |
Product | predicted protein |
Protein accession | XP_001386446 |
Protein GI | 150866749 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0169] Shikimate 5-dehydrogenase |
TIGRFAM ID | |
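The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. Neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 248922
end_bp = 251363

# Assumption: coordinates are 1-based and inclusive, so the span
# includes both the start and the end position.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is an uninterrupted reading frame ("Is gene
# spliced: No") whose last codon is a stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 2442, matching "Gene Length"
print(protein_length)   # 813, matching "Protein Length"
```

Under these assumptions the computed values agree with the reported gene length of 2442 bp and protein length of 813 aa.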
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0116008 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATATATG AGAGATGTGA GAACCTGCTC CAACATAAGT TTCAATCGGT TGTAATTGTA GGTTTGCGAG GAGTGGGAAA GTCAACTTTG GCCTTGATGG CTCTGGCTAC TTTGGGTCTT GAATATGTAG ATCTTGAAAG ATGTCTAGTC GACTATACTG GAGTGTCTGA TGCAACTTTT ATCAAAAGCG TTTCTAAGGA AGAATTTATA CATTTGCAGT ACAAGTTGAT TGTAAGATCA TTTAGAGCTA ACAAGAATAA GAGAGCCATT TATGTATTGC CAGCAAGTTC GATCAACAAT TCCGCCGTGA TGGAATATTT AAGAAATAAT TGCAATTGTC ACTGCGTTAT CAATATCGAA TGTGACGAAG ATAGAATATT GAAGTATGTC AACTACACTG GTGAATATCA AAAGGGAATC CTGTCAATAC AGTCCGGAAT ATCCCAATAT AGATCTGTTG CAAACTATAA TTTCTTCAAC TTGGAATCAA ATTTAGATGT TTGGAAAAAG TATTCGTTCT CTAAGGTGGA TGATTCCAAG CAAATTGAAG TTACACCACA TCTAATATTG AAACCCGTAG AATTGGAATT CATCAATTTT ATGTCATTTA TTCTTTGGAA TCCAATACCG GAACCCTCTG ATGTTCTTTT GAAGCATTAT CAGAGAAGCA ATTTAAGAAG CTTTTCTAAC TGTTTACAGC TCACATTCCC TTATGATGCC AGAAACATTC ACGTGAACAA TTTTGGCGAC GTCTTGAATG GAGTAGATGC AATCGAAGTC GGAGTTGATC TTATCCAGTT AATTAGAACA AAAATCATGC ATGTCAATTT GCGGCTAGAT GAGTATATCG CGAGAATAAG AAGAAGTACT CGAGGATCCA TACCAATACT TGTTGGTATT AAGAATACTA TCCCTGAGCT CAACAACTTC ATTATGGAGA GTACTGTTGA CTCTGTCATT AGTACATCTC AAATTAAACA GGATTTTAGA AGCTTTTACT TCAGTATCTT GTACTCGATT ATCAAAGTAG CTGCTGACTA TGTGGTTCTC AATTTGGAAA TTTTTCTTTT TGATGAAGTC AATTTCCTAA ACGATATTAT CGTGAATGAT TCTTTCTATA TCATGGACCA ACTTAGACAT ATGCAAGGTA ACAGTCTGTT TCTAGGAACA TATAACTCGA ACTGCGATGA ATTCTGGGCC ATTCAAAAAA CAATAGGCAA AGTACGGTGC CTTGATATTA TCGATTTGAC GAATGATTTA CAGATATCCA TGGTAAGAGT CACTTCTACT GCTCGTTGTG TATCAGATAA TTACAAGATA CAAACATTCT TGGAATATTG CATGAAGAAA TATCCAGAAA CTACGGTATC AGCTTATAAT CAAGGAACAA ATGGAAAAAT ATCCAAGATC TTAAACAAAG TGTTGACACC TGTATGTTCT CCAAGTGCTG ATCCTCTGCA AGGTGAGCTC ACGTCATATG CTCTTAACCA GTCGAGGTTT TCATGTTTTT TGCAGCCAAC GCTACGATTT TTTGCTGTGG GCAGAAGTGA TTCAAGTATT CTCTATCAAT TTGTCTATAG ACTGGTGTTT GAGAAATTGG GACTTTCATA TTTTTTCAAA ATTCTCGAAG ATGTATCAAT TGATGAATTA TTGAAGTCTC CCGATTTTGG AGGGGCGATA TTAGCAACTC CTATAGAAAT TAAAGCGAAC GAATTTGCTG GTAAATCATC AGCACATGCC GCAGAGATTG GATTAGTGGA TTCTATAATT GCAGAAAGAT CTCTTGATGA TCCATCTAAG 
TTCCTACTTC GAGGGGAAAA TGCGGATTGC TTAGCAATCA AGGTCTATAT CTCTGACAAT GTTGCTCCCA TAAATGCTGT AAGCCACAAT AAGAGTGTTC TCGTCATAGG TTCAGGTTTC AAAAGTCGTG CTGCTATTTA TTCGTTGATG AAACTTGGCT ATAAGAACAT TTTATTGTAT AGCCCTATGT CCATTGCTAG ACAGACTGAG AAGGATGTAT CTCTATCGCA CAATTTGGAT TCTTCCAGGA AATTGGATTC CCACAATCTA TTGGCTAAGA TTACAATAAT TACTGAAGAA CAATTCCAAA ATGGTATCCT CCCTGATGAT CTTCTATATC CAACAATAAT TATTAATTGC ATGAGTGATG AGGATGTTCC TATCGATGGT CAGGTCAAGC TATCTGCAAA TTGGCTCAAG AGTCCTTCTG GTGGAATATT CTTGGACACA CATATTGCAA ACAAAGAAAT TACAACCCTA AATGAGAGCC TGGAATGGGA AAAAGGATGG ATCAAGACTA ATGGACTTGA ATTCTTGCTT GCCAAAACAT TGATCCAGTT TGAGTTGTTT GTGGGTAAAC CAGCACCAAG AGAGCTTATA AAATCCATTC TAATAGAGCA TTATCCTAAT GAAGTTCAAT AG
Protein sequence | MIYERCENSL QHKFQSVVIV GLRGVGKSTL ALMASATLGL EYVDLERCLV DYTGVSDATF IKSVSKEEFI HLQYKLIVRS FRANKNKRAI YVLPASSINN SAVMEYLRNN CNCHCVINIE CDEDRILKYV NYTGEYQKGI SSIQSGISQY RSVANYNFFN LESNLDVWKK YSFSKVDDSK QIEVTPHLIL KPVELEFINF MSFILWNPIP EPSDVLLKHY QRSNLRSFSN CLQLTFPYDA RNIHVNNFGD VLNGVDAIEV GVDLIQLIRT KIMHVNLRLD EYIARIRRST RGSIPILVGI KNTIPELNNF IMESTVDSVI STSQIKQDFR SFYFSILYSI IKVAADYVVL NLEIFLFDEV NFLNDIIVND SFYIMDQLRH MQGNSSFLGT YNSNCDEFWA IQKTIGKVRC LDIIDLTNDL QISMVRVTST ARCVSDNYKI QTFLEYCMKK YPETTVSAYN QGTNGKISKI LNKVLTPVCS PSADPSQGEL TSYALNQSRF SCFLQPTLRF FAVGRSDSSI LYQFVYRSVF EKLGLSYFFK ILEDVSIDEL LKSPDFGGAI LATPIEIKAN EFAGKSSAHA AEIGLVDSII AERSLDDPSK FLLRGENADC LAIKVYISDN VAPINAVSHN KSVLVIGSGF KSRAAIYSLM KLGYKNILLY SPMSIARQTE KDVSLSHNLD SSRKLDSHNL LAKITIITEE QFQNGILPDD LLYPTIIINC MSDEDVPIDG QVKLSANWLK SPSGGIFLDT HIANKEITTL NESSEWEKGW IKTNGLEFLL AKTLIQFELF VGKPAPRELI KSILIEHYPN EVQ
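The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below translates only the first 30 nucleotides of the gene sequence above under both the standard code and table 12, to show where the two differ. The helper names (`codon_table`, `translate`, `TABLE_1`, `TABLE_12`) are hypothetical and written for this illustration. The table is built from the standard code with the single CTG override, which covers amino acid assignments but not any differences in start codon usage.

```python
# Standard genetic code laid out in TCAG order
# (first base, then second base, then third base).
BASES = "TCAG"
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(overrides=None):
    """Build a codon -> amino acid map from the standard code plus overrides."""
    table = {
        a + b + c: STANDARD[16 * i + 4 * j + k]
        for i, a in enumerate(BASES)
        for j, b in enumerate(BASES)
        for k, c in enumerate(BASES)
    }
    table.update(overrides or {})
    return table


def translate(dna, table):
    """Translate a DNA string codon by codon, ignoring spaces and any partial final codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


TABLE_1 = codon_table()                # standard code
TABLE_12 = codon_table({"CTG": "S"})   # table 12: CTG -> Ser

# First three 10-base groups of the gene sequence shown above.
prefix = "ATGATATATG AGAGATGTGA GAACCTGCTC"

print(translate(prefix, TABLE_12))  # MIYERCENSL
print(translate(prefix, TABLE_1))   # MIYERCENLL
```

The table 12 result matches the first ten residues of the protein sequence above (MIYERCENSL), while the standard code would give a leucine at position 9.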