Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32703 |
Symbol | |
ID | 4840095 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 447250 |
End bp | 449281 |
Gene Length | 2032 bp |
Protein Length | 636 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640391410 |
Product | predicted protein |
Protein accession | XP_001385771 |
Protein GI | 150866242 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAACG AAGTAAGTAT ACTGTGCCAA TGACAGGAAA AAGACTCGCA CCAACCGTTT CATTACGTCA CTGTACTGAT GCGATTAAGC GACTCACCTT AATATCTATG GCGGTACTAA CGACACCCTA TAGCAACAGA AGACTACTTT TGATTTAGAG CCCAATCCTT TTGAACGCTC GTTCGCCTCC AAGGACTCGC TGGTACTGAA CGCTTCACTG GTTCTGGAAC ATCATAACAG AGACTCCGAG AACTCATCAG CCACTTCGTC TACCAAGTCC TCTAACAAAC ACAACCTCCA CATCCCCAAT CTTTCCACAC TCAACGGTGC CAACAACAAC ATTAATAACA ACATCAACAT CAACAGCAAC AATAATAGTA TCAATAATAG TAGTAATAGT ACCAGTAACA AACTTCCGGG AATCACTCCT CCCCTTTTTA CACCCGGTGG AAGAAGATTA CATCCTTTGG GACTTTCTCC TCCTGTCCCC GGATCTAATG GTGCTGCTGT CACAGCTAAC GGAACGGCCT TGCTGAATCC TGGCACTCCA GGTTCTAACT TATGGAACAG TTTGTTGAGT GCTACCAACA ACCACAATAA TAATGGTTCC AACGTCAATA CTGCCAATGT TGCTACTAAC GGTGCCAATG CCAATGTGAT AGCTGCAGGC AATGGTCCCA ATTCTCAAGC CAATTTCAAC CAGTTTGTGA ATACTCTCAG AAAGACTGGA TTGACTCCTA ACGAGTCGAA TTTGCGTTCT GGCTTGACGC CTGGAATTCT CTCCCATCAG TTTTCATTTG GAGCACAGGT TCCGGGCTTG ACTACTCCTA GCGCCTTGCT TAATAGTCCT ATGACTCCTG GTTTGTCTTC CTTGTTGGGC TTGACGTCTA ACAATTCTGC CAATAACGTC GCTACCATTA ACTCGTTACA GCCAACACAT CAACAGACAT ACGATACCCT TCCCCTGATC CCGCAAGAAC CTTCTGAAAG TTTGCCTACT TCAGAACCCC TTAGACAGCC AATGGCTGCT CCAGTTGTGA AACAGGAAAT TAAGAAACAA GAAACGAGTA AACGTAGTCA AAGCAAGAAG AGAAAGGCAG ATACCGCAGA TTCTAGTAAG GGAAAGAGGC AAAAGGCAGA TTCCGCTGCA GCTAAGAAGG CTGCCGCCAC CAGAGCCAAC CTGGAATCCG ATTCGGACAA AGAGTCCTCT CCACCTAGAA ACTCGAACAA TCCCAAATCT GAAGACGAGA AAAGAAAGAG CTTTCTCGAG AGAAACAGAG TGGCTGCGTC TAAGTGTAGA TTACGTAAAA AGCAATTGGT TCAGAAGATG GAGGACGAAT TGGCGTTTTA TTCTACAGGC TACAGAGAGT TGTCTGCTGA AGTCAACCAA TTGCGCGATC TGTTGATTAC ACTCAAAAGT ATTATAGAAA ATCACAAGGG CTGCGCTTTG TTGGCACAGA ATGTCGGAGG TTTTGATCAG ATAGAGAGAA TAATACAACA AGCCAACTAC ATCGCTGAGA TGAGTAACAA CAGTCTGAAC GATGTTACCT CTATCCCACT GACTATTCCA ACAACTCTTC ACAGCACAAA TTCCGTCAGT GCCATTCCTG CTCGTGGAAA TGATTCTCAG TTTCAGGCGA TGTCCAACAC CTTAGTTACT AAGACCATCG GCACTCCTAA CAGCAACGAT GTTCAAGCCA ACTCTAGCAC GAATACCACT ACTGTGGTGA CACCTGAATT GGCAGGAGCT TATTCTCATG CCACTATCAA CCATGCTCAC GGCATGTCAG ACATGCCACA ATCTAATCCT GATGGTCCTG TTGCTATGAA TGGAGGCAAT GGTGAGTTGA GGGCCATAAA CAGCATGTCA AACTTATCCG CTTTGAACAC AGGTGCTCAA GCTCAAATGC AGCAACTTCC ACATCCTCAA CAGGCCTTGC AGAACTATAG TCTCCGTCCT GTTAGCAGCA TGGTTGAGTT ACAACAAGCC ATGCATGCAC ATGGCAATCT TGGAAGCGAG TTGAACGTAT AG
|
Protein sequence | MTNEQQKTTF DLEPNPFERS FASKDSSVSN ASSVSEHHNR DSENSSATSS TKSSNKHNLH IPNLSTLNGA NNNINNNINI NSNNNSINNS SNSTSNKLPG ITPPLFTPGG RRLHPLGLSP PVPGSNGAAV TANGTALSNP GTPGSNLWNS LLSATNNHNN NGSNVNTANV ATNGANANVI AAGNGPNSQA NFNQFVNTLR KTGLTPNESN LRSGLTPGIL SHQFSFGAQV PGLTTPSALL NSPMTPGLSS LLGLTSNNSA NNVATINSLQ PTHQQTYDTL PSIPQEPSES LPTSEPLRQP MAAPVVKQEI KKQETSKRSQ SKKRKADTAD SSKGKRQKAD SAAAKKAAAT RANSESDSDK ESSPPRNSNN PKSEDEKRKS FLERNRVAAS KCRLRKKQLV QKMEDELAFY STGYRELSAE VNQLRDSLIT LKSIIENHKG CALLAQNVGG FDQIERIIQQ ANYIAEMSNN SSNDVTSIPS TIPTTLHSTN SVSAIPARGN DSQFQAMSNT LVTKTIGTPN SNDVQANSST NTTTVVTPEL AGAYSHATIN HAHGMSDMPQ SNPDGPVAMN GGNGELRAIN SMSNLSALNT GAQAQMQQLP HPQQALQNYS LRPVSSMVEL QQAMHAHGNL GSELNV
|
| |