Gene Information |
Locus tag | PICST_67675 |
Symbol | |
ID | 4838687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 1503267 |
End bp | 1506316 |
Gene Length | 3050 bp |
Protein Length | 980 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640390002 |
Product | predicted protein |
Protein accession | XP_001384242 |
Protein GI | 150865142 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
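The length fields above can be cross-checked from the coordinates. The following is a minimal sketch, assuming 1-based inclusive coordinates (which is what the listed values imply); the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1503267
end_bp = 1506316
protein_length_aa = 980

# With 1-based inclusive coordinates the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Coding region: one codon per residue plus one stop codon.
cds_length_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that are not part of the coding region
# (presumably flanking sequence around the CDS; this is an assumption).
non_coding_bp = gene_length_bp - cds_length_bp

print(gene_length_bp, cds_length_bp, non_coding_bp)
```

The computed span agrees with the listed gene length of 3050 bp, and it is longer than the coding region implied by the 980 aa protein.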
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.364575 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCCTAATAAC ATACTAGATA TGTCTTCCAA CGTGTTTGAG ACCAACGAGG GAGCTGAACA TGTCAACGAA GCCAATCGTG TTGTACGAGA TGCTGAGTCC GACGACGAAG AGGAAGAAAT GGAAGATGCC ATGGATATTG AACGTGATGA ACAGGTAGAT GCTTTGATAG AAGGTTTTGG AATTGACTCT GATGATGAAG ACGAAGACGA TGGCGCTCTT TTTCTTTCCT CGTCTGAGGA TGATTTCGGA CTGGACGGCG AGCCAATTGA AGATGACTAT AATGAAGACT ATAATCTACG TGATGCTTTG AGAAGCGCTG GGAACTTCAA AGTAAGGAAG AAGAAGTCTA CGGCTAAATC ATTTTACAGG CGTAAAGTAA CTCAGAGTGA AAACAGGGAT TTGGATCCGG AGGTTAGGCT GTATATGTCG CAGGCTAACG AAGCATTTGC ACACAACGAC TATCAAGTTG CTCGCAACCT CTACCTTGAA GTCGTCAGAT TAGATAAAAA GAATTACAAC GCCTACAAGA CATTGGGAGA GATTTCTCAA CATCAGGGCA AGTTGAATCA ATGCTGTAGC TACTGGTTTA TTGCAGCTAA CTTGCGGCCC TGGGATAGTA AATTCTGGGG CGATGTAGCT GAACTTAGTA CTCAATTGGG TCATACTGAC CAAGCGTTAT TTTGCTATAA TAGGGCCATC TCTTCTGAGC ATAAGAAAAG TGCACGATTC ATCCTTCAAA GAGCCCTTGT ATACAAGGAA ATCAAGCAAT TTGGGCGGGC TTTGGAAGGA TTCCAGCGAG TGAGACAGCA ATATCCCACT GATGCTTCTA TTGTGAAAAA CTTAGCTGCG GTGTATGTAG AGCAGAAACG GTTGAACGAT GCTATCAATT TGTATATGCG CATTCTCGAC AGCAACATCC ATCCAAATCC ACAGAACAAG CAAAAGTATC CTAAATTCAC GTGGGCAGAG TTGAACATTC TCTTGGAGTT GTATGTTCAA CAGCATTCGT GGCGTGTTGC AATAAAGGTC ACTAGGCTTG CCGCTAGATG GATCCAGGGT CGTGAAAAGG AGACATGGTG GGATGAGAAT GACGACGACA GCGAATTCGA CCCCAAACGC AGAAATGAGG TTATAGACAA GTTGACCGAT ACCGCGAAGA AGGCAGAAGC CAAGGAGAAA CCCTTTGAGT TGCCCATCGA TATCCGCTAT AAGATTGGAA TCTTACGCCT AGGTTTGGAT CAACGAGATG AAGCTTTGCA TCATTTCGAG TTCTTGCTAG ACGAACAACT GGAGATCCCT GACTTGTTGA AAGAGACTGG TAAGGCATTG GAAGAGAACG GGTACCATGA GGAAGCCCTT CAATTCTTGA CAAGGGCTAT TTTTCCAGAA GATACTGGGG AGCAACTTGA ACTAGTCAAT CTTCTAGGGA AGTGTTATTT AGAAATTGGA GACTACAGTC AAGCCAAGAG TGCCTACACC GACCTTCTTA CACAAGATTC GAAGAATTTA GACTACAAGT TGGCTCTAGC TGAAGCCCTC TACCATCTTG GAGAAGAAGT AGATTCGAAG AAATTATTAG TAGAAGTCTC AAAGGAGAGC CACAAGCTGC TCCTGGATGT AAATGATGAA CTAGATAAGT CTGCTGAAGA GAGTTTGTCA TTGATCAAGT CCCTGAAGTT TATCAGATCC AAGACAGCAA AGCTCACAGA TCAAGAGAAA CTCGAAATAG AAAACCATGC CAAACGAAGA GTGCTAGAGA TCTACAGACG TATGGAAAGA CTCGAAGAGT CTACTATAAA 
TGGTGATGAA GTAGCCATAA GTGCATGGAT GCAATTAGCT TCTCAGTTGG TGGATATGTT CATGAGTGTT CGGAGTTTCT TTCCCCGTGA CAAGAATCGT ACTTTCAAGG GTATTGTTCT CTATCGTAGA AAGAAGCAGA TGGGCATTGA CGAGAAGTTG GCTAGAGTAT ACAACTTGTA CGAAGGTATA ACCAATGACG AGAACTACTC CAGGCAGTTC TTGACTTCCA AGACTGAGTA CCGTGGTTTA AACTACGACC AGTGGTTTGT CATTTTCCTT CAGTATGTAA TCTATGTGTC GAAGTTCGAT CACAACACGG AGTATGCCAA CGAAATTGTA GAAGTTGCAA TGTCTGTCAG TGTCTTTGTT CAGGACAAGA ACAAGGAAGC ATTGTTGCGA ATCTTGAAGT TGAAGTTTGG CATTGAACGC AGCGAGGCTA GTTCCACAGT AACGACATTT GTCAGGTTTT TCCTTATATC CAACCAGTTC TCTCCCTTCG TGTATAAGTT TTTCATCTGT TGCTTTGCCT CGGGAATCAA ATTCTGGGAA ACGTTCACCA ACTACAACCA CCAGAAGTTT TTCTTGCGTC AGTTGAAGGC TCACGACTCT ATTATCCTCA ACAAAAAAAT CACAGGCATG GCTACGATCA CTGCTGATTT GAAGGATACT ACACTTCCTA AAGAGCACCC TGACCTTTTG TATGTGTATG CTAATTTGCT AGGAGGGAGC AGAAGTTATG TCTCCTCAGT TGTGTACTTG AATCGTGCCT ATAGGCACTA CGACAGAGAT CCGATGATTT GCTTGGTGTT GGGACTAGCC CATGTGCACA GGTCTATGCA GAGACTCAGC TCAAACAGAC ATATCCAACT CTTGCAAGGA ATAAGTTATG TGTTGGAATA CAGAGATCAC CGAAAACACA ATTCCACCTC CTATGAGTTG CAGGAGATCG AGTACAACTT TGGAAGACTC TTCCACATGT TGGGGTTGAG CTCGTTGGCC GTTAACCATT ACAATAAGGT TTTGGAGTAT CATGACGAAT TGTCTGAAGA TCCAACTTAT GATTTGTCTG TTGATGCAGC GTACAACTTG ACGTTAATCT ACAATATTAA TGGTAACACC CAATTGGCAA GACGCTTGAT GGAGAAGTAT TTGACAGTGT AATCTAGTAA AATTGATGTG TAAAGTAGCA AGGTTGTTGA TTAATTTTAA TATCGTATAG AGACAGCATA ATATACTAAA GTACATTACA
|
Protein sequence | MSSNVFETNE GAEHVNEANR VVRDAESDDE EEEMEDAMDI ERDEQVDALI EGFGIDSDDE DEDDGALFLS SSEDDFGSDG EPIEDDYNED YNLRDALRSA GNFKVRKKKS TAKSFYRRKV TQSENRDLDP EVRSYMSQAN EAFAHNDYQV ARNLYLEVVR LDKKNYNAYK TLGEISQHQG KLNQCCSYWF IAANLRPWDS KFWGDVAELS TQLGHTDQAL FCYNRAISSE HKKSARFILQ RALVYKEIKQ FGRALEGFQR VRQQYPTDAS IVKNLAAVYV EQKRLNDAIN LYMRILDSNI HPNPQNKQKY PKFTWAELNI LLELYVQQHS WRVAIKVTRL AARWIQGREK ETWWDENDDD SEFDPKRRNE VIDKLTDTAK KAEAKEKPFE LPIDIRYKIG ILRLGLDQRD EALHHFEFLL DEQSEIPDLL KETGKALEEN GYHEEALQFL TRAIFPEDTG EQLELVNLLG KCYLEIGDYS QAKSAYTDLL TQDSKNLDYK LALAEALYHL GEEVDSKKLL VEVSKESHKS LSDVNDELDK SAEESLSLIK SSKFIRSKTA KLTDQEKLEI ENHAKRRVLE IYRRMERLEE STINGDEVAI SAWMQLASQL VDMFMSVRSF FPRDKNRTFK GIVLYRRKKQ MGIDEKLARV YNLYEGITND ENYSRQFLTS KTEYRGLNYD QWFVIFLQYV IYVSKFDHNT EYANEIVEVA MSVSVFVQDK NKEALLRILK LKFGIERSEA SSTVTTFVRF FLISNQFSPF VYKFFICCFA SGIKFWETFT NYNHQKFFLR QLKAHDSIIL NKKITGMATI TADLKDTTLP KEHPDLLYVY ANLLGGSRSY VSSVVYLNRA YRHYDRDPMI CLVLGLAHVH RSMQRLSSNR HIQLLQGISY VLEYRDHRKH NSTSYELQEI EYNFGRLFHM LGLSSLAVNH YNKVLEYHDE LSEDPTYDLS VDAAYNLTLI YNINGNTQLA RRLMEKYLTV
|
| |
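Translation table 12 (the alternative yeast nuclear code) reads CTG as serine rather than leucine, so translating the gene sequence with the standard code would not reproduce the protein sequence above. The sketch below illustrates the difference on a short fragment taken from the gene sequence; `codon_table` and `translate` are hypothetical helpers written for this illustration, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg="S"):
    """Return a codon -> amino acid map; ctg='S' gives table 12, ctg='L' the standard code."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AA))
    table["CTG"] = ctg
    return table


def translate(dna, table):
    """Translate from the first base in frame, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Fragment from the gene sequence containing a CTG codon.
fragment = "GATGATTTCG GACTGGACGG C"
print(translate(fragment, codon_table("S")))  # table 12:  DDFGSDG
print(translate(fragment, codon_table("L")))  # standard:  DDFGLDG

# Start of the coding region (the first ATG in the gene sequence).
print(translate("ATGTCTTCCA ACGTGTTTGA GACCAACGAG", codon_table()))  # MSSNVFETNE
```

The table 12 result matches residues 74 to 80 of the listed protein sequence (`DDFGSDG`), whereas the standard code would place a leucine at the CTG position.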