Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_80251 |
Symbol | |
ID | 4851486 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 1952288 |
End bp | 1956407 |
Gene Length | 4120 bp |
Protein Length | 1030 aa |
Translation table | |
GC content | 41% |
IMG OID | 640393194 |
Product | predicted protein |
Protein accession | XP_001387613 |
Protein GI | 126274615 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGGTATATCA CTATAGTGGA GATAGTCATA AGTTGTAGTA ATAGCTTCGA TCGAGCCATC CATCGCCGAG ACTGCTAATA TATAACTATA CGTTTGTTTG CCCGAAGCTC GAATCTTTCT CATTTCACAA GACAGACAAT CAGAACGAAA GAATCGAATA CAACAGGACC AGAACATCTA CAATATCATC TACCAGACGT TACATCCACT ATACTCTACT CTAACTACCA CTCATAAATA CAGACAAACA GTGCTGTCCA ATTGTTGATT ACTCATCGTT ATCATATTTT TCACTTTCAT AAATTTCATA AATTTCATTT CACTTCAGTT CGTTCATGAC AAATAAAGAG GAAGAGCAGC TTCTTCAGGA TGCGCTGACG CTTTTGATGT TCGCCAACGT AGCGGCAAAG CAACAACAGC AACATATCAA GAACACTTCT GCTAGCCATA GTCCTCTTCA AACTGCTACG TCACCGGCAG CCTCCTCCGT GACCCCACCT CCGGCCAATC CTTCGACTTC GTATCAGCAA ATCCAACAAC TTCAAAACCA ACAAGCACAG CTTCAAACTC AGTTTCAAAA CCATATTCAA CATGGTGCGA TAAAAGTAGA ACCAATTCAA AAGAATCAAA CTCAACAAAG TCAGCTTCAG CCACAACTTC AACAGAGTCA TCAACCTCAG AAACAACTTC CACCGTTTGA TTACTATGGA GCTCAGAATC GCCCAGCAGT TTCTTCCAAT ATTGTCCCAA ACCCTGAAAA GTTCTCGACT TCGTCGTCTC TGTCTACGGT TTCGCCTCCT ACGGCTTCGT TTGTGCATAA TCAAGACCTC AGAGAGGCAT CTAGACCTCA CCTTCCACAT AAAAGTTCGT TGAGCATTCT TATGAACAGT CCAGAACCAC AATCCATGCC ATTCTCGCCT TCTAATGTGT CAACTCTTAC AAAAGATTCT CAGATTTCAC AAACTTCCCA GGTTTCACAG CAGAAACGCT CGGCTTCGCT TGAGGATTCT AATCTTCCCC AGAAAGGCAG TTTTGTCCAA CAACACAAAC GCTCTAAATC GACCCCAGAG ACCGACAAAA AGACACCGGC TTATAGACAC ATTGTTCTTT CTCCTGGCCC AGCAAACGTA GCTCTCGCCA GAGGCATCAA CTTGGAAACA GGAGAAAGAA ATAATAACAA TGCGGTGATA GCTGCTGCTG CATTAGCAGC TGCTGCAGAT ATACCGTTGC CGTTGAAACA TGTAAAAAGA AACGTAGAAC CAACAGTTTC TAAGTCTACA ACAGTAAAAT TTGCACACAT AGTATCAGAA CAAGAAGCTG AACCGGCTAA AACTGTTCAA GTAGCACAAC CTGTTGCAAA AGTTGAGCCG GAACAAGTTC CACTAGCAGT TCCAAAACCA GTAGTGGTTC CTACACTTCC TGTGACATTA CCGGTCATGG CTACAAATCT TCGAGAAAAG GAAATCAAAA TAACCAAAGA TGAAGATCAG CTTACCGAAG CAGAGGTAGA CGAGAAAACA GACGATGAAC GCACAGACGA TGAATATTCC TCAGCGAGAG GAAGTAAAAC AGAAACAGAA ACAGCCTCGG AAATCTTCGA AGTAAAATCA GAAGTTGAGG GCCCAAGACA AGAACAAGTA GTTCAACTTC AACTTCAACT TCAACAAGAA GCACAAGAAC AGGAAACTCA TGCACATCAA GAACAGGCCC AAGTTCAACA AGAACAAGAA CAAGAACAAA TTCAAATTCA ACGAGAACCA GAAGCAAAAC AAGTTGAACA AGAACAAGAA TCACAGGAAG TTCAACAACT TGTAGAAAGT GATGATCAGG ACTATGAACG GGAAGAAAAA GTGGACACAG AGATTCAACA TGAAGAACAA AAGATCGATT TTAAGGCACC TCCTCTTTCT TCATATCAGG TTGATCCTGA TTCTGGTTTA ATCGGATGTA TCTGTGGAAT TTCTGACGAT GACGGGTTTA CTATCCAGTG TGATGTGTGT TATAGATGGC AGCATTGTGT CTGTATGGGT TTCAAGACAA GTGAAGAAGT GCCCGAAGAC GAGTATACTT GTTATTACTG TGACAGAGCC AAGTGGGGAA AATTTGACCC ACTAGAATGT CGCAGAGAAA CGTTGGATCG TCTAGACAAC GATAGTCGAC AGCCTCAATT ACTGCAAGAA TTGAACGAAA GGCAACAGCA AGAACTTGCG GAATTTCAGG AGAAGCAGAA GCAAAATGGC AAGAGAAAGC CGCTGAACTC TGATAAAAAC GACAAGCGAC GTAAAGTGGA AAGTCAAGTA GAAAAGAGAA AGGACTCCTC AGCAACTGAA GTACAGCCAA AGAGTCAAAC TCCTAGCGAT ACCCTTCCAA ACAAAGACAA TGAGTTGTTA GAAGATGGAG TCACAGCAGA GTCGTATCAA TCTGTGTATT ATAAGCTTAG AAAGAACGAC TACAAGAGAC AGTCTATAAG CGATTTCTTT TCTAGAATAG GTACTGAGTT CTTCAACGAA TACCTTTCTT TGGACCCAAG CACAAAAGCC TCCAAAGAAT TGAGAGGGAT CAAAGTGATG TCTATGCCTG AGTTTAAAGC TATTAGATTG AGCAAATTGA ATTTGCCAAA CCATCTTAAT TATATTAGCG AGCACAAGAA CAATTTGTCC AAGAAGAAGT TGTTCAACGA CACTTCGATA CAGGTTAGAC AGTATTCCGA TAATCAGAAG CAGAAGTTTA ATGGAATCTC CAAGTTGTCA CTTTTTATCA CGTCCACTAA TAGCGATAGT TTGACGATCC CAGCCAACAC TGCGATTATC GAGTATTTAG GTGAAATTGA TCTTTTCAAG AACTACGTCA GGGATCCTAT CAACCAATAT TCCAGCTGGG GTACCCCAAA GCCCAAGGTT CTTCGTACAA GTCTCAAAGT AGCACAGGAC AACAATTTGG AAGTTGTTCT TGACTCCAGA TTTGTGGGTA ATGAATCTAG ATTCATTAGA AAAGCTTGTC CTGCTTCCAC AAATTGTAGA ATAGAGCCAA TTTACATTCC TGAAGACAAT TCGTTTCGCT TCTTAGTCGT GACAACAAAG CCCATTAATT TGAAGTCAGA ATCTGCTGAT GAAGAGTTGC GCCTTGAATG GGAATGGGAT CCACAACATC CCATCCTCAA GCTTTATGAA AATAACAACT CTGAGAAGTT TGAGCAATTG GTGAATGCTG ATAAATCAGC ATTAATCACC TACATCGACA ATATTTTGCA TTTTGCCGAG TGCGGCTGCT CAACAGCAAA CGCTTTTTCC TCTTGCGCTA TTTTTAAGAT CAAGAAGGCT ACTTCGTATT TGTTGCGTTC AACTAGAAAA GCATCATCTA TCAGCAATTC CAATCTAGCC AAATCAAAAG AGGAATTGAT ACTTCCTAGA AAGGAAAAGG AGTACATTTC TTGGGAAGAA AGATTGTTGG AAAGAGACAA CATCATACAG ATGAATCTTT CAGTAACTAC TGAACAGATT AGCGAAGAAT CAAAAGAAGA GTTGAAAAAC GATAATGAAG TGGAAGTCGA TAACGAAGAA GTTAAGGGAG AAATTAAGCC AAATTATCTT TTCAAGTTGC CTTACAAGCA GCAATTGCTC TCCAAGAACA GGGGCCGTAA GATCGTAGTT CGTCCTGGAT CTTCTTCTGT AGAAGGAGAT GTAAATAGCG GAACAGCCCA CGACGAGTTA CCATTCCCAA TTGTGTCTGA CTTAGTAGTC AAGATCGAGA AAAGTATTGA CGAAAAGCTT AAGCCAATGG TCAAGGAGGT GGAAGAGAAG ATAACTACTG TCCTTGAGTC ACTTCCAAAG CCTTCGGCCA CGCAAGAAGA AACGTCAAAG AAGGATGAAG TTATTGCTGC TGTGGAAGAA ATACAACATG TTCTCAAGAA GGAGACAGAA GATCAAGTCA GCTCTCTTCC ACCACTTTCA ACTGAAACTT CTTCAGAAAC TAAAAAGGTG GAAACTTCTG ATTCGACAAC AGCACAGGCA CCACCTCCGG TTGTCAAGAA ATTATCATTC GCTGACTATA AGAAGAAATT GAAATAATAG AATAGGTTAT ATATTTATAA TGAGTAATAA TGTACTATAT CATATTTTCA
|
Protein sequence | MTNKEEEQLL QDALTLLMFA NVAAKQQQQH IKNTSASHSP LQTATSPAAS SVTPPPANPS TSYQQIQQLQ NQQAQLQTQF QNHIQHGAIK NRPAVSSNIV PNPEKFSTSS SLSTVSPPTA SFVHNQDLRE ASRPHLPHKS SLSILMNSPE PQSMPFSPSN ISQTSQVSQQ KRSASLEDSN LPQKGSFVQQ HKRSKSTPET DKKTPAYRHI VLSPGPANVA LARGINLETG ERNNNNAVIA AAALAAAADI PLPLKHVKRN VEPTVSKSTT VKFAHIVSEQ EAEPAKTVQV AQPVAKVEPE QVPLAVPKPV EVQQLVESDD QDYEREEKVD TEIQHEEQKI DFKAPPLSSY QVDPDSGLIG CICGISDDDG FTIQCDVCYR WQHCVCMGFK TSEEVPEDEY TCYYCDRAKW GKFDPLECRR ETLDRLDNDS RQPQLLQELN ERQQQELAEF QEKQKQNGKR KPLNSDKNDK RRKVESQVEK RKDSSATEVQ PKSQTPSDTL PNKDNELLED GVTAESYQSV YYKLRKNDYK RQSISDFFSR IGTEFFNEYL SLDPSTKASK ELRGIKVMSM PEFKAIRLSK LNLPNHLNYI SEHKNNLSKK KLFNDTSIQV RQYSDNQKQK FNGISKLSLF ITSTNSDSLT IPANTAIIEY LGEIDLFKNY VRDPINQYSS WGTPKPKVLR TSLKVAQDNN LEVVLDSRFV GNESRFIRKA CPASTNCRIE PIYIPEDNSF RFLVVTTKPI NLKSESADEE LRLEWEWDPQ HPILKLYENN NSEKFEQLVN ADKSALITYI DNILHFAECG CSTANAFSSC AIFKIKKATS YLLRSTRKAS SISNSNLAKS KEELILPRKE KEYISWEERL LERDNIIQMN LSVTTEQISE ESKEELKNDN EVEVDNEEVK GEIKPNYLFK LPYKQQLLSK NRGRKIVVRP GSSSVEGDVN SGTAHDELPF PIVSDLVVKI EKSIDEKLKP MVKEVEEKIT TVLESLPKPS ATQEETSKKD EVIAAVEEIQ HAPPPVVKKL SFADYKKKLK
|
| |