Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_80414 |
Symbol | MBT1 |
ID | 4851159 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 1078811 |
End bp | 1081338 |
Gene Length | 2528 bp |
Protein Length | 783 aa |
Translation table | |
GC content | 42% |
IMG OID | 640392867 |
Product | MADS box transcription factor |
Protein accession | XP_001387851 |
Protein GI | 126274145 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.854363 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAAA CCAATGAAGT TGACGCAACG TCAAAAACAA AACAAAGAAA ACATGTGAGC AGAGCATGTC TCGAGTGTAG GAAGAGACAC TTCAAATGCG ATGGGAAGGA GCCGGTATGT GATCGCTGTG CCAAAGCCAA TAAGGAGTGT GTTTATGTAG AGAGCCATAG AGGAGGTCTG CGAAAGAAAG GTGTATCACG AAAAGCTGAC AACAAGAACG GTGCGAGTGA AGGACTACAA GATATCGAGG ATTTCGCGCT AATGAAGACC ACCAAGGATC TTAACTTTGT CCAGGGCAAG GATAGCAATG TACGGGGCAG AAGCGGCAAC AATGATGAAA GTGAACTCTT CGACGAGTTG TACAAGTTAC CTTGTGCTAA AGACAGCACC AAGTGTAGTG GACTCAATTG TCCAGGTAAA CATGCCGTAC AGTATTTCCA GAGTCGACCC GACGGGAAAG CGTTGGACCT GAAGGAGGTG GAGATCCTAA ACAATATCCG AAAGAGAATC AAGCTAGACC ATTCTGTTGG AAATATAGAT TGTCTCTTTG CAAATCGCGA ACCAGTGAAG AATTTTGTCT CTTTCCGTGA GTTTGACTTC GGACTCAAGA TCCACAACGC AGATCATTTT ATCAACTTGG CTTCCCTCAA CCAGGATGAC ATTCTCAAGA AGTACTATGA GAAGTTCCAT GTAGCTCATC CGTTTTTTCC ATCCAATGAC GAATTACTTG TTTACCTTGC TACACCTTCA GTGGCTAGAG AAGTTCTTTT GATAATGCAA ATCGTAGGTG AGGGCGAAAC GTCTAATATT TATTCCAAGA ACATCGACTT GATCTCAGAC AGATTAATCC AGTGTCTTGA GCTTATCAAA ACTAACAATG AGTTGGACCT TATGTCCATC CAAGTGCTAT TGCTTGTATC AATTGTAGCA CATATTTCCT CTCTCCATGT CTTCAGCAAG AAACTTAGAC ACTCCTGTGT TCACTTATTA CAGGAGTTGG AAATCAACAC GATCGATAAG GATGAGGGCT TGCCTAATGT GCCTGTGCTA TCGCCAGAAT CTGACGATCC TTCTAAAACT CCGAAACTAT TCCACTCTTC CAGATTAAAA CACATTTCCC ACAGCAGTAT ACTCAACGCA GCCAGAAGAA CATACTGGGA GTTGTACTTC TTTGATGTTA TTATTGGCTC TGCTGACGGG AGAACTATCA CAACTCTTGA CTCGTTACCA TCTCATATCA GCTATCCATC ATTTCCCCCA AGAGAAGTAT TCGATTACAA GGGCAGATCA GAAGCCGCGA AACTCGTCAC GGACTCGGTC AAGTTGAATA TTCAGATAAT CGAGAAAAAA CCGATCGACA CCATGCTTAC AAGGATGAAA GCTGCCATTT CGAGCTGGGA AATGAAATTA GAAGAGCCTC AGATGTATAA TTCTCCGCCT TTGATCCAAC AAAATGGTAC TGTCAACGAA GGTGTCCATC AAGCTATCTT GTTGTGCAAC TATGCCAAAA TATTTGTCCA CCGTCCATTC TCGTTCTTGT GGAAAATCAA CTCGCCTCAG AATCCTAAAT GTGGAGAGGA AGTGCTCGAA GCTAAGGACT TGCCAACACA GACTGACGCG GATTCCAGAG CAATTATAGA AACGAGAAAA ACCATTGAAG CAGCAAACTC GATTGCCCAA GTATTAATCG ATACGAATGC CTCTAAGGTC ACAGAAAGAA CTCCCTTGTT TGCCTGTGCT CTTGCTTTAG CATCCTTGGT TCATATCAGT GCATATGTCT GGGTTGAAAG CACACTATTA AACGACATTT CAAGGACATC AGGCTTAGAT CCTAGTGAAA TGGATGTTTA TGCCGAGTAC ATTAAGCTCT CATTGTCTGC GATTTACCCT ATCTCTATGC ATTGGATCCT TTCTGGTAAG TTGGCCAGAC ACATTAGAGA GTCTTTGAAT ACATTGCGTC CCCAATTGTA TTCCAAGTTA AAGGACAGCT TGCCTCAGAT CGAGATCAGC ATAGAAAAGA TGAATTTGGG CGACTCCATA AATGATACCA GTTCTGAACC TAGTCGAACT TACAGTACTG AGCTGTATGA TCCAAAAGAT ACAGCTACTG TTTCCACTAG TCAGACTAGC ACTTCGGTTG GCTACACAGG GGAACCATTC TTGAACGAAG CTAGTAATTA CGTTGAGCAC CAAAGATCTG GACTTGTTGT CAATGCCAGT AATAATAACA ATAATAACAA TAATAACGGT AATAGTGCCA TTACGGCCAA CAATGTTGAC CTTCACAGAA ACAGCATCTC CAATTTGCAA GGCTATAGCA GTTTCTTCCC TAGTGGAGTT GAACAATTTG ATATTCCTCT CTCTGGACAG TTGTCTCCAG TTTCGGATAC AGGATGTGAT TGGATTGATA AGGCATTATT GGATTACTTT GACGGAGAAA GCTTGAATAT GGTGAGCTGA TGAAGAATAA CGACTAAATG GATATAAATG TAATAATGAT TAATATATTG GAAATACAAG AATATTTT
|
Protein sequence | MAETNEVDAT SKTKQRKHVS RACLECRKRH FKCDGKEPVC DRCAKANKEC VYVESHRGGL RKKGVSRKAD NKNGASEGLQ DIEDFALMKT TKDLNFVQGK DSNVRGRSGN NDESELFDEL YKLPCAKDST KCSGLNCPGK HAVQYFQSRP DGKALDLKEV EILNNIRKRI KLDHSVGNID CLFANREPVK NFVSFREFDF GLKIHNADHF INLASLNQDD ILKKYYEKFH VAHPFFPSND ELLVYLATPS VAREVLLIMQ IVGEGETSNI YSKNIDLISD RLIQCLELIK TNNELDLMSI QVLLLVSIVA HISSLHVFSK KLRHSCVHLL QELEINTIDK DEGLPNVPVL SPESDDPSKT PKLFHSSRLK HISHSSILNA ARRTYWELYF FDVIIGSADG RTITTLDSLP SHISYPSFPP REVFDYKGRS EAAKLVTDSV KLNIQIIEKK PIDTMLTRMK AAISSWEMKL EEPQMYNSPP LIQQNGTVNE GVHQAILLCN YAKIFVHRPF SFLWKINSPQ NPKCGEEVLE AKDLPTQTDA DSRAIIETRK TIEAANSIAQ VLIDTNASKV TERTPLFACA LALASLVHIS AYVWVESTLL NDISRTSGLD PSEMDVYAEY IKLSLSAIYP ISMHWILSGK LARHIRESLN TLRPQLYSKL KDSLPQIEIS IEKMNLGDSI NDTSSEPSRT YNTATVSTSQ TSTSHQRSGL VVNASNNNNN NNNNGNSAIT ANNVDLHRNS ISNLQGYSSF FPSGLSPVSD TGCDWIDKAL LDYFDGESLN MVS
|
| |