Gene PICST_80414 details

Gene Information

Locus tag: PICST_80414
Symbol: MBT1
ID: 4851159
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1078811
End bp: 1081338
Gene Length: 2528 bp
Protein Length: 783 aa
Translation table:
GC content: 42%
IMG OID: 640392867
Product: MADS box transcription factor
Protein accession: XP_001387851
Protein GI: 126274145
COG category:
COG ID:
TIGRFAM ID:
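
The listed gene length can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state but which matches the listed 2528 bp; the function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp as given in the record above.
gene_length = span_length(1078811, 1081338)
print(gene_length)  # 2528, matching the listed Gene Length
```

Because the gene is marked as spliced, this span covers the full genomic locus including any introns, so it is not expected to equal three times the protein length.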


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.854363
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGAAA CCAATGAAGT TGACGCAACG TCAAAAACAA AACAAAGAAA ACATGTGAGC 
AGAGCATGTC TCGAGTGTAG GAAGAGACAC TTCAAATGCG ATGGGAAGGA GCCGGTATGT
GATCGCTGTG CCAAAGCCAA TAAGGAGTGT GTTTATGTAG AGAGCCATAG AGGAGGTCTG
CGAAAGAAAG GTGTATCACG AAAAGCTGAC AACAAGAACG GTGCGAGTGA AGGACTACAA
GATATCGAGG ATTTCGCGCT AATGAAGACC ACCAAGGATC TTAACTTTGT CCAGGGCAAG
GATAGCAATG TACGGGGCAG AAGCGGCAAC AATGATGAAA GTGAACTCTT CGACGAGTTG
TACAAGTTAC CTTGTGCTAA AGACAGCACC AAGTGTAGTG GACTCAATTG TCCAGGTAAA
CATGCCGTAC AGTATTTCCA GAGTCGACCC GACGGGAAAG CGTTGGACCT GAAGGAGGTG
GAGATCCTAA ACAATATCCG AAAGAGAATC AAGCTAGACC ATTCTGTTGG AAATATAGAT
TGTCTCTTTG CAAATCGCGA ACCAGTGAAG AATTTTGTCT CTTTCCGTGA GTTTGACTTC
GGACTCAAGA TCCACAACGC AGATCATTTT ATCAACTTGG CTTCCCTCAA CCAGGATGAC
ATTCTCAAGA AGTACTATGA GAAGTTCCAT GTAGCTCATC CGTTTTTTCC ATCCAATGAC
GAATTACTTG TTTACCTTGC TACACCTTCA GTGGCTAGAG AAGTTCTTTT GATAATGCAA
ATCGTAGGTG AGGGCGAAAC GTCTAATATT TATTCCAAGA ACATCGACTT GATCTCAGAC
AGATTAATCC AGTGTCTTGA GCTTATCAAA ACTAACAATG AGTTGGACCT TATGTCCATC
CAAGTGCTAT TGCTTGTATC AATTGTAGCA CATATTTCCT CTCTCCATGT CTTCAGCAAG
AAACTTAGAC ACTCCTGTGT TCACTTATTA CAGGAGTTGG AAATCAACAC GATCGATAAG
GATGAGGGCT TGCCTAATGT GCCTGTGCTA TCGCCAGAAT CTGACGATCC TTCTAAAACT
CCGAAACTAT TCCACTCTTC CAGATTAAAA CACATTTCCC ACAGCAGTAT ACTCAACGCA
GCCAGAAGAA CATACTGGGA GTTGTACTTC TTTGATGTTA TTATTGGCTC TGCTGACGGG
AGAACTATCA CAACTCTTGA CTCGTTACCA TCTCATATCA GCTATCCATC ATTTCCCCCA
AGAGAAGTAT TCGATTACAA GGGCAGATCA GAAGCCGCGA AACTCGTCAC GGACTCGGTC
AAGTTGAATA TTCAGATAAT CGAGAAAAAA CCGATCGACA CCATGCTTAC AAGGATGAAA
GCTGCCATTT CGAGCTGGGA AATGAAATTA GAAGAGCCTC AGATGTATAA TTCTCCGCCT
TTGATCCAAC AAAATGGTAC TGTCAACGAA GGTGTCCATC AAGCTATCTT GTTGTGCAAC
TATGCCAAAA TATTTGTCCA CCGTCCATTC TCGTTCTTGT GGAAAATCAA CTCGCCTCAG
AATCCTAAAT GTGGAGAGGA AGTGCTCGAA GCTAAGGACT TGCCAACACA GACTGACGCG
GATTCCAGAG CAATTATAGA AACGAGAAAA ACCATTGAAG CAGCAAACTC GATTGCCCAA
GTATTAATCG ATACGAATGC CTCTAAGGTC ACAGAAAGAA CTCCCTTGTT TGCCTGTGCT
CTTGCTTTAG CATCCTTGGT TCATATCAGT GCATATGTCT GGGTTGAAAG CACACTATTA
AACGACATTT CAAGGACATC AGGCTTAGAT CCTAGTGAAA TGGATGTTTA TGCCGAGTAC
ATTAAGCTCT CATTGTCTGC GATTTACCCT ATCTCTATGC ATTGGATCCT TTCTGGTAAG
TTGGCCAGAC ACATTAGAGA GTCTTTGAAT ACATTGCGTC CCCAATTGTA TTCCAAGTTA
AAGGACAGCT TGCCTCAGAT CGAGATCAGC ATAGAAAAGA TGAATTTGGG CGACTCCATA
AATGATACCA GTTCTGAACC TAGTCGAACT TACAGTACTG AGCTGTATGA TCCAAAAGAT
ACAGCTACTG TTTCCACTAG TCAGACTAGC ACTTCGGTTG GCTACACAGG GGAACCATTC
TTGAACGAAG CTAGTAATTA CGTTGAGCAC CAAAGATCTG GACTTGTTGT CAATGCCAGT
AATAATAACA ATAATAACAA TAATAACGGT AATAGTGCCA TTACGGCCAA CAATGTTGAC
CTTCACAGAA ACAGCATCTC CAATTTGCAA GGCTATAGCA GTTTCTTCCC TAGTGGAGTT
GAACAATTTG ATATTCCTCT CTCTGGACAG TTGTCTCCAG TTTCGGATAC AGGATGTGAT
TGGATTGATA AGGCATTATT GGATTACTTT GACGGAGAAA GCTTGAATAT GGTGAGCTGA
TGAAGAATAA CGACTAAATG GATATAAATG TAATAATGAT TAATATATTG GAAATACAAG
AATATTTT
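
The sequence above is displayed in space-separated blocks of 10 bases, so it has to be joined before any computation. As a minimal sketch, the helpers below (hypothetical names `clean_sequence` and `gc_fraction`) strip the display whitespace and compute the fraction of G and C bases. How the record's own GC content value was derived is not stated, so this is only one plausible way to compute such a figure.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for the 10-base display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGGCTGAAA CCAATGAAGT TGACGCAACG TCAAAAACAA AACAAAGAAA ACATGTGAGC"
seq = clean_sequence(first_line)
print(len(seq))          # 60 bases on a full display line
print(gc_fraction(seq))  # GC fraction of this line only, not of the whole gene
```

Passing the complete gene sequence text to `clean_sequence` should give a string whose length can be compared with the listed Gene Length, and whose `gc_fraction` can be compared with the listed GC content.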
 
Protein sequence
MAETNEVDAT SKTKQRKHVS RACLECRKRH FKCDGKEPVC DRCAKANKEC VYVESHRGGL 
RKKGVSRKAD NKNGASEGLQ DIEDFALMKT TKDLNFVQGK DSNVRGRSGN NDESELFDEL
YKLPCAKDST KCSGLNCPGK HAVQYFQSRP DGKALDLKEV EILNNIRKRI KLDHSVGNID
CLFANREPVK NFVSFREFDF GLKIHNADHF INLASLNQDD ILKKYYEKFH VAHPFFPSND
ELLVYLATPS VAREVLLIMQ IVGEGETSNI YSKNIDLISD RLIQCLELIK TNNELDLMSI
QVLLLVSIVA HISSLHVFSK KLRHSCVHLL QELEINTIDK DEGLPNVPVL SPESDDPSKT
PKLFHSSRLK HISHSSILNA ARRTYWELYF FDVIIGSADG RTITTLDSLP SHISYPSFPP
REVFDYKGRS EAAKLVTDSV KLNIQIIEKK PIDTMLTRMK AAISSWEMKL EEPQMYNSPP
LIQQNGTVNE GVHQAILLCN YAKIFVHRPF SFLWKINSPQ NPKCGEEVLE AKDLPTQTDA
DSRAIIETRK TIEAANSIAQ VLIDTNASKV TERTPLFACA LALASLVHIS AYVWVESTLL
NDISRTSGLD PSEMDVYAEY IKLSLSAIYP ISMHWILSGK LARHIRESLN TLRPQLYSKL
KDSLPQIEIS IEKMNLGDSI NDTSSEPSRT YNTATVSTSQ TSTSHQRSGL VVNASNNNNN
NNNNGNSAIT ANNVDLHRNS ISNLQGYSSF FPSGLSPVSD TGCDWIDKAL LDYFDGESLN
MVS