Gene Information |
Locus tag | PICST_14132 |
Symbol | MDS3 |
ID | 4840553 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 987596 |
End bp | 992392 |
Gene Length | 4797 bp |
Protein Length | 1427 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640391868 |
Product | negative regulator of early meiotic expression |
Protein accession | XP_001386202 |
Protein GI | 150866560 |
COG category | |
COG ID | |
TIGRFAM ID | |
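
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (an assumption, although it agrees with the listed Gene Length); the variable names are chosen here for illustration only.

```python
# Values copied from the Gene Information table.
start_bp = 987596
end_bp = 992392
protein_length_aa = 1427

# Assumption: coordinates are 1-based and inclusive,
# so the span on the replicon is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Each amino acid is encoded by one 3-base codon.
coding_bases = protein_length_aa * 3

print(gene_length_bp)  # 4797
print(coding_bases)    # 4281
```

The span is longer than the 4281 coding bases, as would be expected for a gene listed as spliced.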
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0438243 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACTC TCATTCCCAC TTCAAGTGCT TGCCACAATC TTCAACTTCC TCATACAGAG AAAGACGATA GACTCAATCT CAATGTCAGA ACAGGATCCT CGTCGACTTT GTATAACTCA CTCGTATTGA CCTCAGGTGG GTTGACCATA GGCTTGGAAC TCGGAGACAC AACTATTGAA GAGATCTACA CTACTTTCCG TCAAAAGGTC TCCAGCTCTG TAATGAAGTT TAAGTCTCTT GATAAATATT TATCAGGAGA GCTCTTCTAT CTTAGTCTTA TAGACAAGGT CTGGAGCAGA GTTCCATACA ATAGAGATAC AGATGGACCA CGACCTAAAC CTCGTCTTTT CCACCAAATT TGTGCTCTCA ACAACTGCGT CTATCTTTTT GGAGGATTGG TGATCCCAGA AGACGCAAAT CCCACCAGCG AAGATGATAC CACTATCAAA GACTTTTTGG TTCCATGTAA CGATTTGTGG CTGTTCAATT TGGAGCTGAA CAAATGGAGA CTTTTAAGCG ATGGATCTGA GTATGAAACG AATCGAGCTA TACCTCAACC TCGGTTCAAC CACAAAATGA CGACTATCAA CAGTCTTTCG TTTGTGAATA AAAAAGACCA TTACGGAATT TTCATTGCTG GAGGAAAGGA TGGAAACTCC AATCCCATCT ATGACAATGC TGTCTTCGAT TTGGTGGAAA ACAAATACGT AGGCTCTGAA CCCATCAAGT TGAAGACAAC TACAGGCAAC GTTGAAAAGG ATTCCAAGCT GGGTTTGGAC AATATTGTAG CCAATGACAA GCACGAGTTG AATGTCGACT ACAACAACAG CATGATAGTG ACTTTCACAG AGGATATCGA CCACCAGCAT CACCAGCAGA TCAACGACAA GGGCAAAATC CATCATGAAT ACAGAAAATC AACATCGCAA GAACAGTCGT TCATCATCTA TACTCCGACT ATAGACAAAT CACAAGATCA GAATAACAAT CCATTGGTCT CCTTCCGTGC TGGAAGAAAA ATTCACAATG GAAAGCCGCT TCCCATGCAT CGCAAAAAAT TATTCACCAA TGGACAAAAC AGCGAGGGCC CTACTATTGA CAGACGAGGA TTGAACCATA CCATTCCCTT TAATCTTAGA TATCCCACTG GTGGGTTATT TGGGCAAAAT ATCGTCATCA CTGGGTTCTT GCCAGGAGAT TTTGACATTT CCATTTTCAT CTACAACAAG CCTACGGGCA AATGGTCGCG ATTGAATATC TTCTGCAATC ATGACTATGG TTCTCACAGA TTCTGGGGAG GTTTTGCCTG GCAGTCGCAT CACAAGGTTG TCCTCATTGG TAACTACTTG ACATCTCGGA CTACTAGCAG TATTCGTTTC TTCACTGTGA TGATAACTGT AAGTCTACCC ATCACGAACA TACTAGCTAG TTCTGAACTT GCTGGTGGCC ACCATCATGG ACCTGATGGA AGAAGGATTC CTCACCGTAG AGTCAGATTG AGTAATGCCA GTGAGAATGC CTCCAATCGT AAAAATATCG AAGAAGAATT GAGTGATGAG TCTACTAGCA GCAGCAGTTT GTTGAGACAT ACTGATGACG ATGATTATGA TGAAGGAATA ACAAGTTCTT CGCCAGAAGT ACTCACAGAA GAAGAGTTTC TCAAGAAAAC GTCTGATAGA AGACCCTCGT CGATATCCCA GGCTAGCGAC AAGACTTCTC CCACAGCTAT AAGTTTCAGT GAATATGTTC ATTACGCTGC TCCAAAGACG AACTTCACAA GCATTCGATC GGTATTTCCT 
CCTGCTGCTA TCACGTTAGG TCGGAATGCT TTTGACAGGT ACGGAGACTT GATTTCAGAC TTTGAGTTGG TGTCGTGCAA TGGGGATCGT ATTCCAGTTT CGTTGATTGT ATTGATGGAA AGGTGGGGTA GATATTTCAT CCAACTTTTG GCAAGAGGCT ATGTGTCTGC TGTAGATAAG TTTGAGTCTG ACCAGGCTCT AGGCGTATAC AACTCCGACA AACAGAGATT GAGAACGTCC AAGAGTGGTG GCAGCGGCAG TTCTGGAAGT TCGCATAGTG TAGCTCTGAA CAAGTTGAAA TCATCTATGG TTAAGTTCAA TTCTTTGTCG TCTGATGGAG TTCATTCTGT CCTGGAAAGT ACAAAATCAA ATTCGTCCAA AGAAGAATCT GCCAAGGATA AGTACTATAT CTCACTTCCT GTTCCGCAGA GTAAGGCTCC TGCCAAAGAT GTTCCTCAGT TCAGACTTCC TTTCCAAGAT GGTGCCAATT CATCTCATTC CAGCTTGAAA GAAACAAATA AAGAACCGTC TAATCCCTTG GAACAGAATG AATCTGAAGC TTCACCAGAA GGTCCTACAG CAGTAGATAT CAACAGAACT GATAGCAATA TCGGTCCCTC TGTTGCTTTG AATGACGGAG CCGTAGTGTC ATCGTCACAA ACTGCTGTAG ATCCTCATCT GATAGCTATT CCTCGTAAGG ATTCCGTTAG TTCCTTTTCC AGTAGTAATT CCTTGCTTGC CTCTCAACTT CAGGATATTC CGCCACAATT GCCTCTACCA TCTGACCAGA TTCCAGGCAT ACCTGCAGCT CCTGCCTCGT TCAAGAGTAG CACTTCTCGT AAAGGGTCAC AGGATCTTGG ATCACCAAGA GCTTCTTTGA TCCATACCTT GACTGCGTTG AGAAATATTC CTATCTCCAA GTCTCCTCGT GAGTCTCCAT TCGCATCTCC AAGAGCTTCT GTATCTGCTC AAGGAGCTTC CGTTGTTGGA GGAGGTGACT TGTACAGTTC TCCAGTTCCT AACTTACGTC CTGGTCGTTT CAGTCCCACG CTGGAGGTGA TGGGACGTTC CAAGTCTATT GACTATTCGT TGAGTGCGTT CAATGAAGAA TCGGCAGAAC AAGGGAACAA GACCGAACCC AGCCAAGAGA TGGAAAGAGA ACAAAAGAAC GCATCGTTGT CATCGACCCA CTCTGAAGCA CTTTCTGCTA CCTCTTCTGG AAGAGACAGC AATGATGATG ATTTTTCAGG AGCTTCTGAA GCGGATTTGT TGAGTCAAGC ATGCAAGGCT GCTAAGGAAA GTCATGGAAT GTTTGATAAT GCTTTGCTCA ACTTTGAGAA CTTAGATGCT GCCACGTTCA CCATGGAGCC ATCCTTGATT CCCAGAAAGT TGTATGTCCC TTTCCCTTCA ATAACGTTGA AGGGTTTCTG TGAATACTTA TACACTGGAC AGGTTGGAAA CAAGTGGCTC CTCGTTCCCA CGACCTTGGA CAATTTGTTA ATGTCCAAAT TCTTCAAAGT TCCGTTATTG TACGATTTGA TCAGTGAGGT TTTGTTTGGT ATTATTGGCA GAAAAGAAGC TTATATCGTC AAGGAAGGTA ACAGATTGAA GAGAAAGTAC TTCAATGCAC TTGAGGAGAT GGGAAAATCG TATGATAATT CCTTTAAGTT CCCTTTGAAT GAATACGAAG GATTCATGGA TACAGTGGAC GACGGGTATT TAGATATTGC CTTGTTGAAA AAGACTTCTA AGACTCATCA GAGTAGTTCT GTTGTTTCAA TGTCTAGAAG AAGAAGATAC CAAGCGGAAC 
ATGGTAATTC TCGTCGTCCT TCGACAGCAA CAGAGTTGAC TGAACCTGAT GAAGAGGCTG AAGATGAAAA TGAAGTTGAC GATGAGAAAG AGAAAGAGAA AGAAGAATCA CAAGGATCTA ATCTTCATAG CGAGTCGGCT GAAGACTCGT CTGATAGAAA AACTACTTCT ACTAGTGAAG ATGATGGAAT CGAATTAGGA TTCTTGAATG TTCACGAGCG AAATTCCACT ACTGTAGGAC CGCGCTCTAA GTCGGTATTT GACAGATCCA ACACTGTATA CAACGATTTC TTCCAGCACG CCTATGAACA GGCACATGCT GGAGATGACA ATGGCGAGAA GGCAGTAGGT CTCACCATTG AGCAATTGGT CAGCCCAGAT TCTGATATAC CAAATGACTA TGTAATTGAT TTGGTATATG AAGCATCGTC GATTGTGACG GATTTGAAGT TGATGTTGAG AGCAGCCAAC GTCAGATTGA TGAACAAGAT ATTGAACCAA AGTAGAGTTG AAGTTGAAGC CGAAATAGAG AGGTTGAAAA ACAGCCAAAT GGAATCTGAT TTGGAAGATA GTGACGACGG TGGCATTTCT ACCTCCAAAG CGAACGATAA CTTTGTTGCA CCAAAAGCAC ATACTAGTGT CAGTGCTAGT GCCAATGCAT CCACTGGCAG TAAACTTAGT TCTCCTGTTC CTGCTTTAGA TGGAGATTCT AGTTTTGTCC CTCCTCTGTT AGCCCATATG GACTTAAATC CGTCAACTTC TGCATCTTCG TTGGTTTCTT CGGGTCTGCT GAAGGCTCCG AAGGATTTGG AGAAGATGAA GTCGAATATC AGTTTCCGAA CTGTAGGCTC TCTTACGCCG TTCAAGCACA GCAAGACTGA AAATAAGATA GGCGGTAATA AGGACATTGA CAAGCGCTTC TCCAAGCTCA TGAAGAAGGA CGAGAAGTTG AGAACAAAGG AAGGCTTGCT CAAGAAAAAG GTAATGTCGA AAAGTTCGAC AAGCTTGAAT GAAATTGGCC AATCCACCGC TTCGTCAATA ATGTCGAAGA GTACAACGCA ACCCAAGAAG CACCATGGTT TGTTCCATCG CAGCCACAAG AAGAAAGACA CAGATGATTC TTCGTTAGAA TCTACAAAAT TAAGCAGAAC ACAGTCTACA ACTGCCAGTA TAGAATCCGC ACACAGC
|
Protein sequence | MSTLIPTSSA CHNLQLPHTE KDDRLNLNVR TGSSSTLYNS LVLTSGGLTI GLELGDTTIE EIYTTFRQKV SSSVMKFKSL DKYLSGELFY LSLIDKVWSR VPYNRDTDGP RPKPRLFHQI CALNNCVYLF GGLVIPEDAN PTSEDDTTIK DFLVPCNDLW SFNLESNKWR LLSDGSEYET NRAIPQPRFN HKMTTINSLS FVNKKDHYGI FIAGGKDGNS NPIYDNAVFD LVENKYVGSE PIKLKTTTGN VEKDSKSGLD NIVANDKHEL NVDYNNSMIV TFTEDIDHQH HQQINDKGKI HHEYRKSTSQ EQSFIIYTPT IDKSQDQNNN PLVSFRAGRK IHNGKPLPMH RKKLFTNGQN SEGPTIDRRG LNHTIPFNLR YPTGGLFGQN IVITGFLPGD FDISIFIYNK PTGKWSRLNI FCNHDYGSHR FWGGFAWQSH HKVVLIGNYL TSRTTSSIRF FTVMITVSLP ITNILASSEL AGGHHHGPDG RRIPHRRVRL SNASENASNR KNIEEELSDD SSPEVLTEEE FLKKTSDRRP SSISQASDKT SPTAISFSEY VHYAAPKTNF TSIRSVFPPA AITLGRNAFD RYGDLISDFE LVSCNGDRIP VSLIVLMERW GRYFIQLLAR GYVSAVDKFE SDQALGVYNS DKQRLRTSKS GGSGKTNKEP SNPLEQNESE ASPEAIPRKD SVSSFSSSNS LLASQLQDIP PQLPLPSDQI PGIPAAPASF KSSTSRKGSQ DLGSPRASLI HTLTALRNIP ISKSPRESPF ASPRASVSAQ GASVVGGGDL YSSPVPNLRP GRFSPTSEVM GRSKSIDYSL SAFNEESAEQ GNKTEPSQEM EREQKNASLS STHSEALSAT SSGRDSNDDD FSGASEADLL SQACKAAKES HGMFDNALLN FENLDAATFT MEPSLIPRKL YVPFPSITLK GFCEYLYTGQ VGNKWLLVPT TLDNLLMSKF FKVPLLYDLI SEVLFGIIGR KEAYIVKEGN RLKRKYFNAL EEMGKSYDNS FKFPLNEYEG FMDTVDDGYL DIALLKKTSK THQSSSVVSM SRRRRYQAEH GNSRRPSTAT ELTEPDEEAE DENEVDDEKE KEKEESQGSN LHSESAEDSS DRKTTSTSED DGIELGFLNV HERNSTTVGP RSKSVFDRSN TVYNDFFQHA YEQAHAGDDN GEKAVGLTIE QLVSPDSDIP NDYVIDLVYE ASSIVTDLKL MLRAANVRLM NKILNQSRVE VEAEIERLKN SQMESDLEDS DDGGISTSKA NDNFVAPKAH TSVSATHMDL NPSTSASSLV SSGSSKAPKD LEKMKSNISF RTVGSLTPFK HSKTENKIGG NKDIDKRFSK LMKKDEKLRT KEGLLKKKVM SKSSTSLNEI GQSTASSIMS KSTTQPKKHH GLFHRSHKKK DTDDSSLEST KLSRTQSTTA SIESAHS
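
Translation table 12 is NCBI's Alternative Yeast Nuclear Code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates this on two short in-frame fragments copied from the gene sequence above; the function and variable names are hypothetical, and only short fragments are translated rather than the whole record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in NCBI order: TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = ["".join(c) for c in product(BASES, repeat=3)]
STANDARD = dict(zip(CODONS, STANDARD_AAS))

# Table 12 differs from the standard code at one codon: CTG -> Ser.
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna, table):
    """Translate an in-frame DNA string, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence.
start_fragment = "ATGAGTACTC TCATTCCCAC TTCAAGTGCT"
# In-frame fragment from the gene sequence that contains a CTG codon.
ctg_fragment = "GATTTGTGG CTGTTCAAT"

print(translate(start_fragment, TABLE_12))  # MSTLIPTSSA
print(translate(ctg_fragment, TABLE_12))    # DLWSFN
print(translate(ctg_fragment, STANDARD))    # DLWLFN
```

The table 12 results match the corresponding stretches of the protein sequence (MSTLIPTSSA and DLWSFN), whereas the standard code would place a leucine at the CTG position.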