Gene Information |
Locus tag | PICST_68294 |
Symbol | MSN2 |
ID | 4840655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 378459 |
End bp | 381488 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640391970 |
Product | zf-C2H2 Zinc finger, C2H2 type |
Protein accession | XP_001386089 |
Protein GI | 150866470 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
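The coordinate, length, and protein-length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(378459, 381488)
print(length_bp)                  # 3030
print(protein_length(length_bp))  # 1009
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of this record.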
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAACG ATTCAGACAT CTACCAGTAC CTGTCTGAGC CTTCACAACG GTACGGCTAC CAGGATGGCG AAACGTTTGA TTTCCAGACG ATCCAGGAGA ACCACCATGA CGTTAAGGAC AGCGAAATAG TCCATTCAGA CATCGAGGAG AACCTCAATA GAGATCTTGA AAATGGAGGA TTATCAAATG GAAAGACACT AAACCACACT TCCAGTAACA ACACCATCAA TAATAGCATC AATAATTACA ACAACAATAT CAAAAACATC AATAACAACA GTATGAGTCT GCATAACAAT ACAAACACTA TCAATAATAC CAATAATTAC AACAATAACA ATGTCAGCAA TACTATAAAT TCAACAGACT ATGAGTCAAT CTTTGCTCTC AACTCGTTTG GACTTCCCAG CAATGCCCTC TTTGGCGGAA CTGATCCGCT GAACATCGTC TATCGTCCCA ACAACATGAT CAGTAATCTT GAATCACCGG AGAACTTCCC TGGTGTTGAT TTTGAGCACG CAGCAGAAAA CCAGAATCAG GCTAATGTTT CGTTTTCGAT GCCTGGAGCT TTTCACGATG GTGATAATAT GGATCTTGAT GAGTCGCCGC CGTTTACAAT CGAACTGGAG ACCTATTATG ATCCACATAT CGAAAGTACC TCTATCAATC ACGAGACTGA GTCATTGTTT AAAGCCAATG GAAATGCCAA TACTAACGTG GGAAACACTC TTTTGGCACC TCAGCAACGA ACTCCAACTC AACGGATTGA GTCCTCGTTT TCACCATTTG ACAATTCTTC ACGAGTTTCA TCTTTTCGGG TCGAGAACGG GAATGGTATC AACCCAAACG GAGGATTTGT GTTCAACCAT GTGACCACCA ACGCCGATAA CGGCGTTAAC CAATCAGGTC CGGTACTAGG ATCTGTAGAC AATTACTATT CCAATATCAA TGGAAATAGC AACGCCAATA ATATTGGCAA CACTAATAAT AGTAGCGGCA GCAAGAATGG TAATGATAAC AGTAACAGTA ATGGCAATGG AAACGGAAAC GGAAACGGAA ATTTCAACAA CAACAATATC AACAATACCA CACCTAATAG TCATAACACG AACAGTGTCT ACAATACTAA TAATGGAAAC TACTTGAGCG TTCATTCAAA TATCACTGGT AATGCTTTTT CCAATGCCTC GTTGGGACAA AACAGCAACA ATAATTTCTA CAATGAACTC TCTCCCATCA CCACCACCAC CTCGCTCACA CCTTCTATCA GCTCCGTTCA TTCTACTCAG CCGTCTTTCT TCTCAGCACA CCAATTCCTA ACTCGTAATT CGCTTGATCA AGGTCCTCCT ACTCATCTTG TGTCTTCTTC GTTTGACTTA TTCAATAAGG GAAGACCTTC AATGGACAGC CAGCAGTCGT CGTCTCGTAG AAACCCCAGC AGTGGTCGCT ACACGAGTTT CACCAACTCG TTGACCAATA TGATTCCGTT CATGGGGGAC AGAAACCAGC GATCTCCAAT TTCTGGACCT CCTTCTCCAC AGTCGCAAAA CTCATCTTCT TTCATGTCTC AACCTCCTCC TCAGCAGCCT CGCCATTTGA TCCGTAGTAT CTTCAAAAGC AACTCTGCTC CTAACAATGT CCAGGCTGCC AACGACGAGC TCACCAATGC TTTTGTCATT GACGAGTCCA GTGATCCGTT CGTTTCTGGA AGTGGAAACA CCGAAGACTT TTTGATGATG AGCCCCACAA AAGAGGAGCC TGAGCTAGAA GCAATCGATG TTTCTGTTCA GCCAAAGAAA 
GCAAAGAGGT CCAAGAGAAG TTTGTTCACA CGTTTCAAAG GTCCTTCCGT GAAACAAGAG CCGATAGACG AGAACGAGAT GTTGATGATT GATGAGTTTG CTGTGAAAGA GGGCGAGAAT TTGGACAATT CCACTTCGAC AAGTGGGCCA TTCCAGCCCA CTTCCATAAG TCGCACTCCT TCAACAGCCA CAGGCAACTT CCTTGATTCT GCGTCTCTGA GTAACACTTC TCATAACCAG CTGCAACTGC AATTGCTTCC ACAGTCGCAA ACTCAGCCTC AGAATCCTCA ATCTCAGGAG CCAGACTATG CATCGCTCTT CGAAAATGTT GGTAAACGTA AGATCGTGAA CACCTCTAGT TACAGAAAAA GTAAGACCAA GGTCAAGAAC GAAGATGGAA CCACGAGCAA CAATTCTAAT TCCACTGTAG AAAACTCACC CATCTTGAAC GTCTCCTTGG GAAATAAAAA AATAAAGACG GATTCAGAAA TTGGATCGGG AAATAATTCC GGACATACTA CTGAGAAATC CTCATTGCAC AACTTAAGAT TGTCCCACCA GCGGTCCAAC CAGAGTAGCG GTAACAACAT CTCTGTTAAG GAAGAGTACA GTGACCGAGG CAGTGTTGGA GGCATGAGTC TGAAATCGAA CACCTCGAAA GATAATAGAC TCCACCCTCA AAGTTCTGAG GAAGTAGACA TTTCAGACGA TGAATCCACA ACATCAACAA CTGCTTCATC CAACTTTGCT ACTGCCTCGA AGAGAATTCT TGGATCCAAG TTGATGAAGA AGAAGACCTC ACCTGTAAAA ATGCCTGTGG CTACTGTGAT TAACAAGGGA GTGGAGGTAG AAGTTGATTT GAAGTCGCTA GATTTACCTC CCAACACGCA GATCTTCCCC ACCAGTATAA TAAATTCCAA GAATAGAACC AGGGGTCGTA AGGAAAACAA AGAAGCAGAT ATGGTTGATC TGACCAAGAT CTACTTGTGT AACTATTGTT CACGTAGATT CAAGCGACAA GAACATCTCA AAAGACACTT CAGATCGTTG CATACTTTTG AGAAGCCATA CGACTGTACG ATTTGCAATA AAAAATTTAG CAGATCTGAT AACCTTAACC AGCATTTGAA GATCCACAAG CAGGAAGAAG AAGCTGCTGC TCTTGAAAAG GAGCTTTTAG AACAGGGTTC TATGGCTAAG ACTAAAGTAG AAGACGAGCT AATGGAGTAG
|
Protein sequence | MDNDSDIYQY SSEPSQRYGY QDGETFDFQT IQENHHDVKD SEIVHSDIEE NLNRDLENGG LSNGKTLNHT SSNNTINNSI NNYNNNIKNI NNNSMSSHNN TNTINNTNNY NNNNVSNTIN STDYESIFAL NSFGLPSNAL FGGTDPSNIV YRPNNMISNL ESPENFPGVD FEHAAENQNQ ANVSFSMPGA FHDGDNMDLD ESPPFTIESE TYYDPHIEST SINHETESLF KANGNANTNV GNTLLAPQQR TPTQRIESSF SPFDNSSRVS SFRVENGNGI NPNGGFVFNH VTTNADNGVN QSGPVLGSVD NYYSNINGNS NANNIGNTNN SSGSKNGNDN SNSNGNGNGN GNGNFNNNNI NNTTPNSHNT NSVYNTNNGN YLSVHSNITG NAFSNASLGQ NSNNNFYNEL SPITTTTSLT PSISSVHSTQ PSFFSAHQFL TRNSLDQGPP THLVSSSFDL FNKGRPSMDS QQSSSRRNPS SGRYTSFTNS LTNMIPFMGD RNQRSPISGP PSPQSQNSSS FMSQPPPQQP RHLIRSIFKS NSAPNNVQAA NDELTNAFVI DESSDPFVSG SGNTEDFLMM SPTKEEPELE AIDVSVQPKK AKRSKRSLFT RFKGPSVKQE PIDENEMLMI DEFAVKEGEN LDNSTSTSGP FQPTSISRTP STATGNFLDS ASSSNTSHNQ SQSQLLPQSQ TQPQNPQSQE PDYASLFENV GKRKIVNTSS YRKSKTKVKN EDGTTSNNSN STVENSPILN VSLGNKKIKT DSEIGSGNNS GHTTEKSSLH NLRLSHQRSN QSSGNNISVK EEYSDRGSVG GMSSKSNTSK DNRLHPQSSE EVDISDDEST TSTTASSNFA TASKRILGSK LMKKKTSPVK MPVATVINKG VEVEVDLKSL DLPPNTQIFP TSIINSKNRT RGRKENKEAD MVDSTKIYLC NYCSRRFKRQ EHLKRHFRSL HTFEKPYDCT ICNKKFSRSD NLNQHLKIHK QEEEAAALEK ELLEQGSMAK TKVEDELME
|
| |
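The record lists translation table 12, which is NCBI's alternative yeast nuclear code: it differs from the standard code in that CTG is read as serine rather than leucine. The sketch below shows how that affects the start of this gene. It is a self-contained illustration, not the pipeline that produced the record. The helper names `codon_table` and `translate` are hypothetical, and only tables 1 and 12 are handled.

```python
from itertools import product

BASES = "TCAG"
# Standard code (NCBI table 1); codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(table_id: int = 12) -> dict:
    """Build a codon -> amino acid map for table 1 or table 12 only."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    if table_id == 12:
        table["CTG"] = "S"  # the single sense-codon change in table 12
    elif table_id != 1:
        raise ValueError("this sketch only supports tables 1 and 12")
    return table


def translate(dna: str, table_id: int = 12) -> str:
    """Translate frame 1 of a DNA string, stopping at the first stop codon."""
    table = codon_table(table_id)
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence in this record
first_60 = "ATGGATAACG ATTCAGACAT CTACCAGTAC CTGTCTGAGC CTTCACAACG GTACGGCTAC"
print(translate(first_60, 12))  # MDNDSDIYQYSSEPSQRYGY
print(translate(first_60, 1))   # MDNDSDIYQYLSEPSQRYGY
```

With table 12 the eleventh residue is S, matching the protein sequence in this record. With the standard code the same CTG codon would be read as L.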