Gene Information |
Locus tag | PICST_80960 |
Symbol | ZMS1 |
ID | 4851941 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 3250529 |
End bp | 3255091 |
Gene Length | 4563 bp |
Protein Length | 1195 aa |
Translation table | |
GC content | 40% |
IMG OID | 640393649 |
Product | Zinc Finger Protein C2H2-like protein |
Protein accession | XP_001387192 |
Protein GI | 126276090 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.758804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0450473 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | CAATCCTACA GCGGTATTTG TTTAATTCTG CAATTCAACA GCGGCTAATA ATAAGAGTAT CATCGGCAAC GTCGTGTATT GAAAATTAAA GTCATTAGCA AGGAATTTGA CAACCCTATC ATTAACAAAG TCAGTACCAG AATCATTTAC TGTCGTCTCC AAAACAAATT CACATAATTT GGCTACTTAA AAATTGCTGT ACTATACATT TAAGCACTCT TTCCATCAGG ATCAATTTCG TTGTTAAGAT CACATTTTTC ATATACTGAT ACCACTCATT ACAACTAGAA ATATACCATC ATGAGCGTTC CCAATGTCGG TAAATCGCCG GTGTCTACGT CGCTGACGCC TGGTATCTCG TTCACAGACA CTGCTCTGAC ACCTGGGTCG TCTTCTGCAA ATCCCCCAAT AGCTACTCAG CCAATTCCCA AAAAGTCACA ACAGATCAAA ACTGACAAGC CCAGGCCTCA TGTGTGTACG ATTTGTACGC GAGCTTTTGC CCGTCTTGAG CATTTGAAAC GTCACGAGCG TTCCCATACA AACGAAAAGC CTTTCCAGTG TGCTGCTTGC GGGCGCTGTT TTGCACGTCG AGATTTGGTG CTTCGTCATC AGCAGAAACT TCACACATCT TTGCCAAATG TTATGAGAAG AGGTTCTACG AAGGATTTGG ATAATAACGA ACACATCATC GTTTTACATA ACAACACTCT GCCCAACGCG CCGCTTCCTA ACGGATCATT TGGTGACGGA ATTGGACTCA ACGTTGGTAC TAGCGATATG TCCAAGTTGC GACATGACGA CTCAGCCAGC GATTCGCTGG GATATCCATA CTCACCTCCA CGTACAAACG ATGTAACTTA TAATCATCCA CAATTCCGTA CAGCGATGTT TGGAGATGGC AATGGTCATA ATAATAGTAA TGGCAATGGA AATGGAAATG GGAATGTCAG TCACAATAGT AATAACAATC ACATCAGCAA CAACAATAGC AATAATAATC ACACCACTCT CAATAATGCT AGCCTCAACA GTCTCAATAA TCACATCAAT AACAACAATG ACGATATCGC CGGTTTTGCA CTTTCAGTGC TGCCTGCAAA CAATAACACT AATAATAACG GTGGAGCCTT CTCGCCTCAT CCATCGATTC TGAACCACGC CTCCCCGCCT ATTCTCGCAA GTTCTATACC CAACGTTTCC AATACTGCCA ATAACAACAG CAACTTCCAG CAGCCTTCGC CACAGCTGAA CGAGTCTCCC AATCGTAGTT CTGTAGCTCC TAAACCCATT CCAGCTCATC TTCAACAGAA ACAACTGTTC TCTCCACCAA ATATTCCCAG CTCTAAACTG ACGCCTACAG CTAACCATAA CGACACTGCT ACTCATAGCG CGATTAATCA TCATATCTCT AATCTTGCTG CTCAGTATAG GCATGCTTCT TTCTCTGCAG CTTCCAATAT TTCATACACT AACTTGAAAG ATGCACTCAG CATCCAGCTG CATAACAATA TGGAGCCAGC ACCGATGCAA GTAGACTTTG CTACACCTCA GTTATCTGCT CAAGACGACT ATTCTCGTAA TTTGCTTCTT TCAGGCCTCG ATTTAAGCTC CTACAACATG ATGGATTGGA ACAGTATCGA CAACCTCGAC TTGAACGAAG CATCTACAAC AGAAATGTCT ACTGGTGCTG CTACTGCGAA ACAAAAGTCG ATCAAGAATC TCCAGCAGTT TTTTCTTGAG AATGTAAGCA GTAGTAAAAA CAGTAATGGC TCGAACTTGT TGACAACACA TCAGTTTCTG 
AACCCGAACC ACCCCCACCA TATCAAGGGA ACTACTCCGT TTGAATTCGG TGTCAATCCG CCAAATGATG TCAACATTAT GCAGCAGTTG CTCGAGCAAA ACGGATTAAG ACCAGATGCG GCAGCGTTCA GCTTGAATAC ATCCATGATT GATCAGAAGA AGCTTCAGAA AAAGAAGGCT CCTCAAGCTT TGCATCCACC ACCAGTTAAA AGAACTAAAA GAGAAGATTC TACCACAGAT AAGAGCAGCT CTGAAAGTAT TAACACTCCA GGAACTACCT TCACCACAAC TACAGGTATG CCTATAAGCA TCTCAAACAA CGATGACGAC AATTGGCTCA AAGAAATCAT CGGAACACCA TATGATACAA ACTTTCAGGC TAATAATCAG CATATGGGTC TTTTTGAACC TCCCAGTTTG CTCAATTCAC CCAAGTTGTT GGCTCTGATG CCACAAATCC AACAGTCTAA TGCTGAAAAT GCAGGCTCTC CCAATGAGTT GACTACTCTC TTCAGGTCCA GACAAAGCGA CTTGGTCAAC CAGTTGAAAC CAAACTTCAG TTTGCCCACG CAGGCTAGAC TTGATTCTGG TACTACTTTC GCAGGAGGTA TCACACAAGT TGCTGACCTC GGTATAGACT TTCCATTCAA GAAGGACAAG TATTCATCGT TTTCACAAGA ATTGAGATCG AGAATCATTC TGATCAGCAA CATTTCAGAC TCGCAGTTTC CTTCACTTGA AGACTTGAAC AGGTACATGA AGTTGTACGA GTTAGAGTTC AATAGGTACT TCCCCTTCAT CCATTTGCCT TCATTGAAGA ACCCAATGGT GGACAATTTT GAGAATATTC CCTTGTTGCT TTCCATGGCA TCTATTGGTG CCTTGTATTC ATTTCACGAC TCAAACACTT TGCTTTTGTT CAATTTGTCC AAGTTCCACA TCCAAAGTTT CTTTGAAAAG GAGATTACTT TGGACAATTT ACAGTTCAAG AAAGTTCCGT TGATGGCCCA CCAGTGCTTA GTATTGCATA TATTTATTTC CATGTTCCTC AACGAGCCCA ATATGGTTGA CATAACTTCC AGACAGATTA AGTCTATGAT TGGTCTTATT AAGTCGACTA ACTTCAATGA GCCTCTTGAG CAATTCTTGG TTCCACCACC AAGCATTTTG GAGACAGTGG GTTCAGACAC TAGCAGTCAG AGAGCACAAC AGCTCATTCA GAACAACTTT GATTACTTTA TTATGGCCCA GCTGAGAATT AGAACGTTGC ACATGTTTTA CATGTTACAG ACATTCAGAT CTAGTATTAT TGGTTTGCCC ATCTACTTGA ACTCGAAGTT CTTGAAGAAC GGAAATTACT GTTTTAATGA AGAGTTGTGG AGATGTGAGG GATCACAAGC ATGGTTTAAA GAATTGTCGA AGGATAATAA AAAGACATTG GTAGAACTCA GTAATGGTGA ATCTCAGGAG TCCTTGTTAA AACTTTTGAA GGATAATACA CTCGTTAACC CACATGAGCC AAAACTTTCG TTGAACAACT CACTCGCCCT TTTGATATAT TTGCACGAGT TGGTCCAGAC TGAGATCTCG TCCATGAAGC AGCGGTTCAC TTACTTGAAT TGGAAGCTTA ATCACAAACC AAAATTGGAA CATATGGTCA GAGCTTGGGA AGGAAAGTTC TTGAAGAACA ACGGAACTTT GCAAATTGAT TCCTATAGTA GATACTTGTT AAATTCGAAG AATGAACTCA AGTTGATATT GCCTTTACAT GCACTATTAA AGATTAAGTT AGAAGTGAAC TTCAATCCAA 
TAATAGCAGC AATTCTCAGA AAGGATTGGT CAAGCATGAA TTCCCAGTTG AATTTGTTAT TGATTCAAGA GCCCATTCAC GAAAATATCA GAGCAAGTCT TCCTCATTGT TTCGAAATTC TTCAATTGTG GATCTACAAC ATTGAAACAA TTAACTATGA TATTAAACAG ACGTCACTAA GATCGCCAGT TTTCTTTGTT GCGTGTTTGT TTGTCGCGAT TCTCTTAGTT TCAACATACT TGGATTTCTT GGAAGCGAAG TTCGAAAAGG GTACCAAATT CAACGATAGA GAGCTCGTGG ATTGGTTATC CTGTGAAACG ATCATGTTGA AGGTTGAGAA AGTGTTATCT CCTGTTCTTA AATCTTCGTA CTCTGAATTC TTGACTAAAC AAGCGCATGG TGCTTTCAAC AATATTATTG ATGATAAAAC CGTTAACAAC ATTGGAGACT TAATTGAAAA GAAGGAAGGT GTCACTGGTG ACATCATTGC TACTGGAGAC ACAAAGGAAA AGATCGACAA ACTTGAAAGC ATTAGCAAGG AACTTGCCCA GGAGATTAAG AAAATCAATT TGTCTACAAA GTCACTTTAT TTGGGAATTA GAATATTGGC TGATGCACCC ATCTGGCCCA TTGCCATGGG CTTTGCTGAG GCTTTGAAGA ACAGAGCTAC ATATTTATCA TCTAGAAAGT TGTCTCAAAC AAGGAAATAG GTCAAGAGCA TACAAAATAA GTGGAATTAC AAAGCATATC AGAGCATTCT TGGGATTAGA TTGTACAGTA TAGTATTCGA AACAACATGA TATTGATGTC GACATTCGTT TCAAACAGCA TTGATATTCG TTGGATTGCA CCAGACGCTT GTTTCTAACA TACACTGGGA TTTCACGAAT TAACGGAAAA AATGTATAAA GTATATTTAC TTCTTTATTT TAATTACTAC GAATACTTCA GTT
Protein sequence | MSVPNVGKSP VSTSLTPGIS FTDTALTPGS SSANPPIATQ PIPKKSQQIK TDKPRPHVCT ICTRAFARLE HLKRHERSHT NEKPFQCAAC GRCFARRDLV LRHQQKLHTS LPNVMRRGST KDLDNNEHII VLHNNTLPNA PLPNGSFGDG IGLNVGTSDM SNDSLGYPYS PPRTNDVTYN HPQFRTAMFG DGNGHNNSNG NGNGNGNVSH NSNNNHISNN NSNNNHTTLN NASLNMLPAN NNTNNNGGAF SPHPSILNHA SPPILASSIP NVSNTANNNS NFQQPSPQLN ESPNRSSVAP KPIPAHLQQK QLAINHHISN LAAQYRHASF SAASNISYTN LKDALSIQLH NNMEPAPMQV DFATPQLSAQ DDYSRNLLLS GLDLSSYNMM DWNSIDNLDL NEASTTEMST GAATAKQKSI KNLQHSKNSN GSNLLTTHQF LNPNHPHHIK GTTPFEFGVN PPNDVNIMQQ LLEQNGLRPD AAAFSLNTSM IDQKKLQKKK APQALHPPPV KRTKREDSTT DKSSSESMPI SISNNDDDNW LKEIIGTPYD TNFQANNQHM GLFEPPSSPN ELTTLFRSRQ SDLVNQLKPN FSITQVADLG IDFPFKKDKY SSFSQELRSR IILISNISDS QFPSLEDLNR YMKLYELEFN RYFPFIHLPS LKNPMVDNFE NIPLLLSMAS IGALYSFHDS NTLLLFNLSK FHIQSFFEKE ITLDNLQFKK VPLMAHQCLV LHIFISMFLN EPNMVDITSR QIKSMIGLIK STNFNEPLEQ FLVPPPSILE TRAQQLIQNN FDYFIMAQLR IRTLHMFYML QTFRSSIIGL PIYLNSKFLK NGNYCFNEEL WRCEGSQAWF KELSKDNKKT LVELSNGESQ ESLLKLLKDN TLVNPHEPKL SLNNSLALLI YLHELVQTEI SSMKQRFTYL NWKLNHKPKL EHMVRAWEGK FLKNNGTLQI DSYSRYLLNS KNELKLILPL HALLKIKLEV NFNPIIAAIL RKDWSSMNSQ LNLLLIQEPI HENIRASLPH CFEILQLWIY NIETINYDIK QTSLRSPVFF VACLFVAILL VSTYLDFLEA KFEKGTKFND RELVDWLSCE TIMLKVEKVL SPVLKSSYSE FLTKQAHGAF NNIIDDKTEK IDKLESISKE LAQEIKKINL STKSLYLGIR ILADAPIWPI AMGFAEALKN RATYLSSRKL SQTRK
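The Gene Length field agrees with the Start bp and End bp fields if the coordinates are read as inclusive, 1-based positions (3255091 - 3250529 + 1 = 4563). The sketch below shows that arithmetic, plus one way a GC percentage could be computed from a sequence printed in space-separated blocks. The function names `gene_length`, `clean_sequence`, and `gc_percent` are hypothetical helpers written for illustration and are not part of any database tool. The inclusive-coordinate convention is an assumption inferred from the numbers in this record. The displayed gene sequence may include flanking bases, so a value computed from it need not match the reported GC content exactly.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return abs(end_bp - start_bp) + 1


def clean_sequence(raw: str) -> str:
    """Strip the whitespace used to group residues and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp fields of this record.
    print(gene_length(3250529, 3255091))  # 4563
    # First two blocks of the gene sequence, used only as a small sample.
    print(gc_percent("CAATCCTACA GCGGTATTTG"))
```

Taking the absolute difference keeps the length calculation independent of strand, since this gene lies on the minus strand but its coordinates are still listed with the smaller value first.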