Gene PICST_80960 details

Gene Information

Locus tag: PICST_80960
Symbol: ZMS1
ID: 4851941
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 3250529
End bp: 3255091
Gene Length: 4563 bp
Protein Length: 1195 aa
Translation table:
GC content: 40%
IMG OID: 640393649
Product: Zinc Finger Protein C2H2-like protein
Protein accession: XP_001387192
Protein GI: 126276090
COG category:
COG ID:
TIGRFAM ID:
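
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive; the record does not state its convention, but this assumption reproduces the reported gene length. The function name `span_length` is illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


# Values copied from the gene record.
start_bp, end_bp = 3250529, 3255091
gene_length = span_length(start_bp, end_bp)
print(gene_length)  # 4563

# Nucleotides needed to encode the reported protein, counting one stop codon.
protein_length_aa = 1195
coding_nt = (protein_length_aa + 1) * 3
print(coding_nt)  # 3588

# Bases in the gene span that are not accounted for by the coding sequence.
print(gene_length - coding_nt)  # 975
```

The 975 bp difference is what one might expect for a gene flagged as spliced, but the record does not list intron or flanking coordinates, so this is only a consistency check.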


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.758804
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0450473
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CAATCCTACA GCGGTATTTG TTTAATTCTG CAATTCAACA GCGGCTAATA ATAAGAGTAT 
CATCGGCAAC GTCGTGTATT GAAAATTAAA GTCATTAGCA AGGAATTTGA CAACCCTATC
ATTAACAAAG TCAGTACCAG AATCATTTAC TGTCGTCTCC AAAACAAATT CACATAATTT
GGCTACTTAA AAATTGCTGT ACTATACATT TAAGCACTCT TTCCATCAGG ATCAATTTCG
TTGTTAAGAT CACATTTTTC ATATACTGAT ACCACTCATT ACAACTAGAA ATATACCATC
ATGAGCGTTC CCAATGTCGG TAAATCGCCG GTGTCTACGT CGCTGACGCC TGGTATCTCG
TTCACAGACA CTGCTCTGAC ACCTGGGTCG TCTTCTGCAA ATCCCCCAAT AGCTACTCAG
CCAATTCCCA AAAAGTCACA ACAGATCAAA ACTGACAAGC CCAGGCCTCA TGTGTGTACG
ATTTGTACGC GAGCTTTTGC CCGTCTTGAG CATTTGAAAC GTCACGAGCG TTCCCATACA
AACGAAAAGC CTTTCCAGTG TGCTGCTTGC GGGCGCTGTT TTGCACGTCG AGATTTGGTG
CTTCGTCATC AGCAGAAACT TCACACATCT TTGCCAAATG TTATGAGAAG AGGTTCTACG
AAGGATTTGG ATAATAACGA ACACATCATC GTTTTACATA ACAACACTCT GCCCAACGCG
CCGCTTCCTA ACGGATCATT TGGTGACGGA ATTGGACTCA ACGTTGGTAC TAGCGATATG
TCCAAGTTGC GACATGACGA CTCAGCCAGC GATTCGCTGG GATATCCATA CTCACCTCCA
CGTACAAACG ATGTAACTTA TAATCATCCA CAATTCCGTA CAGCGATGTT TGGAGATGGC
AATGGTCATA ATAATAGTAA TGGCAATGGA AATGGAAATG GGAATGTCAG TCACAATAGT
AATAACAATC ACATCAGCAA CAACAATAGC AATAATAATC ACACCACTCT CAATAATGCT
AGCCTCAACA GTCTCAATAA TCACATCAAT AACAACAATG ACGATATCGC CGGTTTTGCA
CTTTCAGTGC TGCCTGCAAA CAATAACACT AATAATAACG GTGGAGCCTT CTCGCCTCAT
CCATCGATTC TGAACCACGC CTCCCCGCCT ATTCTCGCAA GTTCTATACC CAACGTTTCC
AATACTGCCA ATAACAACAG CAACTTCCAG CAGCCTTCGC CACAGCTGAA CGAGTCTCCC
AATCGTAGTT CTGTAGCTCC TAAACCCATT CCAGCTCATC TTCAACAGAA ACAACTGTTC
TCTCCACCAA ATATTCCCAG CTCTAAACTG ACGCCTACAG CTAACCATAA CGACACTGCT
ACTCATAGCG CGATTAATCA TCATATCTCT AATCTTGCTG CTCAGTATAG GCATGCTTCT
TTCTCTGCAG CTTCCAATAT TTCATACACT AACTTGAAAG ATGCACTCAG CATCCAGCTG
CATAACAATA TGGAGCCAGC ACCGATGCAA GTAGACTTTG CTACACCTCA GTTATCTGCT
CAAGACGACT ATTCTCGTAA TTTGCTTCTT TCAGGCCTCG ATTTAAGCTC CTACAACATG
ATGGATTGGA ACAGTATCGA CAACCTCGAC TTGAACGAAG CATCTACAAC AGAAATGTCT
ACTGGTGCTG CTACTGCGAA ACAAAAGTCG ATCAAGAATC TCCAGCAGTT TTTTCTTGAG
AATGTAAGCA GTAGTAAAAA CAGTAATGGC TCGAACTTGT TGACAACACA TCAGTTTCTG
AACCCGAACC ACCCCCACCA TATCAAGGGA ACTACTCCGT TTGAATTCGG TGTCAATCCG
CCAAATGATG TCAACATTAT GCAGCAGTTG CTCGAGCAAA ACGGATTAAG ACCAGATGCG
GCAGCGTTCA GCTTGAATAC ATCCATGATT GATCAGAAGA AGCTTCAGAA AAAGAAGGCT
CCTCAAGCTT TGCATCCACC ACCAGTTAAA AGAACTAAAA GAGAAGATTC TACCACAGAT
AAGAGCAGCT CTGAAAGTAT TAACACTCCA GGAACTACCT TCACCACAAC TACAGGTATG
CCTATAAGCA TCTCAAACAA CGATGACGAC AATTGGCTCA AAGAAATCAT CGGAACACCA
TATGATACAA ACTTTCAGGC TAATAATCAG CATATGGGTC TTTTTGAACC TCCCAGTTTG
CTCAATTCAC CCAAGTTGTT GGCTCTGATG CCACAAATCC AACAGTCTAA TGCTGAAAAT
GCAGGCTCTC CCAATGAGTT GACTACTCTC TTCAGGTCCA GACAAAGCGA CTTGGTCAAC
CAGTTGAAAC CAAACTTCAG TTTGCCCACG CAGGCTAGAC TTGATTCTGG TACTACTTTC
GCAGGAGGTA TCACACAAGT TGCTGACCTC GGTATAGACT TTCCATTCAA GAAGGACAAG
TATTCATCGT TTTCACAAGA ATTGAGATCG AGAATCATTC TGATCAGCAA CATTTCAGAC
TCGCAGTTTC CTTCACTTGA AGACTTGAAC AGGTACATGA AGTTGTACGA GTTAGAGTTC
AATAGGTACT TCCCCTTCAT CCATTTGCCT TCATTGAAGA ACCCAATGGT GGACAATTTT
GAGAATATTC CCTTGTTGCT TTCCATGGCA TCTATTGGTG CCTTGTATTC ATTTCACGAC
TCAAACACTT TGCTTTTGTT CAATTTGTCC AAGTTCCACA TCCAAAGTTT CTTTGAAAAG
GAGATTACTT TGGACAATTT ACAGTTCAAG AAAGTTCCGT TGATGGCCCA CCAGTGCTTA
GTATTGCATA TATTTATTTC CATGTTCCTC AACGAGCCCA ATATGGTTGA CATAACTTCC
AGACAGATTA AGTCTATGAT TGGTCTTATT AAGTCGACTA ACTTCAATGA GCCTCTTGAG
CAATTCTTGG TTCCACCACC AAGCATTTTG GAGACAGTGG GTTCAGACAC TAGCAGTCAG
AGAGCACAAC AGCTCATTCA GAACAACTTT GATTACTTTA TTATGGCCCA GCTGAGAATT
AGAACGTTGC ACATGTTTTA CATGTTACAG ACATTCAGAT CTAGTATTAT TGGTTTGCCC
ATCTACTTGA ACTCGAAGTT CTTGAAGAAC GGAAATTACT GTTTTAATGA AGAGTTGTGG
AGATGTGAGG GATCACAAGC ATGGTTTAAA GAATTGTCGA AGGATAATAA AAAGACATTG
GTAGAACTCA GTAATGGTGA ATCTCAGGAG TCCTTGTTAA AACTTTTGAA GGATAATACA
CTCGTTAACC CACATGAGCC AAAACTTTCG TTGAACAACT CACTCGCCCT TTTGATATAT
TTGCACGAGT TGGTCCAGAC TGAGATCTCG TCCATGAAGC AGCGGTTCAC TTACTTGAAT
TGGAAGCTTA ATCACAAACC AAAATTGGAA CATATGGTCA GAGCTTGGGA AGGAAAGTTC
TTGAAGAACA ACGGAACTTT GCAAATTGAT TCCTATAGTA GATACTTGTT AAATTCGAAG
AATGAACTCA AGTTGATATT GCCTTTACAT GCACTATTAA AGATTAAGTT AGAAGTGAAC
TTCAATCCAA TAATAGCAGC AATTCTCAGA AAGGATTGGT CAAGCATGAA TTCCCAGTTG
AATTTGTTAT TGATTCAAGA GCCCATTCAC GAAAATATCA GAGCAAGTCT TCCTCATTGT
TTCGAAATTC TTCAATTGTG GATCTACAAC ATTGAAACAA TTAACTATGA TATTAAACAG
ACGTCACTAA GATCGCCAGT TTTCTTTGTT GCGTGTTTGT TTGTCGCGAT TCTCTTAGTT
TCAACATACT TGGATTTCTT GGAAGCGAAG TTCGAAAAGG GTACCAAATT CAACGATAGA
GAGCTCGTGG ATTGGTTATC CTGTGAAACG ATCATGTTGA AGGTTGAGAA AGTGTTATCT
CCTGTTCTTA AATCTTCGTA CTCTGAATTC TTGACTAAAC AAGCGCATGG TGCTTTCAAC
AATATTATTG ATGATAAAAC CGTTAACAAC ATTGGAGACT TAATTGAAAA GAAGGAAGGT
GTCACTGGTG ACATCATTGC TACTGGAGAC ACAAAGGAAA AGATCGACAA ACTTGAAAGC
ATTAGCAAGG AACTTGCCCA GGAGATTAAG AAAATCAATT TGTCTACAAA GTCACTTTAT
TTGGGAATTA GAATATTGGC TGATGCACCC ATCTGGCCCA TTGCCATGGG CTTTGCTGAG
GCTTTGAAGA ACAGAGCTAC ATATTTATCA TCTAGAAAGT TGTCTCAAAC AAGGAAATAG
GTCAAGAGCA TACAAAATAA GTGGAATTAC AAAGCATATC AGAGCATTCT TGGGATTAGA
TTGTACAGTA TAGTATTCGA AACAACATGA TATTGATGTC GACATTCGTT TCAAACAGCA
TTGATATTCG TTGGATTGCA CCAGACGCTT GTTTCTAACA TACACTGGGA TTTCACGAAT
TAACGGAAAA AATGTATAAA GTATATTTAC TTCTTTATTT TAATTACTAC GAATACTTCA
GTT
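
The sequence above is printed in space-separated blocks of ten bases, so it needs to be joined before any calculation. As a minimal sketch, the helper below strips the whitespace and computes the GC fraction. The name `gc_fraction` and the choice to count only G and C over all characters are assumptions made for illustration; the method the database used to arrive at its GC content figure is not stated in the record.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Return the fraction of G/C bases in a whitespace-blocked sequence."""
    # Remove spaces and line breaks between the ten-base blocks.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First three blocks of the gene sequence, used only as a small example.
example = "CAATCCTACA GCGGTATTTG TTTAATTCTG"
print(f"{gc_fraction(example):.1%}")
```

Pasting the full gene sequence into `gc_fraction` gives a value that can be compared against the GC content field in the gene information table.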
 
Protein sequence
MSVPNVGKSP VSTSLTPGIS FTDTALTPGS SSANPPIATQ PIPKKSQQIK TDKPRPHVCT 
ICTRAFARLE HLKRHERSHT NEKPFQCAAC GRCFARRDLV LRHQQKLHTS LPNVMRRGST
KDLDNNEHII VLHNNTLPNA PLPNGSFGDG IGLNVGTSDM SNDSLGYPYS PPRTNDVTYN
HPQFRTAMFG DGNGHNNSNG NGNGNGNVSH NSNNNHISNN NSNNNHTTLN NASLNMLPAN
NNTNNNGGAF SPHPSILNHA SPPILASSIP NVSNTANNNS NFQQPSPQLN ESPNRSSVAP
KPIPAHLQQK QLAINHHISN LAAQYRHASF SAASNISYTN LKDALSIQLH NNMEPAPMQV
DFATPQLSAQ DDYSRNLLLS GLDLSSYNMM DWNSIDNLDL NEASTTEMST GAATAKQKSI
KNLQHSKNSN GSNLLTTHQF LNPNHPHHIK GTTPFEFGVN PPNDVNIMQQ LLEQNGLRPD
AAAFSLNTSM IDQKKLQKKK APQALHPPPV KRTKREDSTT DKSSSESMPI SISNNDDDNW
LKEIIGTPYD TNFQANNQHM GLFEPPSSPN ELTTLFRSRQ SDLVNQLKPN FSITQVADLG
IDFPFKKDKY SSFSQELRSR IILISNISDS QFPSLEDLNR YMKLYELEFN RYFPFIHLPS
LKNPMVDNFE NIPLLLSMAS IGALYSFHDS NTLLLFNLSK FHIQSFFEKE ITLDNLQFKK
VPLMAHQCLV LHIFISMFLN EPNMVDITSR QIKSMIGLIK STNFNEPLEQ FLVPPPSILE
TRAQQLIQNN FDYFIMAQLR IRTLHMFYML QTFRSSIIGL PIYLNSKFLK NGNYCFNEEL
WRCEGSQAWF KELSKDNKKT LVELSNGESQ ESLLKLLKDN TLVNPHEPKL SLNNSLALLI
YLHELVQTEI SSMKQRFTYL NWKLNHKPKL EHMVRAWEGK FLKNNGTLQI DSYSRYLLNS
KNELKLILPL HALLKIKLEV NFNPIIAAIL RKDWSSMNSQ LNLLLIQEPI HENIRASLPH
CFEILQLWIY NIETINYDIK QTSLRSPVFF VACLFVAILL VSTYLDFLEA KFEKGTKFND
RELVDWLSCE TIMLKVEKVL SPVLKSSYSE FLTKQAHGAF NNIIDDKTEK IDKLESISKE
LAQEIKKINL STKSLYLGIR ILADAPIWPI AMGFAEALKN RATYLSSRKL SQTRK