Gene PICST_68294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_68294 
SymbolMSN2 
ID4840655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp378459 
End bp381488 
Gene Length3030 bp 
Protein Length1009 aa 
Translation table12 
GC content43% 
IMG OID640391970 
Productzf-C2H2 Zinc finger, C2H2 type 
Protein accessionXP_001386089 
Protein GI150866470 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAACG ATTCAGACAT CTACCAGTAC CTGTCTGAGC CTTCACAACG GTACGGCTAC 
CAGGATGGCG AAACGTTTGA TTTCCAGACG ATCCAGGAGA ACCACCATGA CGTTAAGGAC
AGCGAAATAG TCCATTCAGA CATCGAGGAG AACCTCAATA GAGATCTTGA AAATGGAGGA
TTATCAAATG GAAAGACACT AAACCACACT TCCAGTAACA ACACCATCAA TAATAGCATC
AATAATTACA ACAACAATAT CAAAAACATC AATAACAACA GTATGAGTCT GCATAACAAT
ACAAACACTA TCAATAATAC CAATAATTAC AACAATAACA ATGTCAGCAA TACTATAAAT
TCAACAGACT ATGAGTCAAT CTTTGCTCTC AACTCGTTTG GACTTCCCAG CAATGCCCTC
TTTGGCGGAA CTGATCCGCT GAACATCGTC TATCGTCCCA ACAACATGAT CAGTAATCTT
GAATCACCGG AGAACTTCCC TGGTGTTGAT TTTGAGCACG CAGCAGAAAA CCAGAATCAG
GCTAATGTTT CGTTTTCGAT GCCTGGAGCT TTTCACGATG GTGATAATAT GGATCTTGAT
GAGTCGCCGC CGTTTACAAT CGAACTGGAG ACCTATTATG ATCCACATAT CGAAAGTACC
TCTATCAATC ACGAGACTGA GTCATTGTTT AAAGCCAATG GAAATGCCAA TACTAACGTG
GGAAACACTC TTTTGGCACC TCAGCAACGA ACTCCAACTC AACGGATTGA GTCCTCGTTT
TCACCATTTG ACAATTCTTC ACGAGTTTCA TCTTTTCGGG TCGAGAACGG GAATGGTATC
AACCCAAACG GAGGATTTGT GTTCAACCAT GTGACCACCA ACGCCGATAA CGGCGTTAAC
CAATCAGGTC CGGTACTAGG ATCTGTAGAC AATTACTATT CCAATATCAA TGGAAATAGC
AACGCCAATA ATATTGGCAA CACTAATAAT AGTAGCGGCA GCAAGAATGG TAATGATAAC
AGTAACAGTA ATGGCAATGG AAACGGAAAC GGAAACGGAA ATTTCAACAA CAACAATATC
AACAATACCA CACCTAATAG TCATAACACG AACAGTGTCT ACAATACTAA TAATGGAAAC
TACTTGAGCG TTCATTCAAA TATCACTGGT AATGCTTTTT CCAATGCCTC GTTGGGACAA
AACAGCAACA ATAATTTCTA CAATGAACTC TCTCCCATCA CCACCACCAC CTCGCTCACA
CCTTCTATCA GCTCCGTTCA TTCTACTCAG CCGTCTTTCT TCTCAGCACA CCAATTCCTA
ACTCGTAATT CGCTTGATCA AGGTCCTCCT ACTCATCTTG TGTCTTCTTC GTTTGACTTA
TTCAATAAGG GAAGACCTTC AATGGACAGC CAGCAGTCGT CGTCTCGTAG AAACCCCAGC
AGTGGTCGCT ACACGAGTTT CACCAACTCG TTGACCAATA TGATTCCGTT CATGGGGGAC
AGAAACCAGC GATCTCCAAT TTCTGGACCT CCTTCTCCAC AGTCGCAAAA CTCATCTTCT
TTCATGTCTC AACCTCCTCC TCAGCAGCCT CGCCATTTGA TCCGTAGTAT CTTCAAAAGC
AACTCTGCTC CTAACAATGT CCAGGCTGCC AACGACGAGC TCACCAATGC TTTTGTCATT
GACGAGTCCA GTGATCCGTT CGTTTCTGGA AGTGGAAACA CCGAAGACTT TTTGATGATG
AGCCCCACAA AAGAGGAGCC TGAGCTAGAA GCAATCGATG TTTCTGTTCA GCCAAAGAAA
GCAAAGAGGT CCAAGAGAAG TTTGTTCACA CGTTTCAAAG GTCCTTCCGT GAAACAAGAG
CCGATAGACG AGAACGAGAT GTTGATGATT GATGAGTTTG CTGTGAAAGA GGGCGAGAAT
TTGGACAATT CCACTTCGAC AAGTGGGCCA TTCCAGCCCA CTTCCATAAG TCGCACTCCT
TCAACAGCCA CAGGCAACTT CCTTGATTCT GCGTCTCTGA GTAACACTTC TCATAACCAG
CTGCAACTGC AATTGCTTCC ACAGTCGCAA ACTCAGCCTC AGAATCCTCA ATCTCAGGAG
CCAGACTATG CATCGCTCTT CGAAAATGTT GGTAAACGTA AGATCGTGAA CACCTCTAGT
TACAGAAAAA GTAAGACCAA GGTCAAGAAC GAAGATGGAA CCACGAGCAA CAATTCTAAT
TCCACTGTAG AAAACTCACC CATCTTGAAC GTCTCCTTGG GAAATAAAAA AATAAAGACG
GATTCAGAAA TTGGATCGGG AAATAATTCC GGACATACTA CTGAGAAATC CTCATTGCAC
AACTTAAGAT TGTCCCACCA GCGGTCCAAC CAGAGTAGCG GTAACAACAT CTCTGTTAAG
GAAGAGTACA GTGACCGAGG CAGTGTTGGA GGCATGAGTC TGAAATCGAA CACCTCGAAA
GATAATAGAC TCCACCCTCA AAGTTCTGAG GAAGTAGACA TTTCAGACGA TGAATCCACA
ACATCAACAA CTGCTTCATC CAACTTTGCT ACTGCCTCGA AGAGAATTCT TGGATCCAAG
TTGATGAAGA AGAAGACCTC ACCTGTAAAA ATGCCTGTGG CTACTGTGAT TAACAAGGGA
GTGGAGGTAG AAGTTGATTT GAAGTCGCTA GATTTACCTC CCAACACGCA GATCTTCCCC
ACCAGTATAA TAAATTCCAA GAATAGAACC AGGGGTCGTA AGGAAAACAA AGAAGCAGAT
ATGGTTGATC TGACCAAGAT CTACTTGTGT AACTATTGTT CACGTAGATT CAAGCGACAA
GAACATCTCA AAAGACACTT CAGATCGTTG CATACTTTTG AGAAGCCATA CGACTGTACG
ATTTGCAATA AAAAATTTAG CAGATCTGAT AACCTTAACC AGCATTTGAA GATCCACAAG
CAGGAAGAAG AAGCTGCTGC TCTTGAAAAG GAGCTTTTAG AACAGGGTTC TATGGCTAAG
ACTAAAGTAG AAGACGAGCT AATGGAGTAG
 
Protein sequence
MDNDSDIYQY SSEPSQRYGY QDGETFDFQT IQENHHDVKD SEIVHSDIEE NLNRDLENGG 
LSNGKTLNHT SSNNTINNSI NNYNNNIKNI NNNSMSSHNN TNTINNTNNY NNNNVSNTIN
STDYESIFAL NSFGLPSNAL FGGTDPSNIV YRPNNMISNL ESPENFPGVD FEHAAENQNQ
ANVSFSMPGA FHDGDNMDLD ESPPFTIESE TYYDPHIEST SINHETESLF KANGNANTNV
GNTLLAPQQR TPTQRIESSF SPFDNSSRVS SFRVENGNGI NPNGGFVFNH VTTNADNGVN
QSGPVLGSVD NYYSNINGNS NANNIGNTNN SSGSKNGNDN SNSNGNGNGN GNGNFNNNNI
NNTTPNSHNT NSVYNTNNGN YLSVHSNITG NAFSNASLGQ NSNNNFYNEL SPITTTTSLT
PSISSVHSTQ PSFFSAHQFL TRNSLDQGPP THLVSSSFDL FNKGRPSMDS QQSSSRRNPS
SGRYTSFTNS LTNMIPFMGD RNQRSPISGP PSPQSQNSSS FMSQPPPQQP RHLIRSIFKS
NSAPNNVQAA NDELTNAFVI DESSDPFVSG SGNTEDFLMM SPTKEEPELE AIDVSVQPKK
AKRSKRSLFT RFKGPSVKQE PIDENEMLMI DEFAVKEGEN LDNSTSTSGP FQPTSISRTP
STATGNFLDS ASSSNTSHNQ SQSQLLPQSQ TQPQNPQSQE PDYASLFENV GKRKIVNTSS
YRKSKTKVKN EDGTTSNNSN STVENSPILN VSLGNKKIKT DSEIGSGNNS GHTTEKSSLH
NLRLSHQRSN QSSGNNISVK EEYSDRGSVG GMSSKSNTSK DNRLHPQSSE EVDISDDEST
TSTTASSNFA TASKRILGSK LMKKKTSPVK MPVATVINKG VEVEVDLKSL DLPPNTQIFP
TSIINSKNRT RGRKENKEAD MVDSTKIYLC NYCSRRFKRQ EHLKRHFRSL HTFEKPYDCT
ICNKKFSRSD NLNQHLKIHK QEEEAAALEK ELLEQGSMAK TKVEDELME