Gene PICST_29464 details


Gene Information

Locus tag: PICST_29464
Symbol:
ID: 4837478
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 328765
End bp: 330902
Gene Length: 2138 bp
Protein Length: 712 aa
Translation table: 12
GC content: 39%
IMG OID: 640388793
Product: C2H2 zinc finger protein
Protein accession: XP_001382294
Protein GI: 150863728
COG category:
COG ID:
TIGRFAM ID:
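
The Start bp, End bp, and Gene Length fields above agree with each other if the coordinates are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python sketch of the check, with a hypothetical helper name:

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the Start bp / End bp fields of this record.
gene_length = feature_length(328765, 330902)

# Split the span into complete codons plus any leftover bases.
full_codons, leftover_bases = divmod(gene_length, 3)

print(gene_length, full_codons, leftover_bases)
```

Under that assumption the span is 2138 bp, which gives 712 complete codons (matching the listed protein length) and two leftover bases at the end of the listed gene sequence.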


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTTCA GGTGCTCTTT CTTGAACTGT GGGAAAAAAT TCACCCGGAA AGACTACCTC 
GCAAGACACG AGCTTAACCA TACAAATGAA AAACCGTATA AATGTGAACA GTGTCAGTTC
AGTTTTACCC GGAGTGATCT ACTCAATAAA CATTTCAAGT CTCAGAGTCA CAGAAAACGA
AAGCAAGAAC TTTTCAAGCT GCTGGAGAAG ACAGAGGGCT TAGACAAGCC AGATCAAGAT
TTTGAAGATA TTGAATCCCC CCTGAAAAAA CTAAGGTTTC TGACTCGAAA TGAGCCTAGT
TCTCAGAAGC AGCCTTTAAT GACAGTAACA TCCTCACCTA TCGAAGATCA TATCTTTACA
GAGTCGAATG ATCTCCCACC AGGGTCCTTC AACAACTACT TATGGTTGTT TGACGACTCT
CTTGACGTCA GAGATGATAA TTTTGTTAGT AAACCATTGA ACCATCAAAT TGAACCGTCT
CTACTGTCAA CATCACACAT AGTAGAAGAA GAACTGTTTC GTGAGATTGA TGGCCAAAGG
AGACTTGCTG TATTGGAATT GCTAAATATT CATGAGGTGT CATCCTTGGC ACCTGATAGA
TTTTCTCTTT TCTTAGACTT GTACTGGAGT GAGTTCAATC CCACTTTTCC AATTATACAC
TATGCAACAT TCAATAACAA CGAAGCCGAT GTATATTTGC TAACTGCTAT GATATGCATA
GGTATGGCAC ATTCCCCATT AGAAGCCGAA TATGAGCTTT CAATAGTCGT TACAATGCAG
TTTAGAAGGC TTATATTTGA CGCAGTAGGC GATGATGTGG TTTTGAGATT ACCATTGCTT
CAATCACTAT TACTTCATAA CTTCGCCTGC AAGTATTATG GCGATAAGTT ACTTTACGAA
ATGTCCCAAC TTTTTCACGG CACTAATATA AACTTTTTAA GATTTACTGG ATTTTTCGAC
GATTTAGTCG AACCAAATAT GTCATCTTCT CCAAGTGCAA CTTACCACCA GTTGGAGACT
GACTGGAAAA AATGGATTCA CTACGAGACA TGTAAGAGAA CAGCATACTT CGCATTTGTT
TGTGATTCTC AACATGCCAC ATTATTCAAA CACCAAGTTC TTTCGGCGTT CAGCTTGCAA
ATAGATTTAC CCAGCACCGA TGCTGTATGG AATGCGAGCA ATCCTATTAC CTTTTCTGAA
ATGTATAGAT TACAACCTAG AGGACTTTCC TATCAACATA AGGTAGTCCT AAATTTACAG
AGTGGCCAGA CGTTATCAGC TCCAGGTCCA TCAATGCCTT CAGTTAAAGC GGAGGGCAAT
TGGCCCGAAT TTTTGTGGAG TTTGAGATCT ATGATGATGC CATACAAGGA GAGCCAAAAA
GAATACTCTT TAGATTGTTA CTCCCAGTTT TCTCGAAGTA TTTTACTACA TGGAATTATA
TCTATTTGTT GGGATATGAG ATGGAGGGGA TTATTTGACT TGGGTATCGT TTCAAAAAAG
AAATTGAGCG ACCTTTCTGG CAAACTATTG AGAGCTTTCT ATAACTGGAA AGGTTATCTT
GATTTGCATA TTTCAAGTGC AAACGAGCGA GCTCTTGGAA AAAACGTAGA CCTGGAACCT
TCCGTTGGAT TAAACGATTA CGGCCTATCT CCAGCATTCT GGTCAAACTT AAGTTCCTAC
CAATTAGGGT TAATTTCTCT TTTTGCAGAT ACAGCATCGA TTGTGAAGTA CGCTACAGAA
CTTAAAAACA GTAGACGGGC AGGCGTATCA CGTAATAAGA TTCATATTGA ATCATGGGCA
AGGTCACCAA ATGGTGATCA ATCAATAAGA GAAGCAGCTA GGTTTATCAG AATTATTTCT
AATGCTGAGA ATGAGCACAT TATTTCTATA CCACATATTC CATGGACCCT ATTTATTTCA
TGCTTAGTAA TATGGTGCTA TGAAACTAAT CGAGATTGTT TAAATGGGCT AACAAGTAGA
GATCACATCG AGTTATCCTA CGCAAAATAT TATAATCCTT CCGTTTCATA CTTTGACGAG
CAGGCAGTTA AGCACGATAC ACTTGAGTAT ATTAGCCTTG CAATTGATAA CGAAGCAGAT
GATGTTGAAA CCTCTGGAAA CTTTTGGAAA AGGCAAAA
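
The gene sequence is printed in space-separated blocks of ten bases, so the blocks need to be joined before the sequence can be counted or compared with the GC content field. The sketch below shows one way to do that in Python. The function name is hypothetical, and only the first row of the sequence is used as sample input.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence, copied with its ten-base grouping intact.
first_row = "ATGTCTTTCA GGTGCTCTTT CTTGAACTGT GGGAAAAAAT TCACCCGGAA AGACTACCTC"

row_length = len("".join(first_row.split()))
print(row_length)
print(f"{gc_fraction(first_row):.0%}")
```

Passing the full sequence as one multi-line string works the same way, because `str.split()` discards line breaks along with the spaces.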
 
Protein sequence
MSFRCSFLNC GKKFTRKDYL ARHELNHTNE KPYKCEQCQF SFTRSDLLNK HFKSQSHRKR 
KQELFKSSEK TEGLDKPDQD FEDIESPSKK LRFSTRNEPS SQKQPLMTVT SSPIEDHIFT
ESNDLPPGSF NNYLWLFDDS LDVRDDNFVS KPLNHQIEPS LSSTSHIVEE ESFREIDGQR
RLAVLELLNI HEVSSLAPDR FSLFLDLYWS EFNPTFPIIH YATFNNNEAD VYLLTAMICI
GMAHSPLEAE YELSIVVTMQ FRRLIFDAVG DDVVLRLPLL QSLLLHNFAC KYYGDKLLYE
MSQLFHGTNI NFLRFTGFFD DLVEPNMSSS PSATYHQLET DWKKWIHYET CKRTAYFAFV
CDSQHATLFK HQVLSAFSLQ IDLPSTDAVW NASNPITFSE MYRLQPRGLS YQHKVVLNLQ
SGQTLSAPGP SMPSVKAEGN WPEFLWSLRS MMMPYKESQK EYSLDCYSQF SRSILLHGII
SICWDMRWRG LFDLGIVSKK KLSDLSGKLL RAFYNWKGYL DLHISSANER ALGKNVDSEP
SVGLNDYGLS PAFWSNLSSY QLGLISLFAD TASIVKYATE LKNSRRAGVS RNKIHIESWA
RSPNGDQSIR EAARFIRIIS NAENEHIISI PHIPWTLFIS CLVIWCYETN RDCLNGLTSR
DHIELSYAKY YNPSVSYFDE QAVKHDTLEY ISLAIDNEAD DVETSGNFWK RQ
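
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. This matters when reproducing the protein sequence from the gene sequence. The Python sketch below is a simplified illustration with hypothetical function names. It builds the standard codon table, optionally applies the CTG reassignment, and ignores start-codon rules and ambiguous bases.

```python
BASES = "TCAG"

# Amino acids for all 64 codons in TCAG order (standard code, table 1).
STANDARD_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_codon_table(ctg_as_serine: bool) -> dict:
    """Standard codon table, optionally with CTG read as Ser (table 12)."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AMINO_ACIDS))
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(dna: str, table: dict) -> str:
    """Translate complete codons; trailing bases short of a codon are dropped.

    Raises KeyError for codons containing letters other than A, C, G, T.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


# Codons 61-70 of the gene sequence (second half of its fourth row).
fragment = "AAGCAAGAAC TTTTCAAGCT GCTGGAGAAG"

print(translate(fragment, build_codon_table(ctg_as_serine=True)))
print(translate(fragment, build_codon_table(ctg_as_serine=False)))
```

With the CTG reassignment the fragment translates to KQELFKSSEK, which is what the protein sequence above shows at residues 61 to 70. With the standard code the same bases give KQELFKLLEK.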