Gene PICST_42941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_42941 
SymbolZAS1 
ID4838000 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1517562 
End bp1520006 
Gene Length2445 bp 
Protein Length794 aa 
Translation table12 
GC content39% 
IMG OID640389315 
Productzf-C2H2 Zinc finger, C2H2 type 
Protein accessionXP_001383909 
Protein GI150864904 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.48255 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.719128 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTTG GAGAGAGGTA CGTTTGCGAC GTACCAGAGT GTGGGAAGTC ATTCTCCAGA 
TCCGACCACT TATCTCGACA CAAATTTAAC CATGAACCTC GTGAGATCCA CACCTGTTCC
TGTCCTGGTT GTGATAAGCT GTTTGTCAGA AAGGATGTAA AGGAAAAGCA TGAATTACAT
CATAGAAGAA AGATTCAGAA ACAGAAACTA AAGTTGGCGA AGAAGCATAC GGCATCTGAT
TTGACAGTCA ATAGTAGTCT TGTTGAATCT ACCGACAAAC TAGAAGATGT ATTCGAAAAT
GGCGATCATG AGATAGTCAC CAATGGCAAC AGAAGCGAGG AGATACAAGA AAACCGTTCA
AAGATTGATC AATTGGGTTA TGCAACAACA CAACAACACC GGCAAAGTCA GCACACAAAT
CATGCAATAG GCGAAGGCAA TAATAGCTTA AGTGCACATG TTGCAGTTGC AGATCGGTTC
TATCCAATTG TCGATCATAA ACATCATAGT GAAAGTGAAA ACTTATCATA TTCTAGTCGT
TCAAGTTCAA TTGGCGAATA TACGAGCCAA CCAATCCCAA CTGAGCATAT CCCAGCCGAA
CATTCTCTAT TACCCATCGA TTTGACTCAG TGGCTCTTCA ATGATAATGA TTTCTTGAAC
CAAAAAAACG TCTTGAGTCC CTTTGATACC ACTTTGAGTG GACATACTAA TGCTAACCAC
ATGTTAGAAG AGGCTTTTTG TATATCACCT AAATTTCCAC AGTATAGTAC TCAGACTTTT
AGAGGAACGG TGTTGATTCA AAAGATATGT GGTTATATTC CACAATTGCA TGAATTTTTA
CTTGATACTG AGCAGTTACG CATATGTTTG GATATCTATT GGGCAGTTTA CCATGTTCAG
TTTCCAATAT TACATAGGCC CTCATTTAGT GCCGATGATG TTCACCCATT TTTGCTATTA
AGCATGATAA TGATGGGTGC TGGTTTGATG GACGTAAATG ACACCACCGC AATGTCCATC
ATCCCACATC CTCAGAAACT AGCTGATTGT ATTGCAGTGC CTTTGAGATG GCTCATATCT
TCATCAGAGG AGTTCGGGCT GCCTACAAGG GCATGGATGA TACAAAGCTT GGTCATATTA
GAGTCCTATG AAATGTTGTT TTCCAATAGA AAATTGCATG AAAGAGCATA CTTACATCAT
GGCCTCAAGA TACAGTTATT GAGGAGAAGC GCATTATTGG GAGGAGACCC ATTGAATAGA
ACCAGTGACG AGACTAACTT GACAGAAGAG AAAGACATTT GGAAAAAATG GATTGAGATT
GAGTCATTAA AGAGGGCTGC TCTCGTGTCT TTCTATTTAG ACACCTTTCA TGCTACTATT
TTTGGGCATG AAATCGTATT GTTCGCCCAT CAAATTAAAT TGCTGATGCC TTGTGATGAT
ATGTTGTGGG AAATGTCAGT AATAGACAAT AATAACTTGC CACCACAGAC TGAAACTCCA
AGGTTCATTG TTGCATTGAC CAAATTGCTT CACCAGGAAC AATCAGAAGC AAGTTCACTA
AGTAAAAAAT TGCTTTTGGC AGGTTTGTTA ACGATTAAAT TTCAGATGGA ACAAAAAGAT
CTTCAAATGA CATTCTTGGA TTGGAAATCA GTTGAAGAAT CGTGGAAATC GACAATATAC
AATGCTATAG AGGTGTGGAG GGAAGCAGTT GGAGACTGTT GTGATACTAG AAATGCTTTC
TATTTACCTC TGACAGTAGA GTCTACGAAT TCTGTTCGCT CTGGGTTATC TGTGAACGAT
ACCAAATGTA AATTTCCTAT CTATCACTTA TGTCAGGCAT TCATGAGGGT GAAGCAATAT
GATATGCTAA TCTATGCTGG GTCACCTAGA AGAATGAGTG TCAAGACAAC TGAAAGAGAC
TACAAAGTAG TAGAAGGAAG AATCAAGCAA TGGGCTAATA CTGCTAATGG GAAGATATCA
GTTCTTCATT GTTACATATT GTTAAATGAA ATATTGTTCA ATGGAGAAGA ACAATCGGTT
TACGACCCAG ACTCTGATCC CATTCTCTAC AGACCAAATA TAGTGGCATC ATCACTATTT
GTTATTTGGG TTTACAACTA TTCCCTATAT GGACCAGAGT CATTGGATCA AAGATATGCA
AAGCAGAACG TCGTAAGCGA AAAAGAAGAA GGCTACGCTT ATATCAGTAG AATATTTTCT
TCATTGACCA AGGGTACGGG CCAGAAGTCA TTGGACTATA AAAGTCTTGA AAAGTGTGCC
AAAGTAATTG ATGATATTCC GAATAAGCAT TATTTGGTTG GGTTATTGAA ATTATTCAAA
AACAAGTACA TTCTTTGTAG GTCAGAGATC TGCCGAGAGT ACGCCGGGCT AATTGAGAAT
TGTATTTTGA GAAGTACGGG AAAAGAAAGG GAAGCTTTTA CATAA
 
Protein sequence
MKVGERYVCD VPECGKSFSR SDHLSRHKFN HEPREIHTCS CPGCDKSFVR KDVKEKHELH 
HRRKIQKQKL KLAKKHTASD LTVNSSLVES TDKLEDVFEN GDHEIVTNGN RSEEIQENRS
KIDQLGYATT QQHRQSQHTN HAIGEGNNSL SAHVAVADRF YPIVDHKHHS ESENLSYSTE
HSLLPIDLTQ WLFNDNDFLN QKNVLSPFDT TLSGHTNANH MLEEAFCISP KFPQYSTQTF
RGTVLIQKIC GYIPQLHEFL LDTEQLRICL DIYWAVYHVQ FPILHRPSFS ADDVHPFLLL
SMIMMGAGLM DVNDTTAMSI IPHPQKLADC IAVPLRWLIS SSEEFGSPTR AWMIQSLVIL
ESYEMLFSNR KLHERAYLHH GLKIQLLRRS ALLGGDPLNR TSDETNLTEE KDIWKKWIEI
ESLKRAALVS FYLDTFHATI FGHEIVLFAH QIKLSMPCDD MLWEMSVIDN NNLPPQTETP
RFIVALTKLL HQEQSEASSL SKKLLLAGLL TIKFQMEQKD LQMTFLDWKS VEESWKSTIY
NAIEVWREAV GDCCDTRNAF YLPSTVESTN SVRSGLSVND TKCKFPIYHL CQAFMRVKQY
DMLIYAGSPR RMSVKTTERD YKVVEGRIKQ WANTANGKIS VLHCYILLNE ILFNGEEQSV
YDPDSDPILY RPNIVASSLF VIWVYNYSLY GPESLDQRYA KQNVVSEKEE GYAYISRIFS
SLTKGTGQKS LDYKSLEKCA KVIDDIPNKH YLVGLLKLFK NKYILCRSEI CREYAGLIEN
CILRSTGKER EAFT