Gene PICST_66069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_66069 
SymbolASH1 
ID4840489 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp532885 
End bp534913 
Gene Length2029 bp 
Protein Length466 aa 
Translation table12 
GC content47% 
IMG OID640391804 
ProductGATA-type transcription factor 
Protein accessionXP_001386121 
Protein GI150866495 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.651427 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTAG TGCAACTGCC AGCGATGTTC AACAATCACC ACCATAACGC CTCGTACCAA 
CAGGGCCACC TCCGGTCCCG CTCCTATAAC GAGATGTTGA CGCTGTTCAA CGCGTCCAGT
GCGTCTTCTG CTTCCTCTCA TTCTGCTCCT GGTTCCACAC CAAGACCTCC TTTCTCCACT
CCCGCCTACA TCAAGAAGAG ATCGCATTCC GATCTTGTCA AACAGCTGAC GGAAGGGCTT
GCTTTTTATG ACCAGAAACG GTCTAGAACA GCCCCTCCTT CTCCCCCCTA CGAAATCAAC
AATAAGTTCG GTCCTGGATC CGCACTCTCT GTATCCAGTT CCTCAACTGA TAAGCTCATC
ACCAGTAATG TTATCAGACC CAACCCCAAC CTCCGCTCAC GTTCCTTGCT GCCCTCCAAG
AAGTTCAGTA ACCCACTCTC GCCTTCCAAT TCACCCAACT CCTCGCCAGC ATCATCTCCT
TCCAAGCCTG CAACTGGAGC TGCACCTTCT ACTCCAGACA CAAGCTTGGC TGCAAAAGCG
CAACCGCAAA CGCAGACACA AGTGCAAACT CGATTGCAAT TGCGCGACGA AGACGAACAT
ATCCACAAAC GCATTAGGCT TCCCAGCATT TCTGCAGCAT TGCAATCGAC CAAGTCCTCC
TCGATTCGAT TGAAACCCGT CATCACTCCT CCCACTGTGT CTCTCGATTA CTTTGACACC
TACAAACCCA ACGACGAAAA CTGGAGATAC GAATTGCTCG ACACCATTAA CAAGGATTCA
AAATATTTCC ACTTGAACCA ATACAACTAC TTGAATAAAT ACGCCACTTC AGCCAAATAC
CAGCTGCAAC TGCAACAGCA TTCCATTAAC TCGTCTTATG GTTACAAACC CAACTTCGAC
TCAAGGATCA GCTCTAAGAT CGCTAACCAA CGTCCATCTC TTCCCAGTGT CAGCTCTATC
TGCCACGAAA AAAAAATCAA CTTCCCTTTC GAATCCAACT ACACTTACTT GAACAAAACC
TACATGAACG ACGTCGAAAA GTATCCCGAA TACTTGGAAT TGGCCCAGTC TTTGATCCAG
TTGTCGCAGC CACGTAGGCG AAGTGATTAC CACGACCAAT CTGCGGCAGT TTCGTCCACT
GCTCCAATAC CGACAGCTGT TACACCTCCA ACATCGTACT CGCCAAACTC TTCTGCTGCA
CCTCCTCCAA CCCGCTTGCC AATAGTCAAA CCACTGCAAT ATTCCCATCA AACCTACACG
GAAGCACCTG TACCTGTAGT ATATCATCAA AGCGGAACAA TAACTCCTCC AGCATCTAGA
TTGCCTAGCA TCCAAGCCAA CAAGTACAAC ACTTATCCAG GTATGATTCA GGAAAACAGC
ACTGCCAATA GCCATTCGCC TGAATTGGTT ACTTTGCACC ATGCTCCTGT CCAAGTCCAA
CCTCTTGCCA CCCCTGCTGC TCAGCAGAGT CACAAATTCA TCCCCATCAC CCCACCATCT
TCCAAGTCCA AGTCGAGAAC TGAGTTGTTG AAGTCTCCAC CCAAACATCA TTACAACCAC
CACAGCCCCA GAGTATGTAT TTCTTGTGGA TCGGATCAAT CGCCATGCTG GAGACCTTCA
TGGTCGATCA AGGAAGGACA ATTGTGTAAC TCTTGTGGAC TCAGGTACAA GAAGACATCT
GCCAGATGTC TCAATAACAA CTGTAAGAAG ATCCCAGCCA AAGGCGAGTG GTCGCTTATG
CAAAGCAAAG GCAAGACCAT GTTTGATGAT GGCCACGACG GCTACAGCTG CTTAGAGTGT
GGCTGGAGAG TCGAAATCAA GACTTAAACT AAATATCTAA TACTGTCGCG CCATCTTAAG
ATTACAAGTA TGGCCAGTAC TAGACGACAC AAGCGTCCAG GGTTTCTCCT AAGGTTTATT
GGGGAGCCAA CAACATGCAT TCGTCCGTTG AAGGACTAAA CCTGTACTAT GATGTATATT
TAATTCTTTT AGTTTCTAGA AAACAGTTTA ATGGTAATTA ATAGTTATG
 
Protein sequence
MSLVQSPAMF NNHHHNASYQ QGHLRSRSYN EMLTSFNASH SDLVKQSTEG LAFYDQKRSR 
TAPPSPPYEI NNKFGPGSAL SVSSSSTDKL ITSNVIRPNP NLRSRSLSPS KKLPSISAAL
QSTKSSSIRL KPVITPPTVS LDYFDTYKPN DENWRYELLD TINKDSKYFH LNQYNYLNKY
ATSAKYQSQS QQHSINSISS KIANQRPSLP SVSSICHEKK INFPFESNYT YLNKTYMNDV
EKYPEYLELA QSLIQLSQPP VTPPTSYSPN SSAAPPPTRL PIVKPSQYSH QTYTEAPVPV
VYHQSGTITP PASRLPSIQA NNHSPELVTL HHAPVQVQPL ATPAAQQSHK FIPITPPSSK
SKSRTELLKS PPKHHYNHHS PRVCISCGSD QSPCWRPSWS IKEGQLCNSC GLRYKKTSAR
CLNNNCKKIP AKGEWSLMQS KGKTMFDDGH DGYSCLECGW RVEIKT