Gene PICST_88332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_88332 
SymbolAZF1 
ID4838025 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp156946 
End bp158418 
Gene Length1473 bp 
Protein Length421 aa 
Translation table12 
GC content43% 
IMG OID640389340 
ProductDNA-binding transcription factor 
Protein accessionXP_001383315 
Protein GI150864481 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAATC AGCTACCAGC ACAGCAACAA TCTAACTACT ATACAAACTC ACAACCAGCA 
CAATTGCCAT ATCAACAACA ACAGCAGCAG CAACAACAAC AACAACAACA GCAGCAGCAA
CAACAACAGC AACAGCAGCA ACAGCGACAG CAGTATTTAC AGCAGAATTA CCCAGTTCAA
CAACAAGCGA CATCGCCTCA GATGGGCAAA CCAAGAACGT CAAAGCCCCG TAGAAATTCT
AACAACCGTG ATGCCATCTC GGATCAATCG TTGAAGTCTC CCATCGAAAG CAGCAGTAAC
GAAGGTAACT TGGTACCAGC ACAGCAGTAT GTAAAATCTG AAGATGGAAG ACCTTTATTG
GGCGCAACCA AGATTGACCA GTTGATGTTG GTTATTCAAG CCAGAGACAA AGGTATCACA
AGTCCTATTC AACAAGCTCC CGACGGAAGC ATTTTGGCTG CTCCTGACTA TTCCCTTTCC
AGAGACAAGA GCGAATTGGA TAATGGTGTT TTGCCACGTC CAATCAGCCT TGTTGGTGGG
GTGGACAAGC CTAGCAAGGC CAAGATAAAA GAGGACGAGG GTAGCGACGA TGAAGAATCC
AAAGGAAAGA GAAGAAAGCA CAAAAACCAG CAGTGTCCTT ATTGTTTCAA GTACTTTACT
CAGTCGACCC ATCTAGAAGT TCACATTAGA TCTCATATTG GCTACAAGCC ATTCGAGTGC
AACTACTGCC ACAAGAAGTT TACGCAAGGT GGCAATTTGA GAACACATTT GAGGCTTCAT
ACTGGTGAAA AGCCGTTCAC ATGCGACATC TGTAATCGAC AATTCAACAG GAAGGGAAAC
TTGGGTGCTC ACAAATTGAC GCACGAGAAC TTGAAACCAT ACGAATGCAA GTTGGATGGT
TGCGATAAGT CTTTCACTCA ATTAGGTAAT TTGAAGTCGC ATCAAAACAG ATTCCATCTC
AGCACTTTGA ACCATTTAAC ACAAAAGTTG GCTGAGTTAA GCGGTCTGTC GATCGAGAAC
TTGCCTCCAG ACGAGAAGGA CTTGCTTATG TATTTCAAAG ACTTGTACAA GAACTCAAAT
AAGGGTATTC GTGGCAGAGG TAAGGCTAAA TTATCCAAAG ATGATACTGG AGGTGCAACT
AGCAGTTCTC CAGATAATAG CCAGTTTAAT TTGCAATCCC AGTCACCCCA GCTGCAGCTG
CAGAATCTTC AAGCGCTTCC TCGACAACAG GACCAACAGC AAGGTTCACC GGAGTATTCG
CAATCGCAAC ATTCCCTCGA TTTTATGAAT CCCCATTTGG CTGGGTCTAT TAACGGTTAC
CAAGGGTAAT GTTTACGATG TACTAGTTTT TATTATGCGA GAGCCCCTCG CGTTTATTTC
ATTTCATGTT CATGACTTAT GCCATTTATT TCTTTATTGT TCTTTCTTTA TATTTTCATT
ATTAAGGTTA TTATATGTAT TATCAGAGGT AAA
 
Protein sequence
MRNQLPAQQQ SNYYTNSQPA QLPYQQQQQQ QQQQQQQQQQ QQQQQQQQRQ QTSKPRRNSN 
NRDAISDQSL KSPIESSSNE GNLVPAQQYV KSEDGRPLLG ATKIDQLMLV IQARDKGITS
PIQQAPDGSI LAAPDYSLSR DKSELDNGVL PRPISLVGGV DKPSKAKIKE DEGSDDEESK
GKRRKHKNQQ CPYCFKYFTQ STHLEVHIRS HIGYKPFECN YCHKKFTQGG NLRTHLRLHT
GEKPFTCDIC NRQFNRKGNL GAHKLTHENL KPYECKLDGC DKSFTQLGNL KSHQNRFHLS
TLNHLTQKLA ELSGSSIENL PPDEKDLLMY FKDLYKNSNK GIRGRGKAKL SKDDTGGATS
SSPDNSQFNL QSQSPQSQSQ NLQALPRQQD QQQGSPEYSQ SQHSLDFMNP HLAGSINGYQ
G