Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_88332 |
Symbol | AZF1 |
ID | 4838025 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 156946 |
End bp | 158418 |
Gene Length | 1473 bp |
Protein Length | 421 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640389340 |
Product | DNA-binding transcription factor |
Protein accession | XP_001383315 |
Protein GI | 150864481 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAATC AGCTACCAGC ACAGCAACAA TCTAACTACT ATACAAACTC ACAACCAGCA CAATTGCCAT ATCAACAACA ACAGCAGCAG CAACAACAAC AACAACAACA GCAGCAGCAA CAACAACAGC AACAGCAGCA ACAGCGACAG CAGTATTTAC AGCAGAATTA CCCAGTTCAA CAACAAGCGA CATCGCCTCA GATGGGCAAA CCAAGAACGT CAAAGCCCCG TAGAAATTCT AACAACCGTG ATGCCATCTC GGATCAATCG TTGAAGTCTC CCATCGAAAG CAGCAGTAAC GAAGGTAACT TGGTACCAGC ACAGCAGTAT GTAAAATCTG AAGATGGAAG ACCTTTATTG GGCGCAACCA AGATTGACCA GTTGATGTTG GTTATTCAAG CCAGAGACAA AGGTATCACA AGTCCTATTC AACAAGCTCC CGACGGAAGC ATTTTGGCTG CTCCTGACTA TTCCCTTTCC AGAGACAAGA GCGAATTGGA TAATGGTGTT TTGCCACGTC CAATCAGCCT TGTTGGTGGG GTGGACAAGC CTAGCAAGGC CAAGATAAAA GAGGACGAGG GTAGCGACGA TGAAGAATCC AAAGGAAAGA GAAGAAAGCA CAAAAACCAG CAGTGTCCTT ATTGTTTCAA GTACTTTACT CAGTCGACCC ATCTAGAAGT TCACATTAGA TCTCATATTG GCTACAAGCC ATTCGAGTGC AACTACTGCC ACAAGAAGTT TACGCAAGGT GGCAATTTGA GAACACATTT GAGGCTTCAT ACTGGTGAAA AGCCGTTCAC ATGCGACATC TGTAATCGAC AATTCAACAG GAAGGGAAAC TTGGGTGCTC ACAAATTGAC GCACGAGAAC TTGAAACCAT ACGAATGCAA GTTGGATGGT TGCGATAAGT CTTTCACTCA ATTAGGTAAT TTGAAGTCGC ATCAAAACAG ATTCCATCTC AGCACTTTGA ACCATTTAAC ACAAAAGTTG GCTGAGTTAA GCGGTCTGTC GATCGAGAAC TTGCCTCCAG ACGAGAAGGA CTTGCTTATG TATTTCAAAG ACTTGTACAA GAACTCAAAT AAGGGTATTC GTGGCAGAGG TAAGGCTAAA TTATCCAAAG ATGATACTGG AGGTGCAACT AGCAGTTCTC CAGATAATAG CCAGTTTAAT TTGCAATCCC AGTCACCCCA GCTGCAGCTG CAGAATCTTC AAGCGCTTCC TCGACAACAG GACCAACAGC AAGGTTCACC GGAGTATTCG CAATCGCAAC ATTCCCTCGA TTTTATGAAT CCCCATTTGG CTGGGTCTAT TAACGGTTAC CAAGGGTAAT GTTTACGATG TACTAGTTTT TATTATGCGA GAGCCCCTCG CGTTTATTTC ATTTCATGTT CATGACTTAT GCCATTTATT TCTTTATTGT TCTTTCTTTA TATTTTCATT ATTAAGGTTA TTATATGTAT TATCAGAGGT AAA
|
Protein sequence | MRNQLPAQQQ SNYYTNSQPA QLPYQQQQQQ QQQQQQQQQQ QQQQQQQQRQ QTSKPRRNSN NRDAISDQSL KSPIESSSNE GNLVPAQQYV KSEDGRPLLG ATKIDQLMLV IQARDKGITS PIQQAPDGSI LAAPDYSLSR DKSELDNGVL PRPISLVGGV DKPSKAKIKE DEGSDDEESK GKRRKHKNQQ CPYCFKYFTQ STHLEVHIRS HIGYKPFECN YCHKKFTQGG NLRTHLRLHT GEKPFTCDIC NRQFNRKGNL GAHKLTHENL KPYECKLDGC DKSFTQLGNL KSHQNRFHLS TLNHLTQKLA ELSGSSIENL PPDEKDLLMY FKDLYKNSNK GIRGRGKAKL SKDDTGGATS SSPDNSQFNL QSQSPQSQSQ NLQALPRQQD QQQGSPEYSQ SQHSLDFMNP HLAGSINGYQ G
|
| |