Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_47398 |
Symbol | YIN0 |
ID | 4839279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 796910 |
End bp | 799312 |
Gene Length | 2403 bp |
Protein Length | 800 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640390594 |
Product | zinc finger transcription factor of the Zn(2)-Cys(6) binuclear cluster domain type |
Protein accession | XP_001385163 |
Protein GI | 150865801 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.654245 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGTTC AACAGAACGG CGTAGTTCAT GAAGCTGCAC ACGCCTTTTC CAAGCCTGTT GCCAAGAAGA GAATCAGGAA ACTTCCTCCA GAAAAGAGAC AAAAAGTGTC CACTGCTTGT GACTCTTGCA AGAAGAGGAA GTTCAAGTGC ACTGGTGAAA AACCTTGCTC CGTCTGTGTC AAAAAGGGTG TCCTGTGCAC TTATACGATC ATTGATAAAA GGTCGCTCAA GTCCGAAAGA ATGGCCAAGC TTCGTAAACA ACAAGAGAAT CCCGACTCGG AATCTAACTA CACAACTTCA ACTTCTAGTT CTACAGGACG TAGTCAAGCG ACGTCTGTTG GTAGTTCAGT TCATTCTATG AGCACTTCTT CGCCAGATTC CGTTTCCTCT ACACCTCCAT CGTCTGCTGC TAGTTCGATT TCGCTCCATT CTCACGGCTC CCTTACGTCG GGCATGAAGG AAGACTCGCC TCCCTCTGGC GAGTCTGCCT CGGTTATCAA AGAAGAGCAC TCACCCATGT CCGTAAACTC TCCATTGTCG ACCAATGGAA ATACATATAT CCCCAAATCA CTTCAGCCCC TCTTGTCGTT TCCCTTGAAT GACGATAAGG ATCAGAATGA ATCTGGTGCT TCTGCGAGGA ACGGAATCTC CAACCAAAAC GGGAAGTGTG TAATTCTCTT GAACGACGCA ACAGGAACAT TCAGATACAT GGGCGAAACT TCACCTCTTT CGCTATTGTA TGAAACAAGA AACGTCTTCA TCCAATACGT GGGGCGTACC CCATTTACCA CCAACTTGCA AAAGTGCCCT GTTGTGGACA AACCGTATTT CGTTCCACGT GATTACGTCA ACTCGAAGTT GCCCCCTCGT CACGAAGCCG ATATGTACGT GGAGATCTTC AAGCGTAATA TCAACGACTC ATACTTCGTC CTAGAGATGG ACTCGTTCCA CCACGACTTT GTTGATGCAG TTTATGCAGC AGAGGGCAAT GTAGACGAGG ACAATATTGT TAGCACCATG ATCGTGTACT TTGTGCTTGC CTTAGGTGCG CTATATCACG ATTACAATGC GGATGCATAT GCTCTGCCTG AAAGTGAAGC GTTCTTGAAG ACTGGCTTGA ACTTGTTGAA GGACACGGTC CAAGACTCGG AGCTCTGGGT AGTGCAGGTT CATTATCTTA TCTTCTTCTA CTACCAGGCC ACGATAAGGA AATCCACGGG ATGGATCCAC CTCAACTTGG CCATCAAATA CGCCCAGTCT TTAGGCTTGC ACCGTAATTT TGTTAACGAG CAGTATCCAT ATAATCCAGA AGAAATACAG TACAGAAAGA AGTTGTTCAG ATCACTCTAC ATTTCTGACA GAATCGCATC CATTTTCATT GGACGGCCGT TAACTATCAA CGACTACGAC TGGGACGATC CCTCGCGTGT AGAAGCAGGA TATGGAGCTT TAGATTTTAA CACTAAGGCC CAGATCGAAT TGTCACGGAT TACCTGTTTA ATTGGTAAGA TCGTGGGAAA CTTCTACCGC GACCGCATTA TTGATATCGG TCGTACCAAG AAATTGGCTG TAGACTTGAA ATTGTGGTCC ATCAACTTAG ATCAGCAATT GGCCATAGAA AATATCATGA AGCCAATGGA GATTCCTAAC AACGAGCACC ACGAAAATAC TCATATCTTG TTGTTGATAC ACTTATTGCA GCTCTATGCC ATCATGCTCT TGAGTCGTCC ATTCTTCATG TACGAAGCTG TTCGTAAGTT GTCACCAGAG TTAGGAAAAG TCCCTATGAA AAACAAGTCG TTGTCTCGTC ATTTCTACCA AGCAGCTATG AAGGCTTCTA TCTTGGCAAT CAAGTTGATG CATTACTACA TGAACACTGC CTACAAGGAA TGCATGCGTA AGGAGTGTTA TGTAGTTATA ACTTGTTCTT TCTACGCTTC CATCCTTATT GGAGTTGGTA TTGTCAATGG CGACTACCAG GATCAGGACT ATACGGAGGC CGATTTGTTT AACTATGCCA AGATGGCCAT CAGTGTGTTG AACCACTTCG GCCCAACCAA CCCTGGTGCC GACAGATATG CTGTGGTAGT TGGCGAGATG ATCGATGCTT TGAACTTGGC CAAGTCCACC AGAGCTTCGG AAAAGGCTGA AGAAGCCAAG GAACAGGTCG AAAAGATGTT AGAAACGCAC ATGGACACAA TGGATTTCCG TATCTTGAAT GACTACAACT TCATCGACGA TCCGAACAAC AACTTGCAGT CATTGATTGA ATTCCAGAGG CTATTTGTTC CTCAGGAAAC TGCTCCTACA ACCGGTATAA CCTCGGTTAA TGGCGATTTC ACTTTCACCA CTATGCCACA CGACTACGGT AACTATGAGC TCTTTTTTGG CGACAAGTAT TAG
|
Protein sequence | MAVQQNGVVH EAAHAFSKPV AKKRIRKLPP EKRQKVSTAC DSCKKRKFKC TGEKPCSVCV KKGVSCTYTI IDKRSLKSER MAKLRKQQEN PDSESNYTTS TSSSTGRSQA TSVGSSVHSM STSSPDSVSS TPPSSAASSI SLHSHGSLTS GMKEDSPPSG ESASVIKEEH SPMSVNSPLS TNGNTYIPKS LQPLLSFPLN DDKDQNESGA SARNGISNQN GKCVILLNDA TGTFRYMGET SPLSLLYETR NVFIQYVGRT PFTTNLQKCP VVDKPYFVPR DYVNSKLPPR HEADMYVEIF KRNINDSYFV LEMDSFHHDF VDAVYAAEGN VDEDNIVSTM IVYFVLALGA LYHDYNADAY ASPESEAFLK TGLNLLKDTV QDSELWVVQV HYLIFFYYQA TIRKSTGWIH LNLAIKYAQS LGLHRNFVNE QYPYNPEEIQ YRKKLFRSLY ISDRIASIFI GRPLTINDYD WDDPSRVEAG YGALDFNTKA QIELSRITCL IGKIVGNFYR DRIIDIGRTK KLAVDLKLWS INLDQQLAIE NIMKPMEIPN NEHHENTHIL LLIHLLQLYA IMLLSRPFFM YEAVRKLSPE LGKVPMKNKS LSRHFYQAAM KASILAIKLM HYYMNTAYKE CMRKECYVVI TCSFYASILI GVGIVNGDYQ DQDYTEADLF NYAKMAISVL NHFGPTNPGA DRYAVVVGEM IDALNLAKST RASEKAEEAK EQVEKMLETH MDTMDFRILN DYNFIDDPNN NLQSLIEFQR LFVPQETAPT TGITSVNGDF TFTTMPHDYG NYELFFGDKY
|
| |