Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_66679 |
Symbol | STP2 |
ID | 4851861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 3034868 |
End bp | 3037259 |
Gene Length | 2392 bp |
Protein Length | 664 aa |
Translation table | |
GC content | 44% |
IMG OID | 640393569 |
Product | zf-C2H2 Zinc finger, C2H2 type |
Protein accession | XP_001386910 |
Protein GI | 126275820 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.124089 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CATATAGATA GCTAATTTTT TCCAGCTAGG ATATCTACTT ATATCGCTGA AGTCTACTAG AAAAGCCTAT AAGTTTCAAC AACTTCCACT TCACCAAACA GCTCTACTGC TGACACCTCC TTGTATCTCT GCTGGGTCTT TTTCTGCTCC CTTCGCTACC GTTTCCAGAC CCATATTTGC ATCGCTGGAT TCCCCAAATT CAGGAGATGC AACAAGCACA ACAACGGGCC TCTTCTGTGG TGAGAATGAA GCTGCCCTCT GCTTCCACGA CCGGGATCTC CAGCTTCCCC TCTTCTCTCC AGAACAGCCA TGCTAAGACT ACTTCCTCGA GTAAAGGCTC CAGCTCTTTT TTCCACTTCA CGTTGGTAGC CATCTTGTAC AAGTTGGCTC AGCTTGGAAT CGCCTTCATT CTCCACTTAC TCGTACCTTC TTCCAGCAAC TTGGTGTTGC CCAGTGAACC TGAATCTTCA GGTGGCAAGT CGGAGAAACG TTTGTTTCCA TCTATTAGTA ATCCTATTAT CCAACACGAA CAAGTCAAAG ATGAAACCGA TGATGCCGAC GCAGTCATCG AAAGGCCCTC TACCCCTGTT ACCAATGTAG CCCAGCCCTG TTCGACTGGT CTTACCACCC GTTTGCTTCC TTTTAAGAAC GCCAAGGGTG AAATTGAGTG GGCTTTCACA GACGACGTGA CTCCCGGAAA CGAGTTGGAC GTATTCAAGA TGAGCCGTTT TGACGAAAAG AACGCCCCAA AACCTAAAAA CAACAACGAA AATGAGAACG AAGATGACAA CTTGTCGCCT ACGATTTCCA ACTCATCAAA CAACGAATCC ATCCTCTCTA ACAACAGTAA AAAGCTCAAG ACTAAGAAGG AGGACGAAGA CAACGAACAC ACAGGAACTC CTTTGACCAA CCACTCTTCC CCTTTCTCGC CTGAAGACGA CGATGATGCC AAAGAAAAAG AGGACTTGTC TTCTTCATCT TCTGATAACT TGGCTGATTC CAACGAAGGT GAACATGACG ATGTAGATGA TGAAGACAAA GTACACCAGT GCCCTCATTG CGATGCCACC TTTAAGATCA GAGGATACTT GACCAGACAC TTGAAGAAGC ACGCAACAAA GAAGGCTTAT AGTTGTCCTT TCCACAAGTT TAGCATCTAC ATCGACGAGA ATAATATCAC TCACAAGTGC CATCCAAACG GTGGCTTTTC TCGGAGAGAC ACATACAAGA CTCATTTGAA GTCCAGGCAT TTCAAATACC CAAAAGGCAC CAAAACTAAA GACAGAGCCA ATTCTCCTGG TACTTGTTCA ATGTGTGGCT GCTACTTCCC AAATGCTGAG ATCTGGTGTG AATTACATGT GGAAGGTGGA GAGTGTAAGT TCTTGCCAGA AGGGTTTAAA GGTAAGTCTA GAATCAAGAA CAGATTGAAG AAACAGTTGT CGAAGCTCAG AAAGTCCGAT GCCGAAGCCA ACGAATTGAC ATCGTCTATA ATCAACTCAC ATTACTTGAA CGGCACTTCC ATGAATGGTA ACTTGAATTC GAACATCAAT GGCAGTATGA ATGGTATCTC CAACAATAAT ATCCAAGATG TGCCTACTCC ATACATAAAT ACACCCAATT CAGTACACTC TATTGGAACT CCAGCACCCA TGAATAGCAA TCCTCAGTAT GAGTATCACA ACTCGCAGTC GCCGGAATCT GTAGTATCTC ACACACAGCA CAACACTCCC ATGTCCACGA AGTCACCTAT TAGTTACTCA CAAATATTGC AGGCACAAGT GGCGAACCAA AACTCGCAAG CCCGTACTTT TATGCAAAAG ACTGATATGC CCACTTCTAA CGACCATATG GTACCACAAT TTGAACAATT CACTAGAAAC TCCACACCTG TAAATGCAGT TGAAGACTAT GACGATGAAT ACTGTTTGGA TGTGGACCAG TTGAACACAG CTTCCTTGAA AAACTACAAC GAAGTAGTCG ACTTCCTCAA GAACCAGACC CCATCTACTT TTCTTCAACA GCAGCAGTTC CAACAACCTG CCAATGACAC GCATCCATCT GCACAGCTGG CTCATTTCCT TCAAGAGTAC CAAGAGTTAC CTCCACAGCC TCAAGTGCAA TACCAACAAC AATACCAACC TTCTCCTATG CAGCAACAAT ATGCTCACAC GTCGTTTAAT CAGGGCATGT ACCAAGCTTA AGTTAAGGTT CGTATCCTAC ATTCCATCCA TCATGAGCGA CGTTAATTAT TCTATTCACC TTCTATATTT GGGTAATCTA TGGGTTCATT TCAATAGCGA TCATCACAGA TGATATTCAG TGAAGTCTGT CTTTTTTGTA TAATACTACG GTTTTAGGTA TTACAAATAT ATTTTCAATA CAGAGATTTC GT
|
Protein sequence | MQQAQQRASS VVRMKLPSAS TTGISSFPSS LQNSHAKTTS SSKGSSSFFH FTLVAILYKL AQLGIAFILH LLVPSSSNLV LPSEPESSGG KSEKRLFPSI SNPIIQHEQV KDETDDADAV IERPSTPVTN VAQPCSTGLT TRLLPFKNAK GEIEWAFTDD VTPGNELDVF KMSRFDEKNA PKPKNNNENE NEDDNLSPTI SNSSNNESIL SNNSKKLKTK KEDEDNEHTG TPLTNHSSPF SPEDDDDAKE KEDLSSSSSD NLADSNEGEH DDVDDEDKVH QCPHCDATFK IRGYLTRHLK KHATKKAYSC PFHKFSIYID ENNITHKCHP NGGFSRRDTY KTHLKSRHFK YPKGTKTKDR ANSPGTCSMC GCYFPNAEIW CELHVEGGEC KFLPEGFKGK SRIKNRLKKQ LSKLRKSDAE ANELTSSIIN SHYLNGTSMN GNLNSNINGS MNGISNNNIQ DVPTPYINTP NSVHSIGTPA PMNSNPQYEY HNSQSPESVV SHTQHNTPMS TKSPISYSQI LQAQVANQNS QARTFMQKTD MPTSNDHMVP QFEQFTRNST PVNAVEDYDD EYCLDVDQLN TASLKNYNEV VDFLKNQTPS TFLQQQQFQQ PANDTHPSAQ LAHFLQEYQE LPPQPQVQYQ QQYQPSPMQQ QYAHTSFNQG MYQA
|
| |