Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31115 |
Symbol | CTA4.2 |
ID | 4837965 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 1805645 |
End bp | 1807942 |
Gene Length | 2298 bp |
Protein Length | 765 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640389280 |
Product | zinc finger transcription factor |
Protein accession | XP_001383632 |
Protein GI | 150864697 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGCCA AAACTAGACA GAGAAAGTCG TATCTGTGTA TCAATTGCAG AGATAAGAAA CGCAAGTGTG ATAGAGGAAA GCCCTGTTCT TCTTGTGTAA GGCTGGGAAT AGCTTCAACA TGTAAATACA ACACACCGAC ACTTACCGTG CTAGGACCAG CGTCCATCTT ATATGATAAC GAAAATGCTT CCAGTCTAGA ACCAGAAGGT GAAGATGATA TAAGAGACTT CTTGAGACTC AAGAATCCTG TCGGAATACC TGGAGGCAAG ATCAACTTCT TCAAGCTCTT GTGCAACAAC TTGATAGAGC CAGATTACAA TGTGTTTTCG TGGATTAATG TCTACAAATC GGACCCGGGA ATGAGGATCT ACTTGAAGAA CCTTCCTCTG AATAGCAAGC TCTTTGTAGA ACAAGTAGTA GTAGACGACT TCCACAGAGA CGATATTTTC AAGGGTCTTC TCAATCTTGA ATTGGACGAA TTGTACAGAC GGTCGGTGGG CCCGGACCTG GAGAACAAGA TAATACACAA CGAAGAGGAA TTGCTACTCG CCATCAGATC AGTATTGCCA AATACCGTAA TTATAGCAAC CCATGTGGAA AAGTTCTTCT CTGAACTATA CCCCGGTTTT CCATTTTTAG ATGAAAGCGC ATTTAGGAAA AGGCTAATGA GAATGGGTAA TGGACCTCGA ATAACAGACG AGAATGACTG GGCCATCATT GGTATATTAC TTATAATGGT GAGAATAAGC TATTTGTCCA GTCTCTGGAA CATATCTGGC AGTGATAGGT TAGGAGCTGT TGCAAACAGA GCCGACTACG AGCTTGTACT GTCGGCTACT ATAGGCCCTG AATTTATTCG ATTAGCAAGA AGTTGTTTAA AGCAATATGA TCTCCAGTCT AGAAATTCGT TGTGGGTTCT ACAATGCAAT ATCTTCCTCC AGATCTATGA GCGGGTTTCG CCTGAGGAAG GAGAGTGTGC TACAGGAAAA AGCAGTCAAA TCATTCATGC CTCGTTGATT CAGCATGCAC TATCATTAAA GCTCCACATT GACCCAGACT TGCTCCCTCA GCTATACGGC AAGAATGAAA AGTACAAGAA TCTCGTGAGA AAGATTTGGC ATTTCCTCGT TTCTTATGAT TATTTTGATA GTATTAGCTA CGGCAATTAT CCTTCGACTA CAAAAATGGT GTTTAATACA AGGCTACCAA CTCATATTGA AGGAAATGAG AATATTCAGG ATACAAAGTT GGAAAAGGAA GTTCTTGGTT CGTTTGAATT TCTCGACTTC ACTTATGGAC CAGTGCATGA CCTTTTAGTT GATATCCAGA GCTTGAAAAC CGATCATAAA ATTGAGAGAA TAGTTTTTCT TGCAACTGCA ATAGAGTCTA CTATTGCTTC AATAATTGGA CAAGTTGATA CCTTTTTTTC TGTTGAAACG AATCCCAGTA CAGATAAAGG AATTAGATTT GCTCTCTTTC TATACCTCAA GTTCTTTCTT TTAACAATTT TTTCTTATCT ACAGCTGTAT TATGAGAGGA CTAAGCAGTA TGAAACGGCA TTTTTCTACT TCAAGAAAGT ATTAATTATT GGATCTTATG AAATATTGCC AGCAATGTTT AGATTGGTTG GTTCATTTGG AGACCACTTC AAGAAAGAGA CATTCATGTT GACTCCCCAG ATTATCCAAT TGTCATGTCA CAGACTAATC ATTACCTTAG TTTCAGCATT TATTCGAATC GAAATTACTT TGAAGAGCAA TGGTCAGGAG CTATCAGGTC TACATGGGGT AAAGATCAAA GTATTGCAGA TACTTCATTT GTTTCTTGTC AACGCGTCTG CGTTGAGTAT CTATGTACAT TATACGTGGA GAGTGAAAAA GGCACTAGAA TTTGCATTGA GGAAGATTTT TGACGGACAG CTCTACAAAC TTCAAAATTT GACAGAATTA GAGATATCAA AGGCAGAATT GAAAATTCTG AAGGAACAAG TTGCTGATTT AGACAAGGCA TTACAATCAG CTCTTCTAGC AATAGAGAAA GCTTGGAAAC CTTCACTTCC AATTCCATTT GCGTCGATTG ATAACTTGGA TTACATTTAC GGACCTAAAA ACAGAGCACC CTTCGAAGCG TATGATCGAT CCGAACCAGA TAGAATGTGG TACAAAGTTT CTAGTTGCAT GAGTAATGGG TCCAATCTGA AGATTAGTCG AAATAATATT CCTAATAAGC TACAACTAAA CAACGTTGAA TATGACTTCA ACATCTTCAA AGGGATGACA TTTGTGGAAC AGGGCTAG
|
Protein sequence | MTAKTRQRKS YSCINCRDKK RKCDRGKPCS SCVRSGIAST CKYNTPTLTV LGPASILYDN ENASSLEPEG EDDIRDFLRL KNPVGIPGGK INFFKLLCNN LIEPDYNVFS WINVYKSDPG MRIYLKNLPS NSKLFVEQVV VDDFHRDDIF KGLLNLELDE LYRRSVGPDS ENKIIHNEEE LLLAIRSVLP NTVIIATHVE KFFSELYPGF PFLDESAFRK RLMRMGNGPR ITDENDWAII GILLIMVRIS YLSSLWNISG SDRLGAVANR ADYELVSSAT IGPEFIRLAR SCLKQYDLQS RNSLWVLQCN IFLQIYERVS PEEGECATGK SSQIIHASLI QHALSLKLHI DPDLLPQLYG KNEKYKNLVR KIWHFLVSYD YFDSISYGNY PSTTKMVFNT RLPTHIEGNE NIQDTKLEKE VLGSFEFLDF TYGPVHDLLV DIQSLKTDHK IERIVFLATA IESTIASIIG QVDTFFSVET NPSTDKGIRF ALFLYLKFFL LTIFSYLQSY YERTKQYETA FFYFKKVLII GSYEILPAMF RLVGSFGDHF KKETFMLTPQ IIQLSCHRLI ITLVSAFIRI EITLKSNGQE LSGLHGVKIK VLQILHLFLV NASALSIYVH YTWRVKKALE FALRKIFDGQ LYKLQNLTEL EISKAELKIS KEQVADLDKA LQSALLAIEK AWKPSLPIPF ASIDNLDYIY GPKNRAPFEA YDRSEPDRMW YKVSSCMSNG SNSKISRNNI PNKLQLNNVE YDFNIFKGMT FVEQG
|
| |