Gene Information |
Locus tag | PICST_29072 |
Symbol | GLN32 |
ID | 4851808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2887425 |
End bp | 2889242 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | |
GC content | 44% |
IMG OID | 640393516 |
Product | zinc finger transcription factor |
Protein accession | XP_001387117 |
Protein GI | 126275662 |
COG category | |
COG ID | |
TIGRFAM ID | |
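
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length of an unspliced CDS includes the terminal stop codon. The function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(2887425, 2889242)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)
```

Under those assumptions the computed values agree with the Gene Length (1818 bp) and Protein Length (605 aa) fields.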
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.965221 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACAAG GAAATAACTC CAATCATAAA CAAGCTAAAC CTTCCCTCGG CTCCAAGATC TCCTACAAGC CCTCATTGTC AAATTCAGCA GTAAACGAGA AACAACCCCT TGTCGTCTCC AACCTTATTG CTGACACAGA TAACTCCTCC ATCGAAATCT TCAAGATGTA CCAGAACAAG AACTACTTAC CCCACAATCA GAGAATCTCC AACATAGCAT GGAGAATCCA GAACAAGAAG TTGATGCTGG GTGCCTCCGG AACGGCTCCA GCATCCAACC GTTCAAGCTC TGGCTCTGTT AATGGCATTG CCAAACCCAT CCATCGTCTG AATTCAGTGT CTAGTAATCG TTCCAACTCC ATCTCAGCCA AGAACTCAGG TCCTGTAACT GTCAATGGAT CCAAACGGGA TCTGCTTGAT AATCTCAACG ACCCCAACTT GGACGAATTC GACTACGTAG CCCATATCCG TAGAATCAGC CAGGAAGAAT ATAACCAGGC TAACATAATG GCGAAGAATA ACAACCAAAG TAATAAGCAA AGTGACAGCA TCAATAACAA CACCACGAAC AGTATCACTT CCCCGGATTC AAGCACAAAC ACTCTTACGT CGCTGAACTC CGCCATCTTT GGAACGATGA AATCTTCAGC TACAACTGCG ACAAGCACTT CCAGCAATAA ACCACTTGAA GTTTCATTTG CTAACAATAA TAATGCTAAG AACATTCCTG GAAATAACAA TTTTCTCTCG TCATATATAA ATTCTTTGGA ATCGACGTTG AAGCTGGACT ATAAGCTCAA CCAGAATTCC GAGTTCGATA CTTCTTTACA ATCAAATACG GTGTCCAACT CCACGTCGGT ATCTCCACCA AAGTTCAAAC AGAGGCAGCC ATCGGTCGGT ACTGGTATTG GCAAACGAGT ATTGCAATGC ACCAACTGCC AAACCAAGAC GACTCCCTTA TGGAGAAAGG CTAACAACGG CGATTTGCTC TGTAACGCCT GTGGATTGTT TTACAAATTA CATGGAGTAT TGAGGCCATT GAATAATAAT TCCGGTTCTG GTTCAAGCAC CAATCATATC GCTAACTCCG ATCCTATCTC TAATTCTGGC AACAATTCTG GCAATTCCTC TGCCTCGGTG AAAAATCCTA GTGACAAGAT CATACTGAAT AACAATACCA ATCTTTTCAA CGGCTTGCAA CTGTTGAAGA GCAACTTCTC TCCTTCTTCT GCGCCTAAGT CTAATATCAA CACTAGCAAC GTCAACCAGA ACGATAGGTT TAGCTCTAAC GACTTTGACT TGTCCAATTA CGACGGTAAT AAGTTCTATG ATTCTACCAA CAAGGATATG GTCAACATGG ACAGCTTCCT TGACTTTACT CAACCAGGTA CCAACGCTGA CAACAATCCT TCTCGCACCA ACATCGGTAT CAACTCCAAC AATGCCAATA TCTCTATAAA CGCAAATTCA GGATTATCGA GCAGTTTGCC CGTTAACAAC TTCCAGAACC ATCAGCATAC GCCCGTAGGT GGCAACAACG TGGACGAAAT AGACAAGCTC TTGAACATCA ACTTGTTCCA GTCGGATTCG TTCACGATAG GCAATAAGTC TGGCTCTGGC TTCTACGATT TGGATTCGCA GCCTGGTCAA TCTGGCTTAG CTGGAGTTAA TGAAGACATG TATGTTGGCG ATCAGATGCA ACAATCTCAT TTGAATGCCA ATTTGGATCT CGATTTGATC GATGGCAGTC AGACTAATGG CAATGCTAAT GGAAGTGCAG GCTGGAATTG GTTGGACTTC 
AGTCCACCTC AGAACTAG
|
Protein sequence | MAQGNNSNHK QAKPSLGSKI SYKPSLSNSA VNEKQPLVVS NLIADTDNSS IEIFKMYQNK NYLPHNQRIS NIAWRIQNKK LMLGASGTAP ASNRSSSGSV NGIAKPIHRL NSVSSNRSNS ISAKNSGPVT VNGSKRDLLD NLNDPNLDEF DYVAHIRRIS QEEYNQANIM AKNNNQSNKQ SDSINNNTTN SITSPDSSTN TLTSLNSAIF GTMKSSATTA TSTSSNKPLE VSFANNNNAK NIPGNNNFLS SYINSLESTL KLDYKLNQNS EFDTSLQSNT VSNSTSVSPP KFKQRQPSVG TGIGKRVLQC TNCQTKTTPL WRKANNGDLL CNACGLFYKL HGVLRPLNNN SGSGSSTNHI ANSDPISNSG NNSGNSSASV KNPSDKIILN NNTNLFNGLQ LLKSNFSPSS APKSNINTSN VNQNDRFSSN DFDLSNYDGN KFYDSTNKDM VNMDSFLDFT QPGTNADNNP SRTNIGINSN NANISINANS GLSSSLPVNN FQNHQHTPVG GNNVDEIDKL LNINLFQSDS FTIGNKSGSG FYDLDSQPGQ SGLAGVNEDM YVGDQMQQSH LNANLDLDLI DGSQTNGNAN GSAGWNWLDF SPPQN
|
| |
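
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch, not part of the original record, of how the blocks could be joined and how a GC fraction could be computed for comparison with the GC content field. The function names are hypothetical, and the stop codon set (TAA, TAG, TGA) is an assumption, since the Translation table field in the record is empty.

```python
# Assumed stop codons; the record does not state a translation table.
ASSUMED_STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def ends_with_stop(raw: str) -> bool:
    """True if the last three bases form one of the assumed stop codons."""
    seq = clean_sequence(raw)
    return len(seq) >= 3 and seq[-3:] in ASSUMED_STOP_CODONS


# The final blocks of the gene sequence shown above.
tail = "AGTCCACCTC AGAACTAG"
print(clean_sequence(tail), ends_with_stop(tail))
```

Passing the full Gene sequence value to gc_fraction and multiplying by 100 gives a percentage that can be compared with the GC content field.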