Gene Information |
Locus tag | PICST_62201 |
Symbol | UGA3.2 |
ID | 4840309 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 404050 |
End bp | 405972 |
Gene Length | 1923 bp |
Protein Length | 576 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640391624 |
Product | putative transcriptional regulator |
Protein accession | XP_001385430 |
Protein GI | 150865985 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
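The coordinate and length fields in the Gene Information table can be cross-checked with a few lines of arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, and that the protein length excludes the stop codon. Both are assumptions, although the first agrees with the reported gene length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 404050
end_bp = 405972
protein_length_aa = 576

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the protein length excludes the stop codon,
# so the coding sequence spans (aa + 1) codons.
coding_length_bp = (protein_length_aa + 1) * 3

# The record flags the gene as spliced; under the assumptions above,
# the remainder would be non-coding (presumably intronic) sequence.
non_coding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp)    # 1923
print(coding_length_bp)  # 1731
print(non_coding_bp)     # 192
```

Under these assumptions, the 1923 bp gene span exceeds the 1731 bp needed to encode 576 residues plus a stop codon by 192 bp, which is consistent with the "Is gene spliced | Yes" field.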
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0514301 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATCTTG TTTTAGACGT CAATTCGTTG GAGACAGTCG AATCCAGAAG CTCTCCTACT AGTGCTCACA TAGGAAGCTA TAGCTCAAAG AGAATCCTCT CGAGAAACAT AATCCTCGGC CCGACAAAGA GATCTCGCAA TGGATGTTTG AATTGCAGAA AGAGAAAGAA GAAATGTGAC GAAAGTTTTC CCACATGCGG TTCGTGTAAG TATCGAGGAG CTGAATGTAT CTGGAGAGAC TCTACTAAGT TCAAGATGAA GAGGTATTCC GATTCCAGAG ACTCTACAAA GAAGTCAGCA GTTGTGCGGC ATGTCTCAAC GAAATCTCCG TCCGAAGAAC TATCAGAAGC GATTGTTGAA TCACAGGATA AGATGGAACT TTCAAAAGGC ATTATAGAAC TTGTTAATGA TGAGATAGAT CAAGTGAGTC ATACTGACAT CTTGAACGCA GCTGAATTGG ACCAATTCGA GAATGTAGAA GACGAATTGT TTACGAACAA CAATCATGAA TTGACACCTT ACACAACAGA CTTGCGATTG GATACAAGTG AATTCACGAC CAAGAACATA GATTTCCTAA TGTCTGATGA TTTTAGCGAC TTCCCCTATT TATCACCTAC TACAACACCA ATATTCAATC CGTTTAGACA CCTCGACGAT AAAGCAAAGT ATTTCCTTGA CGGATTTATT CATAAGGTGG CACGTAATTT ATGTATAGGG CCAGACTGGT GCAATTACTT TCTCAAAACG TTCTATCAGA TGGCAGAACA AGACAAATCT GTTCTGTTCG CATTGGCCTC TTGGGGAGGT TTGTTCCTCG AAGGAAGCAC CGACGCAACT AAGTCATATA TGATCAAGGC ATATAAATCG ATTACAGAGA GATTTCCTAA CTTCAACGAA CTCAGCAAAG AGGACATTTA TATCTTGCTC AACTTTTTTT TGATAGGCAT AGGTGTTCAT GTTTGTGCTG GAGATGTTTC TCAGTGGAAT ATCTTATTTA AGCAGTGTAT TGAAGTGATT CAAAAGAACG GCGGTTTGTC AGAAATATGT CGCATGTTTG ACTATTCCAA TGATATTAAG TGGTTAATAT CCGATGTACA GTTTCATGAT ATAATGTCCT CAAGAGCATT TTCTAAAGGA ACAATCTTAC CAATGGAAGA GTACAATACC ATTTTCCAAC GCAACAAGAT CTTGGAGCTA GGTAATTATG GTTTGGACCC ACTCCAAGGA TGTATCCAAC CTATTTATTT GTTGTTAGGC GAGATTCTAC AGGTTTCATC TGATCTAAAG TCTAAGAAGA AACATATCAA TAAACTCTTA GAGGATGCAC GAAAAGCATA CAACGATAGT GATAAAACTA ATCCAGATTT ACTCAACACT GCTACAGTTC AAGGAGAAGT AATAAACTTG CGCATATTGC GACAAAACTT CTATAATGAA ATGGAGGAAG TTATCGATCA GTTGAAGGAA AAATTGAAGC GGTGTCAACC AAATATACAG CAAATGGAGC CAATTATCGA TGACAAACAC GAAGTCGAGT TGCATCTTAC TTTGTTTGAG GTTTATCTGT ATACCTGTCA ATTATCGATG AATTATCAGA TCAAAGGAAT GCCGGCTTCG TCAGCAGAGA TGCAGTCAAT ATTGGTCAAT GCTGTGAGCT GTATCGATAT TCTCGTAGAT ACGAAATTGG TGTCTTCATT ATCTTTGCTG TTACTCTTGT GTGGTATCAC ATGTTGTACA GCCACTGATA GACTAGATAT GGAGGTGCGA ATAAAGAAGA TCCAGCTGGC ATACGAGGTT 
GCCAATCTTA CCAGAATGGT TGATATCATC AAAGAAGTAT GGAAGAGGAA TAGTAACGGA AACGTATGTA TAGATTGGGT GGAGGTGTGC AACGAGAAGG ACTGGAATCT TTCTGTATGC TAA
|
Protein sequence | MYLVLDVNSL ETVESRSSPT SAHIGSYSSK RILSRNIILG PTKRSRNGCL NCRKRKKKCD ESFPTCGSCK YRGAECIWRD STKFKMKRYS DSRDSTKKSA VVRHVSTKSP SEELSEAIVE SQDKMELSKG IIELVNDEID QVNLRLDTSE FTTKNIDFLM SDDFSDFPYL SPTTTPIFNP FRHLDDKAKY FLDGFIHKVA RNLCIGPDWC NYFLKTFYQM AEQDKSVSFA LASWGGLFLE GSTDATKSYM IKAYKSITER FPNFNELSKE DIYILLNFFL IGIGVHVCAG DVSQWNILFK QCIEVIQKNG GLSEICRMFD YSNDIKWLIS DVQFHDIMSS RAFSKGTILP MEEYNTIFQR NKILELGNYG LDPLQGCIQP IYLLLGEILQ VSSDLKFTQH CYIINLRILR QNFYNEMEEV IDQLKEKLKR CQPNIQQMEP IIDDKHEVEL HLTLFEVYSY TCQLSMNYQI KGMPASSAEM QSILVNAVSC IDILVDTKLV SSLSLSLLLC GITCCTATDR LDMEVRIKKI QSAYEVANLT RMVDIIKEVW KRNSNGNVCI DWVEVCNEKD WNLSVC
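The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that clean-up plus a GC-content calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demo input is only the first three blocks of the gene sequence, so its result is not expected to match the 39% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, used as a short demo input.
demo = clean_sequence("ATGTATCTTG TTTTAGACGT CAATTCGTTG")
print(len(demo))                   # 30
print(round(gc_percent(demo), 1))  # 33.3
```

Because `str.split()` with no arguments splits on any whitespace, the same helper also rejoins a sequence that has been wrapped across several lines.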