Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_85888 |
Symbol | SSN7 |
ID | 4850816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 132786 |
End bp | 135301 |
Gene Length | 2516 bp |
Protein Length | 669 aa |
Translation table | |
GC content | 46% |
IMG OID | 640392524 |
Product | transcription factor which mediates glucose repression |
Protein accession | XP_001387682 |
Protein GI | 126273570 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AGTTTTTGTT TTTGTTTTGA ATTCACATTT CATATTAGAG CTCACAACTA CGTCATCTCA ACTGCAACCC AGCGTATTCA CTAGCTCTGC GAGATAGATC CTTCAAGTGT CACGGAAAAA GAACTGCTGA CTATAAAAGA ATCCTCGCCT ATTTGTTCGC AAGTCCATAA TTTTTTTATC AGAACTTTCA TCTAAGAATT TTCATCTTAC AGTGGCCCTC ATATCATAGA TAAGTAACCA CCTGTTCATC CACTAGCTCT AATTAACTTA CCGTTGTCTA TACCAGCACA ACATTAACAT TTTCTACCGC TATACCTTTC CACCTGTTCC AGCAAACACT AACAGTGCAA CCAGCTCTTC CACTTTTTTT ACACTAAAGC CAGAAGATAC TCGAGGATAT ACAATCAACT TTCCTTTGTA TAATCACATT TTCATCTCAT AAATCAATAG AATGTACGCT GCCAAAAGAC CCTCCCAGGA GCTTTCGCCT GGGTCTAAAC GCCAGCGACT CCACGACAAG CTGCTCTCTG ACTACGCATC ATCAGCTACA GAAACGTGGA TAGCCTGTGG GAACTGTGCC AATACTTTAG GTCTCGTTGT TCCCGCTATC AAGTCGTTTG AAAACGCTGT TGTGCATGAT CCTTCCAATG CTGCTGCCCT CTGTGGATTG GCTTCTTCGC TTAGACTTAA CGATATCAGC CTCAACGAAA CGATAGGAAC CCAAAGTGCC ATCGACAAAT TGAATAAGCT GTTAGAACTG TTTCCCTCTT TGCTCAAGCA GCCAGCAGTT TTTAAGGAGT TGGCAGAATG TTACTTACTC ATTGGTCTCA ACGACCAGGC TCACCAGGCT ATCCAGACGG CTTTGCAACT TGCTGAGACC GATGCTTCGC TTTGGCTCTT GCTGGCTCAA ACATTAATAC GTGTCGGCGC CAGAAGTCAT GCTGCCGGTT CGTTGACCCA CTGTTTGTCG CTTTTACCAG ACTCAATCCA TCAGTTTTCT ACTGCTGATA TCGAAACTGC CAGAGCTGCT CATGCTGAAT TGGCAGCTAT TGCTGCTGCT GATGGAAGCA TCGAGTTGTC AATTGCAGAA TTGACTGCAA CTCTTTCTCT TCCACCTCCT CCATTGTCGA GAATCGACGA ACATATTGCT CTCTGGTGTG CTTTATCAAC AGCCAAAGAA AGAGCTAACG ATATCGCTGG AGCCATCCAG GCATGTGAGC AGTCAGAAAA GGCCGTTGGA ATGTCACCAC GTATCTTGAT GACTCACTCG TACTTGCTTC TCTTCAGCAA CGACAGAAAT AATGCCGAAA CGGCCATTAA CTTACTTTCT AAAATCATTG ACTTGGAAAA GGAAAAGGAA TCAGAACAGC CCAAAAACGA CGGTGACTTC TTACCGTGGT ACTTGCTCGG AAAAGCGTAT TCTCTCGTAG ACCAACCTCG GTTGGCATAT GATTCTTACC AGGTTGCTTT ACGTAGAGCC TCGAACTCGC CCATCACGTG GCTCGCTGTA GGGAAGTTGT ATTTGGAGTT GAAGCAGTTG CCCGATGCCT TGGCAGCCTA TTCGCAGGCG TTACGGTTGC AAATCGATGA AAGCTCACCC GGAACTGCTA CAGCTTGGGA CGGTTTGAGT TGTGTCTATG AAAGATGCGA CGACCAGCTT ATGGATGCCT CTGATGCTTG TGCCAGGTCT GCCTCTTGTT TCAAGGTGAT AGGTGACTTG AAGTCTGCCG CCTTCTTTGA AGAAAGAGCT GAACTTTTGG CTAAAGTATC GAAAAAAGAA GCTCCTGTAC CAGAATTGAG AGATCCTCCA GATGTTCCCA GCTTCTTGTT GAGAGACCTT GTAGCCTTGT TGCCTTCTGA GAGAATTGCC TTTATACAAG GTCCTCAACA ACAAGAGCAA ACTCAACAAC CTCAACAACA GCAACAACAG CAACAACAGG GTACACCTTT ACAACAGACT CCAAATCCAC AGCATCACCA ACCTCAACAC AGTCCTGCTA TTTCCATGAT TCAACTGCAG CAAACTCCAC AACCACCTCA GGCTCATTAT CCTCAACCTC AGCAACAACA GCCTCAACGT ACTCCTCAGC AACATCTTTT CCATCAGGGT TTCAAACAGG AACAAAAGTC GCCCAGACAG CAGCAGATTC CTCCTCAGTT GCCTCCTCCA CAGATTCAGC AAGTTTGGTC TCCAAATCAA CAACCACAAT ACTTCTATCA ACAGGGTCCT CCACAGGGTC CTCCAGCATT GCACCACCAG GGAGGTCCAG CTGCTCCGCC TTCTTCTCAT CGTTCACCTT TGAATCCTCA ACTCATTCCG GCGGGAACTT ACCCCGTACC AGCACCTCCT GGTGTAGCAC CTCCAGGTTA TCCATATGGT CAATACGTAC CTGTACAAGG TGGTGTAATG ACTCATATCC AACAACAGTA TGCTCCACCG GTGAACAACT GGAGAAGATA GGCATTGTAT TATATTCGCT CAGGAAGTAT GTATTAATAA ACACGGGCCA AAGAAG
|
Protein sequence | MYAAKRPSQE LSPGSKRQRL HDKLLSDYAS SATETWIACG NCANTLGLVV PAIKSFENAV VHDPSNAAAL CGLASSLRLN DISLNETIGT QSAIDKLNKL LELFPSLLKQ PAVFKELAEC YLLIGLNDQA HQAIQTALQL AETDASLWLL LAQTLIRVGA RSHAAGSLTH CLSLLPDSIH QFSTADIETA RAAHAELAAI AAADGSIELS IAELTATLSL PPPPLSRIDE HIALWCALST AKERANDIAG AIQACEQSEK AVGMSPRILM THSYLLLFSN DRNNAETAIN LLSKIIDLEK EKESEQPKND GDFLPWYLLG KAYSLVDQPR LAYDSYQVAL RRASNSPITW LAVGKLYLEL KQLPDALAAY SQALRLQIDE SSPGTATAWD GLSCVYERCD DQLMDASDAC ARSASCFKVI GDLKSAAFFE ERAELLAKVS KKEAPVPELR DPPDVPSFLL RDLVALLPSE RIAFIQGPQQ QEQTQQPQQQ QQQQQQGTPL QQTPNPQHHQ PQHSPAISMI QLQQTPQPPQ AHYPQPQQQQ PQRTPQQHLF HQGFKQEQKS PRQQQIPPQL PPPQIQQVWS PNQQPQYFYQ QGPPQGPPAL HHQGGPAAPP SSHRSPLNPQ LIPAGTYPVP APPGVAPPGY PYGQYVPVQG GVMTHIQQQY APPVNNWRR
|
| |