Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_66227 |
Symbol | EFG1 |
ID | 4850800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 89185 |
End bp | 91665 |
Gene Length | 2481 bp |
Protein Length | 485 aa |
Translation table | |
GC content | 45% |
IMG OID | 640392508 |
Product | Nuclear receptor coregulator SMRT/SMRTER, contains Myb-like domains |
Protein accession | XP_001387672 |
Protein GI | 126273547 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.271146 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AATTGCTTGA AATCAGCGAA GGATATCCAG CGGCTTCTGA CAGTAGACGG AGTCGGAGCA TATTTGAAGA TTCCAAGGAC AAAATAGTGA ATCGCACCAG GGCGTAAATT CCATAGCCAA ATTCCATAGT TTCAGTAATA TTGCTCAAGT GTACTCCTTG AAATAATTGT GCCAAACTGT TTAAAACAAC AACTTCACTT TCTTCCCCAT CAATAACAGT TGACTAGTTT TTCGTTGATA TCAAAATCCA GTTAGAAGGG GCCTGAGAAT TTCATTTGTA CATTTCCACA GCCTGTTAAC TACAACGTTA CCAACCTAGC GATTCAAGCT AGCACAACAC ACGAGCTTTT TTCCTACAGA GTCCAGCACC TGTGCCCTTG ATATTGGTAT TCTTTTTCCT CGTCACGTTT GACTTTAGAG GTCCAGTTCA TTGCATATCC TTTTTCTCGT AGCCTCAACA TATCCCAACG TTACCGAAAA GGTTGCTCAA GTTTTTCGAG GTTTACGTAC GTCTGTAAAG CAGAGCTATT GATCCAGAAT ACTTGTGAGT TCTCCTGAAC CATACAAGCT TTCACCAGCA TCAGATTTCA AGATCATTTC CAATTATTTG TATTTCAGCT GTAACGTTCC TTAACACCAA GAGTAACCAT TTATCCCCCA TTTAGTTCTA GTACTTGTTT CCCCACCTGT ATTACTTTCG TTTTGTATTA TTTCTACCAA ATCATCTGTA TTAGTATCCA TCACAATGTC CTCCGAACAA AATAATATTG GAATGCCTAA AGCCCTTTCC GCTCAGTTAG AGGATGGCAC GGCCAAGTCT TTGCTAGACG GACAAGCAGC ACAAAAGTTA AAGGATGATG TGACTGACCC GGTGTTGGTT GCCGATGAGG GCCTTGACGC CGACGGTTCT CCCAAGAATT ACAAGAATTT GTCGATAAAC CTGATACTAA ACATCCATGG AAAGGTCCAG GTTCAAAACC CCAATCAAGC CTATGGTACC AACCCCAAGT TGCCTTCCAT CGGTTCCGTT CCCGGAGTAC CGAAAGATAT GTCAACAAGT CAACAGCAAC AGCAACAGCA ACAAGATCTT TCGCCCAAAT ACCTTCCTTA CCAACGTAAT CAATATTTCA ATCACCACCT GAGGTCTGTT TCATCTATCG ACAACAAGTT GGCTGCGACT TCTTTCACAG ACTCGCCTCA ACAAACACAA CAGAGAACGC TGGTCGGACC TTATGGTGAT AATATAGCGT ACCAGCAGCC TCCGGGAGTA GCTGAAGCTG CCCATTTGGT CAACCTTCAG CAGACGACAG GCTCCAACGC TTCCCTGGTC CAGAGTCAGT ATGGCCAAAA TGTAGCATTA GGCTATGGCC ACTTGGTTCA GCCAGGTAAC CAGCCGATGT ATCCGTCACA TGCTGCAACT AATCTGGTTC CTCAGCAGAT GATTAGTGGA GGAAACCAAA TGCATCAATT GCTGTACAAT CCTCTTCATC ACCAATCGCA CCTGGCTCTG GACCTCGATC CGTTGCACGA GAGCAAGAGA GGAAGGCGTT TCAGAAGGAG ATACAACCAG ATTGTCCGCA AGTACAACTG TTCGTATCCT GGATGCGTCA AAAGTTATGG GTCGCTCAAC CATTTGAACA CCCATATCGT GACCAAAAAG CACGGTCATA GAAAGTCTAA GGCGGATTTT CAACACAACC AATTGTCGGA AGATGGAACC AGCAACAATA CCCAACAGGG GCCCTACGAC GCAAGTAACT ATCCGTCACA CCTTCAGCAA CACTCTCCGT CAGATTACAC ACAGGGTAAC TACTGGTACG GCTACAATCC TCAGGTTAGA AGCAACCAGC AAGTAGCAGC TCCACAGCAA CAAATGGAGG TACATGCCAA TACCGTAGCA CCGCCAGGAT CGATACCAGC ACCTACGTAT ATGTACTACC AGCAGGGCTA TCCGCAACAT ATTCCTCCTC CAATTTCACA GCAGCGACCG CCAATGGGCT GGCCGCAACA AACATCGTAT CCATACACAC AAATGCAAGG CCTGACTCTG CAACAGTCAT ATCAACAGAC AGCACAATCT ACGCAAACCT CCAGCATCCT CCAGCAACAC CAAGTTCAGC ATGACCCCGG CCAGCATTCT GACCCTCAAA TGAAGTCTGC ACTGGAATCT TCCACTGGCA CAAGCCCACC GTTGAAGAGA TGAACAGCGG CTGCGATTGC TTGTTCTAGG TGTACATTGT AAGGCTGGAA AAAGGATTGA CCATGATCCT ATCCTTACTT ATTTATTTGA CTTTTATGAT AGATACGAAT AGATACATTC TTTATCAAAA TTTTGCCCGT CGTGCTACCG CAACTCTCGC AGTGTTTGCT CCTTACATGG ACCTTCTGCT GACGAGCAAC ATTCTTTCGC TTTTAAGGCA TTTTTTGACG TTTCTTGATA AAACCAAAAG TTCTGATCTT TTAATGCACG TCATCCCCCA C
|
Protein sequence | MSSEQNNIGM PKALSAQLED GTAKSLLDGQ AAQKLKDDVT DPVLVADEGL DADGSPKNYK NLSINLILNI HGKVQVQNPN QAYGTNPKLP SIGSVPGVPK DMSTSQQQQQ QQQDLSPKYL PYQRNQYFNH HLRSVSSIDN KLAATSFTDS PQQTQQRTLV GPYGDNIAYQ QPPGVAEAAH LVNLQQTTGS NASLVQSQYG QNVALGYGHL VQPGNQPMYP SHAATNLVPQ QMISGGNQMH QLLYNPLHHQ SHLALDLDPL HESKRGRRFR RRYNQIVRKY NCSYPGCVKS YGSLNHLNTH IVTKKHGHRK SKADFQHNQL SEDGTSNNTQ QGPYDASNYP SHLQQHSPSD YTQGNYWYGY NPQVRSNQQV AAPQQQMEVH ANTVAPPGSI PAPTYMYYQQ GYPQHIPPPI SQQRPPMGWP QQTSYPYTQM QGLTLQQSYQ QTAQSTQTSS ILQQHQVQHD PGQHSDPQMK SALESSTGTS PPLKR
|
| |