Gene PICST_66227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_66227 
SymbolEFG1 
ID4850800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp89185 
End bp91665 
Gene Length2481 bp 
Protein Length485 aa 
Translation table 
GC content45% 
IMG OID640392508 
ProductNuclear receptor coregulator SMRT/SMRTER, contains Myb-like domains 
Protein accessionXP_001387672 
Protein GI126273547 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.271146 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AATTGCTTGA AATCAGCGAA GGATATCCAG CGGCTTCTGA CAGTAGACGG AGTCGGAGCA 
TATTTGAAGA TTCCAAGGAC AAAATAGTGA ATCGCACCAG GGCGTAAATT CCATAGCCAA
ATTCCATAGT TTCAGTAATA TTGCTCAAGT GTACTCCTTG AAATAATTGT GCCAAACTGT
TTAAAACAAC AACTTCACTT TCTTCCCCAT CAATAACAGT TGACTAGTTT TTCGTTGATA
TCAAAATCCA GTTAGAAGGG GCCTGAGAAT TTCATTTGTA CATTTCCACA GCCTGTTAAC
TACAACGTTA CCAACCTAGC GATTCAAGCT AGCACAACAC ACGAGCTTTT TTCCTACAGA
GTCCAGCACC TGTGCCCTTG ATATTGGTAT TCTTTTTCCT CGTCACGTTT GACTTTAGAG
GTCCAGTTCA TTGCATATCC TTTTTCTCGT AGCCTCAACA TATCCCAACG TTACCGAAAA
GGTTGCTCAA GTTTTTCGAG GTTTACGTAC GTCTGTAAAG CAGAGCTATT GATCCAGAAT
ACTTGTGAGT TCTCCTGAAC CATACAAGCT TTCACCAGCA TCAGATTTCA AGATCATTTC
CAATTATTTG TATTTCAGCT GTAACGTTCC TTAACACCAA GAGTAACCAT TTATCCCCCA
TTTAGTTCTA GTACTTGTTT CCCCACCTGT ATTACTTTCG TTTTGTATTA TTTCTACCAA
ATCATCTGTA TTAGTATCCA TCACAATGTC CTCCGAACAA AATAATATTG GAATGCCTAA
AGCCCTTTCC GCTCAGTTAG AGGATGGCAC GGCCAAGTCT TTGCTAGACG GACAAGCAGC
ACAAAAGTTA AAGGATGATG TGACTGACCC GGTGTTGGTT GCCGATGAGG GCCTTGACGC
CGACGGTTCT CCCAAGAATT ACAAGAATTT GTCGATAAAC CTGATACTAA ACATCCATGG
AAAGGTCCAG GTTCAAAACC CCAATCAAGC CTATGGTACC AACCCCAAGT TGCCTTCCAT
CGGTTCCGTT CCCGGAGTAC CGAAAGATAT GTCAACAAGT CAACAGCAAC AGCAACAGCA
ACAAGATCTT TCGCCCAAAT ACCTTCCTTA CCAACGTAAT CAATATTTCA ATCACCACCT
GAGGTCTGTT TCATCTATCG ACAACAAGTT GGCTGCGACT TCTTTCACAG ACTCGCCTCA
ACAAACACAA CAGAGAACGC TGGTCGGACC TTATGGTGAT AATATAGCGT ACCAGCAGCC
TCCGGGAGTA GCTGAAGCTG CCCATTTGGT CAACCTTCAG CAGACGACAG GCTCCAACGC
TTCCCTGGTC CAGAGTCAGT ATGGCCAAAA TGTAGCATTA GGCTATGGCC ACTTGGTTCA
GCCAGGTAAC CAGCCGATGT ATCCGTCACA TGCTGCAACT AATCTGGTTC CTCAGCAGAT
GATTAGTGGA GGAAACCAAA TGCATCAATT GCTGTACAAT CCTCTTCATC ACCAATCGCA
CCTGGCTCTG GACCTCGATC CGTTGCACGA GAGCAAGAGA GGAAGGCGTT TCAGAAGGAG
ATACAACCAG ATTGTCCGCA AGTACAACTG TTCGTATCCT GGATGCGTCA AAAGTTATGG
GTCGCTCAAC CATTTGAACA CCCATATCGT GACCAAAAAG CACGGTCATA GAAAGTCTAA
GGCGGATTTT CAACACAACC AATTGTCGGA AGATGGAACC AGCAACAATA CCCAACAGGG
GCCCTACGAC GCAAGTAACT ATCCGTCACA CCTTCAGCAA CACTCTCCGT CAGATTACAC
ACAGGGTAAC TACTGGTACG GCTACAATCC TCAGGTTAGA AGCAACCAGC AAGTAGCAGC
TCCACAGCAA CAAATGGAGG TACATGCCAA TACCGTAGCA CCGCCAGGAT CGATACCAGC
ACCTACGTAT ATGTACTACC AGCAGGGCTA TCCGCAACAT ATTCCTCCTC CAATTTCACA
GCAGCGACCG CCAATGGGCT GGCCGCAACA AACATCGTAT CCATACACAC AAATGCAAGG
CCTGACTCTG CAACAGTCAT ATCAACAGAC AGCACAATCT ACGCAAACCT CCAGCATCCT
CCAGCAACAC CAAGTTCAGC ATGACCCCGG CCAGCATTCT GACCCTCAAA TGAAGTCTGC
ACTGGAATCT TCCACTGGCA CAAGCCCACC GTTGAAGAGA TGAACAGCGG CTGCGATTGC
TTGTTCTAGG TGTACATTGT AAGGCTGGAA AAAGGATTGA CCATGATCCT ATCCTTACTT
ATTTATTTGA CTTTTATGAT AGATACGAAT AGATACATTC TTTATCAAAA TTTTGCCCGT
CGTGCTACCG CAACTCTCGC AGTGTTTGCT CCTTACATGG ACCTTCTGCT GACGAGCAAC
ATTCTTTCGC TTTTAAGGCA TTTTTTGACG TTTCTTGATA AAACCAAAAG TTCTGATCTT
TTAATGCACG TCATCCCCCA C
 
Protein sequence
MSSEQNNIGM PKALSAQLED GTAKSLLDGQ AAQKLKDDVT DPVLVADEGL DADGSPKNYK 
NLSINLILNI HGKVQVQNPN QAYGTNPKLP SIGSVPGVPK DMSTSQQQQQ QQQDLSPKYL
PYQRNQYFNH HLRSVSSIDN KLAATSFTDS PQQTQQRTLV GPYGDNIAYQ QPPGVAEAAH
LVNLQQTTGS NASLVQSQYG QNVALGYGHL VQPGNQPMYP SHAATNLVPQ QMISGGNQMH
QLLYNPLHHQ SHLALDLDPL HESKRGRRFR RRYNQIVRKY NCSYPGCVKS YGSLNHLNTH
IVTKKHGHRK SKADFQHNQL SEDGTSNNTQ QGPYDASNYP SHLQQHSPSD YTQGNYWYGY
NPQVRSNQQV AAPQQQMEVH ANTVAPPGSI PAPTYMYYQQ GYPQHIPPPI SQQRPPMGWP
QQTSYPYTQM QGLTLQQSYQ QTAQSTQTSS ILQQHQVQHD PGQHSDPQMK SALESSTGTS
PPLKR