Gene PICST_47969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_47969 
SymbolSGS1 
ID4840279 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1663949 
End bp1667977 
Gene Length4029 bp 
Protein Length1148 aa 
Translation table12 
GC content40% 
IMG OID640391594 
ProductATP-dependent DNA helicase 
Protein accessionXP_001386017 
Protein GI150866421 
COG category[L] Replication, recombination and repair 
COG ID[COG0514] Superfamily II DNA helicase 
TIGRFAM ID[TIGR00614] ATP-dependent DNA helicase, RecQ family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAATT CAAATAACCT CCTGGTGCAG GTGGCGTGGC TCCAGCGAGA AGCACCACAT 
CTACCGAAAA AGAATTTGGT TGATCTCGTT CTCCAGCCTT TTCCAGAAAG CACACGGTTA
TTAGGACGCA CTGGAACTGG GACAAACGGA GAAATAAGGA ATGGAGTTCT GGTGCTAATA
CCGTCAAATT CTGGAAACAG CAATATAAAT AACAGAAATC TGAATGCTCA GAATAATAAT
AGTACTAGTA TCAATACCAA CGATAGCATC ACCAATACCA ACAAAACCAA TAGTACTTAT
AATAACAATA ATAGCATATC TAATACCGGC GGTTCAGATT TACGACAGAC AGCAAACGCG
AACAACAGAC TGGAAAATTC TCCCAATCTT GATACTCTCA GCGATGTTGA CGAAATTATA
GACTTGACTG TGGATGCCGA TGGACTCTTT CGTTTGCATA AACGACTGCA AACCGTAAGT
GCTGAAAGCA CCAGCAACAA TATCAACAAC AATATCAACA GGAACACTGT TAGCACCGTT
AAAGAAAATG ATTATGCAAA TACTGCTATT GGGCAGAAAC GAACAATGAC AGTGACTAAT
CGTCTTGAAA AACGATCTCG AAATGTCAAC GATACGACTG TACAAGATCA AAGTATTCCA
GAATCACAAG CCGCTACGTC CAAGAGTGAA GTCGCGCTCT TGAAGAAATT GATAGAGTTA
CAGGATATGA AGCTAGCATT ATATGATGAG CGCTTCAAAG TGAGCGAGTC GACTTCCATA
TCGTTTGATG CAAAGAAAGA TTTCTATAAG AAAACGTTTG AACCCAAGTC CAGTGCCATA
GAAGTCCAGA TTAACAATGT GCGAAAGGAG ATCTACAACT CCAATTCTTC ACATGTTCAA
GTTAATAACA GTATCTCTGT ACCGGAGAAG CTATCGGTGC CACTGACATT CCCTGCTTCT
ATCAACTCTG TTCCAATGTC CACTCCTCTT CCACCCCATC TGACTATGGC AATTTCAGTA
CGTGCTGTAG AAGAAGACGA TTTCAACACT TACGAATTCT CCGATTTAGA TGAACCAGAA
GTTAAAGAAG TAGAAGAACG TGTGTTTCGA GTACCAGAAG TAGCACCTAA AGATAATTCT
GTTCCAATAC GACCTGTGGT GATAGAAACT CCTCCGAGAC GGCAACCTCA TGTAGCTATG
CCAGTAATCC CAGATGTAAG CGTTAATGAA GTCGAGGATG ACTTTGGAGA AGGAACCATG
GATGGATTGA GAACCCCGAC ACAGGAGCGA GATGAAGTCA ACGATTTGGG CAGCTTTATT
GCCGACGACT ACTTGGAGTC TGATGTTGAT GGTTCCTTCC AAAATGATTC TGACCATTCA
GAAGAAGAAA CAGAAGGAGC AATTGCCCCA GATGAAATCG ATGACATAAG ACTATCGCCA
GATGTGGCAG GAAAATTGGG TGTGCGGTAT GTTGATCAAC CAGCCCCCAT AGCTTTGGTA
GATCTGGATT CTGAATCTGT TCATAAAGAA TATGCGGATG ATGATTATGA CGATGATGAA
ATTGAGGAAA TAGAAGACTT CACCACGCAA TTAAATGAAG AAAGAGAATT AAATAATGAT
GTCATTGACC TCATTTCAGA TCAAGAGGGC GAAAATGATC TATTCGAACA ACATCTCCCA
ACTAATTTGG CAGAATCAGG AACACATTCA AAGGCACTCG GTGAATCCAC CAATATTGAC
AGACATATCG CGCCTAAGGT AGAAAGTGAT CTAGAATTCA GTGATGATGA TGACGAATTA
ATGAATATAT TGAACAATCA GCAGCCTATC GTTGGAAATG GACCTAATAA AGAGAATATA
CCTCCGGGCT CTGAACATTT CATTGATGAA GTATACTCTG TATTAAATTC TGTCTTCAAA
TTGCAGTCAT TCAGATCAAA TCAATTGGAA GCAGTGTGTG CTAGTTTGCA ATCTAAGGAT
GTGTTTGTAT TGATGCCAAC AGGTGGTGGA AAATCCTTAT GTTATCAGTT GCCTGCGCTT
GTGAAAGGTG GGAAGACTAA TGGTACAACT GTTGTTATTT CTCCTTTAAT TTCATTGATG
CAAGATCAAG TTCAGCACTT ATTGGACAAG AATGTGAAAG CAGGAATGAT CAGTTCCAAA
GCAACAGCAG AAGAGAATAA ACAAACAATG CATTTATTCA GGGAGGGCTT TCTTGATTTG
GTATATCTTT CCCCAGAAAA AGCAAACACT TCTAATGTTG TCCAAAAGAT AATAAGCAAA
TTGTATGAAA CCAACAGATT GGCCAGAGTT GTGATAGATG AAGCACATTG CTTGAGTTCG
TGGGGACATG ATTTCAGACC TGATTATCAG AGTATGGGGC TCTTTAAGGA GAGGTATCCC
AATGTTCCAA TAATGGCCTT AACAGCAACG GCTAATGAAA AGGTAAGATT AGATATTGTC
CACAATTTGA AAATGGAAAA TGCCGTTCTT TTGAAACAAA GTTTCAATAG AACAAACTTG
TACTATGAAA TTAAGTGGAA AGCAGCCAAC TACGTCGAAT GGATAAAAGA TTACATTTTG
AAAAACCAGA ATAATAAGAC GGGTATCATA TATTGTCATT CGAAACAGTC CTGTGAACAG
ACAAGTGCTA AACTCAATCT GTTCGGGCTT CATACTGCTT TCTATCACGC TGGAATGTCT
CCCCAAGATA GATTCGATAT CCAATCACAA TGGCAAACTG GAAGAATTCA GTTGATTTGT
GCCACAATTG CTTTCGGGAT GGGAATTGAT AAGCCCGACG TCAGATATGT CATCCATTTA
TTCATTCCTC GAAGTTTGGA AGGATACTAC CAAGAAACCG GAAGAGCTGG AAGAGATGGC
AAACAATCTG ATTGTATTAT GTTTTACTCT TATAAGGACG CTCGACTGCT TCAGAGTATG
ATACAAAGAG ACGAAGAATT GACGAAGGAA GGGAAAGAAA ATCATCTTGC TAAGCTTAGA
CAGGTGGTTC AATATTGTGA GAACACAACT GACTGTAGAA GACAGCAGGT TTTGCAATAC
TTCAACGAGT CTTTCAGCCC GGCAGATTGT CGAAAGCAAT GTGACAATTG CCAAAATTCA
ACTGGTGTTT CAGTGGTCGA AAGAAACTGT ACAGAGTATG CCAAAAATAT CATAAATTTG
GTGCAATCTA TCCAGGAAGA AAGAGTGACA GTACTTCATT GCCAGGATGT CTTTAAGGGA
GCCAGAAATA GCAAAATTAT GAAAATGGGA CATAATCTTA ATCCGTATCA TGGGAAAGGG
AGCTCCTTGG ACAAGACAGA CGTCGAGAGA ATCTTTTTCT ACCTATTGAG TGAAGAGTGC
CTTGTAGAAT ACCAGATCAT GAAAGGTGGA TTTGCGTCGA ACTACGTCCG TACCGGCAAG
AACGCATACC AAGTTTTGAG AGGAATGAAG CAGATCCAGA TTCAATTCAG CACTGAGAAA
AGAGTACGGC AAAATACTGG AAATGCATCA TCGACTACTA GTACATCAGC AGTACACTCC
AATCTCAACA GTTTCAAATA TCGTGAATCG TTTGTCACAG CACGTGAGGT ATCGAGGATG
AACTCGATGA ATAGTAATGT ACCGATAACT CTCCCGCAAA CAAGAATGCT GTCTGATGGA
AGTGGTGTCA CTGTTGAACA GGCTAATCAT GCTTATAATG AGCTTAATAA GATAAGAATT
GAAGCTCTGT CGGAGATTGG CATCCCATTG AGTCAGTTTG TCAGCGAGAT ATCTCTACGA
GAAATGTCTA ATAAATTGCC GACGAACAAG AGAGACTTCT CTAAAATTCA GGGTATATTA
AAGGAACAAG TTGAATATTT CACTCTATTC AAGAAGACGT TGGGTATACT ATCTAGAGAA
AGGAAAAAGC AATCACCCAA CTCGAGTTTT GTTAGTAATT CAGACATAGC CAGTGCTGGT
GCAGATATGT CTATTTCGCC ATATTTTCCA CCACCCCAAC CTGAACGTCA TGTTTTGGAT
AACCTAAGG
 
Protein sequence
MINSNNLSVQ VAWLQREAPH LPKKNLVDLV LQPFPESTRL LGRTGTGTNG EIRNGVSVLI 
PSNSGNSNIN NRNSNAQNNN STSINTNDSI TNTNKTNSTY NNNNSISNTG GSDLRQTANA
NNRSENSPNL DTLSDSEVAL LKKLIELQDM KLALYDERFK VSESTSISFD AKKDFYKKTF
EPKSSAIEVQ INNVRKEIYN SNSSHVQVNN SISVPEKLSV PSTFPASINS VPIVNEVEDD
FGEGTMDGLR TPTQERDEVN DLGSFIADDY LESDVDGSFQ NDSDHSEEET EGAIAPDEID
DIRLSPDVAG KLGVRYVDQP APIALVDSDS ESVHKEYADD DYDDDEIEEI EDFTTQLNEE
RELNNDVIDL ISDQEGENDL FEQHLPTNLA ESGTHSKALG ESTNIDRHIA PKVESDLEFS
DDDDELMNIL NNQQPIVGNG PNKENIPPGS EHFIDEVYSV LNSVFKLQSF RSNQLEAVCA
SLQSKDVFVL MPTGGGKSLC YQLPALVKGG KTNGTTVVIS PLISLMQDQV QHLLDKNVKA
GMISSKATAE ENKQTMHLFR EGFLDLVYLS PEKANTSNVV QKIISKLYET NRLARVVIDE
AHCLSSWGHD FRPDYQSMGL FKERYPNVPI MALTATANEK VRLDIVHNLK MENAVLLKQS
FNRTNLYYEI KWKAANYVEW IKDYILKNQN NKTGIIYCHS KQSCEQTSAK LNSFGLHTAF
YHAGMSPQDR FDIQSQWQTG RIQLICATIA FGMGIDKPDV RYVIHLFIPR SLEGYYQETG
RAGRDGKQSD CIMFYSYKDA RSLQSMIQRD EELTKEGKEN HLAKLRQVVQ YCENTTDCRR
QQVLQYFNES FSPADCRKQC DNCQNSTGVS VVERNCTEYA KNIINLVQSI QEERVTVLHC
QDVFKGARNS KIMKMGHNLN PYHGKGSSLD KTDVERIFFY LLSEECLVEY QIMKGGFASN
YVRTGKNAYQ VLRGMKQIQI QFSTEKRVRQ NTGNASSTTS TSAVHSNLNS FKYRESFVTA
REVSRMNSMN SNANHAYNEL NKIRIEASSE IGIPLSQFVS EISLREMSNK LPTNKRDFSK
IQGILKEQVE YFTLFKKTLG ILSRERKKQS PNSSFVSNSD IASAGADMSI SPYFPPPQPE
RHVLDNLR