Gene Information |
Locus tag | PICST_47969 |
Symbol | SGS1 |
ID | 4840279 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 1663949 |
End bp | 1667977 |
Gene Length | 4029 bp |
Protein Length | 1148 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640391594 |
Product | ATP-dependent DNA helicase |
Protein accession | XP_001386017 |
Protein GI | 150866421 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0514] Superfamily II DNA helicase |
TIGRFAM ID | [TIGR00614] ATP-dependent DNA helicase, RecQ family |
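
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and the function name `gene_length` is hypothetical, not part of any database API. Under that assumption the span reproduces the listed Gene Length of 4029 bp. The 1148 amino acids account for only 3444 of those bases, which is consistent with the record marking the gene as spliced.

```python
# Values copied from the Gene Information table above.
START_BP = 1663949
END_BP = 1667977
PROTEIN_LENGTH_AA = 1148


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


length_bp = gene_length(START_BP, END_BP)

# Bases needed to encode the amino acids alone (stop codon not counted).
coding_bp = PROTEIN_LENGTH_AA * 3

# Bases in the gene span not explained by amino-acid codons
# (introns, and possibly a stop codon; the record does not break this down).
remaining_bp = length_bp - coding_bp

print(length_bp)     # 4029
print(coding_bp)     # 3444
print(remaining_bp)  # 585
```

The strand field (`-`) does not affect this length calculation; it only indicates that the gene is read from the reverse complement of the replicon.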
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAAATT CAAATAACCT CCTGGTGCAG GTGGCGTGGC TCCAGCGAGA AGCACCACAT CTACCGAAAA AGAATTTGGT TGATCTCGTT CTCCAGCCTT TTCCAGAAAG CACACGGTTA TTAGGACGCA CTGGAACTGG GACAAACGGA GAAATAAGGA ATGGAGTTCT GGTGCTAATA CCGTCAAATT CTGGAAACAG CAATATAAAT AACAGAAATC TGAATGCTCA GAATAATAAT AGTACTAGTA TCAATACCAA CGATAGCATC ACCAATACCA ACAAAACCAA TAGTACTTAT AATAACAATA ATAGCATATC TAATACCGGC GGTTCAGATT TACGACAGAC AGCAAACGCG AACAACAGAC TGGAAAATTC TCCCAATCTT GATACTCTCA GCGATGTTGA CGAAATTATA GACTTGACTG TGGATGCCGA TGGACTCTTT CGTTTGCATA AACGACTGCA AACCGTAAGT GCTGAAAGCA CCAGCAACAA TATCAACAAC AATATCAACA GGAACACTGT TAGCACCGTT AAAGAAAATG ATTATGCAAA TACTGCTATT GGGCAGAAAC GAACAATGAC AGTGACTAAT CGTCTTGAAA AACGATCTCG AAATGTCAAC GATACGACTG TACAAGATCA AAGTATTCCA GAATCACAAG CCGCTACGTC CAAGAGTGAA GTCGCGCTCT TGAAGAAATT GATAGAGTTA CAGGATATGA AGCTAGCATT ATATGATGAG CGCTTCAAAG TGAGCGAGTC GACTTCCATA TCGTTTGATG CAAAGAAAGA TTTCTATAAG AAAACGTTTG AACCCAAGTC CAGTGCCATA GAAGTCCAGA TTAACAATGT GCGAAAGGAG ATCTACAACT CCAATTCTTC ACATGTTCAA GTTAATAACA GTATCTCTGT ACCGGAGAAG CTATCGGTGC CACTGACATT CCCTGCTTCT ATCAACTCTG TTCCAATGTC CACTCCTCTT CCACCCCATC TGACTATGGC AATTTCAGTA CGTGCTGTAG AAGAAGACGA TTTCAACACT TACGAATTCT CCGATTTAGA TGAACCAGAA GTTAAAGAAG TAGAAGAACG TGTGTTTCGA GTACCAGAAG TAGCACCTAA AGATAATTCT GTTCCAATAC GACCTGTGGT GATAGAAACT CCTCCGAGAC GGCAACCTCA TGTAGCTATG CCAGTAATCC CAGATGTAAG CGTTAATGAA GTCGAGGATG ACTTTGGAGA AGGAACCATG GATGGATTGA GAACCCCGAC ACAGGAGCGA GATGAAGTCA ACGATTTGGG CAGCTTTATT GCCGACGACT ACTTGGAGTC TGATGTTGAT GGTTCCTTCC AAAATGATTC TGACCATTCA GAAGAAGAAA CAGAAGGAGC AATTGCCCCA GATGAAATCG ATGACATAAG ACTATCGCCA GATGTGGCAG GAAAATTGGG TGTGCGGTAT GTTGATCAAC CAGCCCCCAT AGCTTTGGTA GATCTGGATT CTGAATCTGT TCATAAAGAA TATGCGGATG ATGATTATGA CGATGATGAA ATTGAGGAAA TAGAAGACTT CACCACGCAA TTAAATGAAG AAAGAGAATT AAATAATGAT GTCATTGACC TCATTTCAGA TCAAGAGGGC GAAAATGATC TATTCGAACA ACATCTCCCA ACTAATTTGG CAGAATCAGG AACACATTCA AAGGCACTCG GTGAATCCAC CAATATTGAC AGACATATCG CGCCTAAGGT AGAAAGTGAT CTAGAATTCA GTGATGATGA TGACGAATTA 
ATGAATATAT TGAACAATCA GCAGCCTATC GTTGGAAATG GACCTAATAA AGAGAATATA CCTCCGGGCT CTGAACATTT CATTGATGAA GTATACTCTG TATTAAATTC TGTCTTCAAA TTGCAGTCAT TCAGATCAAA TCAATTGGAA GCAGTGTGTG CTAGTTTGCA ATCTAAGGAT GTGTTTGTAT TGATGCCAAC AGGTGGTGGA AAATCCTTAT GTTATCAGTT GCCTGCGCTT GTGAAAGGTG GGAAGACTAA TGGTACAACT GTTGTTATTT CTCCTTTAAT TTCATTGATG CAAGATCAAG TTCAGCACTT ATTGGACAAG AATGTGAAAG CAGGAATGAT CAGTTCCAAA GCAACAGCAG AAGAGAATAA ACAAACAATG CATTTATTCA GGGAGGGCTT TCTTGATTTG GTATATCTTT CCCCAGAAAA AGCAAACACT TCTAATGTTG TCCAAAAGAT AATAAGCAAA TTGTATGAAA CCAACAGATT GGCCAGAGTT GTGATAGATG AAGCACATTG CTTGAGTTCG TGGGGACATG ATTTCAGACC TGATTATCAG AGTATGGGGC TCTTTAAGGA GAGGTATCCC AATGTTCCAA TAATGGCCTT AACAGCAACG GCTAATGAAA AGGTAAGATT AGATATTGTC CACAATTTGA AAATGGAAAA TGCCGTTCTT TTGAAACAAA GTTTCAATAG AACAAACTTG TACTATGAAA TTAAGTGGAA AGCAGCCAAC TACGTCGAAT GGATAAAAGA TTACATTTTG AAAAACCAGA ATAATAAGAC GGGTATCATA TATTGTCATT CGAAACAGTC CTGTGAACAG ACAAGTGCTA AACTCAATCT GTTCGGGCTT CATACTGCTT TCTATCACGC TGGAATGTCT CCCCAAGATA GATTCGATAT CCAATCACAA TGGCAAACTG GAAGAATTCA GTTGATTTGT GCCACAATTG CTTTCGGGAT GGGAATTGAT AAGCCCGACG TCAGATATGT CATCCATTTA TTCATTCCTC GAAGTTTGGA AGGATACTAC CAAGAAACCG GAAGAGCTGG AAGAGATGGC AAACAATCTG ATTGTATTAT GTTTTACTCT TATAAGGACG CTCGACTGCT TCAGAGTATG ATACAAAGAG ACGAAGAATT GACGAAGGAA GGGAAAGAAA ATCATCTTGC TAAGCTTAGA CAGGTGGTTC AATATTGTGA GAACACAACT GACTGTAGAA GACAGCAGGT TTTGCAATAC TTCAACGAGT CTTTCAGCCC GGCAGATTGT CGAAAGCAAT GTGACAATTG CCAAAATTCA ACTGGTGTTT CAGTGGTCGA AAGAAACTGT ACAGAGTATG CCAAAAATAT CATAAATTTG GTGCAATCTA TCCAGGAAGA AAGAGTGACA GTACTTCATT GCCAGGATGT CTTTAAGGGA GCCAGAAATA GCAAAATTAT GAAAATGGGA CATAATCTTA ATCCGTATCA TGGGAAAGGG AGCTCCTTGG ACAAGACAGA CGTCGAGAGA ATCTTTTTCT ACCTATTGAG TGAAGAGTGC CTTGTAGAAT ACCAGATCAT GAAAGGTGGA TTTGCGTCGA ACTACGTCCG TACCGGCAAG AACGCATACC AAGTTTTGAG AGGAATGAAG CAGATCCAGA TTCAATTCAG CACTGAGAAA AGAGTACGGC AAAATACTGG AAATGCATCA TCGACTACTA GTACATCAGC AGTACACTCC AATCTCAACA GTTTCAAATA TCGTGAATCG TTTGTCACAG CACGTGAGGT ATCGAGGATG AACTCGATGA 
ATAGTAATGT ACCGATAACT CTCCCGCAAA CAAGAATGCT GTCTGATGGA AGTGGTGTCA CTGTTGAACA GGCTAATCAT GCTTATAATG AGCTTAATAA GATAAGAATT GAAGCTCTGT CGGAGATTGG CATCCCATTG AGTCAGTTTG TCAGCGAGAT ATCTCTACGA GAAATGTCTA ATAAATTGCC GACGAACAAG AGAGACTTCT CTAAAATTCA GGGTATATTA AAGGAACAAG TTGAATATTT CACTCTATTC AAGAAGACGT TGGGTATACT ATCTAGAGAA AGGAAAAAGC AATCACCCAA CTCGAGTTTT GTTAGTAATT CAGACATAGC CAGTGCTGGT GCAGATATGT CTATTTCGCC ATATTTTCCA CCACCCCAAC CTGAACGTCA TGTTTTGGAT AACCTAAGG
|
Protein sequence | MINSNNLSVQ VAWLQREAPH LPKKNLVDLV LQPFPESTRL LGRTGTGTNG EIRNGVSVLI PSNSGNSNIN NRNSNAQNNN STSINTNDSI TNTNKTNSTY NNNNSISNTG GSDLRQTANA NNRSENSPNL DTLSDSEVAL LKKLIELQDM KLALYDERFK VSESTSISFD AKKDFYKKTF EPKSSAIEVQ INNVRKEIYN SNSSHVQVNN SISVPEKLSV PSTFPASINS VPIVNEVEDD FGEGTMDGLR TPTQERDEVN DLGSFIADDY LESDVDGSFQ NDSDHSEEET EGAIAPDEID DIRLSPDVAG KLGVRYVDQP APIALVDSDS ESVHKEYADD DYDDDEIEEI EDFTTQLNEE RELNNDVIDL ISDQEGENDL FEQHLPTNLA ESGTHSKALG ESTNIDRHIA PKVESDLEFS DDDDELMNIL NNQQPIVGNG PNKENIPPGS EHFIDEVYSV LNSVFKLQSF RSNQLEAVCA SLQSKDVFVL MPTGGGKSLC YQLPALVKGG KTNGTTVVIS PLISLMQDQV QHLLDKNVKA GMISSKATAE ENKQTMHLFR EGFLDLVYLS PEKANTSNVV QKIISKLYET NRLARVVIDE AHCLSSWGHD FRPDYQSMGL FKERYPNVPI MALTATANEK VRLDIVHNLK MENAVLLKQS FNRTNLYYEI KWKAANYVEW IKDYILKNQN NKTGIIYCHS KQSCEQTSAK LNSFGLHTAF YHAGMSPQDR FDIQSQWQTG RIQLICATIA FGMGIDKPDV RYVIHLFIPR SLEGYYQETG RAGRDGKQSD CIMFYSYKDA RSLQSMIQRD EELTKEGKEN HLAKLRQVVQ YCENTTDCRR QQVLQYFNES FSPADCRKQC DNCQNSTGVS VVERNCTEYA KNIINLVQSI QEERVTVLHC QDVFKGARNS KIMKMGHNLN PYHGKGSSLD KTDVERIFFY LLSEECLVEY QIMKGGFASN YVRTGKNAYQ VLRGMKQIQI QFSTEKRVRQ NTGNASSTTS TSAVHSNLNS FKYRESFVTA REVSRMNSMN SNANHAYNEL NKIRIEASSE IGIPLSQFVS EISLREMSNK LPTNKRDFSK IQGILKEQVE YFTLFKKTLG ILSRERKKQS PNSSFVSNSD IASAGADMSI SPYFPPPQPE RHVLDNLR
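
The Translation table value of 12 refers to the alternative yeast nuclear genetic code, in which CTG is read as serine rather than leucine. The sketch below illustrates the effect on the first 30 bases of the gene sequence listed above. It is a minimal, dependency-free example: the helper names `build_table` and `translate` are hypothetical, and only the opening of the first exon is translated because the full gene sequence contains introns that would have to be removed first.

```python
BASES = "TCAG"

# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Table 12 differs from the standard code at a single codon: CTG -> S.
YEAST_ALT = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(amino_acids: str) -> dict:
    """Map each codon to its amino acid for a 64-letter code string."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, amino_acids))


def translate(dna: str, table: dict) -> str:
    """Translate DNA in frame 1, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base groups of the gene sequence in this record.
first_30 = "ATGATAAATT CAAATAACCT CCTGGTGCAG"

print(translate(first_30, build_table(YEAST_ALT)))  # MINSNNLSVQ
print(translate(first_30, build_table(STANDARD)))   # MINSNNLLVQ
```

The table 12 result matches the first ten residues of the listed protein sequence, whereas the standard code would place a leucine at position 8.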
|
| |