Gene Information |
Locus tag | PICST_34564 |
Symbol | TOP3 |
ID | 4851720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2655678 |
End bp | 2657528 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | |
GC content | 47% |
IMG OID | 640393428 |
Product | DNA topoisomerase |
Protein accession | XP_001386841 |
Protein GI | 126275387 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | |
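The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Expected residue count for an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode a residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(2655678, 2657528)
print(length)                  # 1851
print(protein_length(length))  # 616
```

Both values agree with the Gene Length (1851 bp) and Protein Length (616 aa) fields listed in the record.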
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.496706 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCC TATGTGTGGC CGAAAAGCCG TCCATCGCCC GCGAGGTGTC GCGAATACTC AGTGGAGGCA GATTCACCAC GAGAAACTCC CGGAACAAGT TCATCAAAAA CTACGACTTC ACCTACAACT TCACCAGCCT CGGTATATGC GACGTTACAA TGACTTCTGT GGTGGGGCAC ATCACCAACA TGGACTTCCC GCCCGCGTAC CAGTGGGGAC GGTGTGTTCC CGGAAGGTTG TTTGATGTAG AGGTGATCGA GAAGGTCACC AAACAAGACG TGTTCGACAA CATTTCCAAC GAAGCGCGTA CCGCGTCCAG GTTGATGATC TGGACAGACT GCGATCGCGA AGGCGAGTTC ATTGGGTTTG AGATTTATAA AGCCGCTTTC AAGGGAAACA GCGCCATCCA GGTCGGGGAC ATCTGGCGTA GCCAGTTTTC GCATCTAGAA CGAAGCCATA TTATTGATGC TGCATCACAT CCACGGCTGC TTGATATGAA CTCCGTCAAC GCTGTCGCGT GCCGTATGGA AATAGACTTC CGTGTTGGCA CCAGTTTCAC GCGGTTGCTC ACTGACTGCT TGAAACAGAA CAGAATCATC GAGAAAGGAG GGCTAGCCTC TTACGGAACC TGTCAGTTCC CCACCCTCGG CTTCGTAGTT GATAGATACA AAAGGGTCAA ATCGTTTATC CCCGAGAAGT TTTGGTACAT TGCGGTAGAT ATCCGTAAAC AGAACCAGAA GTCGTCGTTT GCTTGGACAA AAGGTCGTTT CTTCGACCGC ATGTTTGTGA CACAGTTGTA CCAGGATTGT CTCCAGACAG AAGAAGGAAC CATAACCAAT GTAGAAAGTA AACGAACCAC CAACTGGAGA CCCTTGCCGT TGACTACAGT CGAGCTCCAG AAGGATTGTG CGCGATTCTT CAGGATGAGC GCAAAAGCAG CTCTAGATGC TGCCGAAAGA TTGTATAACA AGGGGTTTCT TTCCTACCCA AGAACAGAAA CTGATAGTTT CCCAGCTACC ATGGACTTTG CTGGTGTAAT TGCCAAACAG ACTGGTGATG GAAGATGGGG CGCCTATGCA AATCTGTTGA TGGCTAACGG CCATGAAATG CCGAGAAATG GTAGCCACGA CGATAAAGCG CATCCAGCTA TCCACCCAGT AAACTACGTA GCTATAGATT CGTTGACTTC GGCAGACGAA AAGAAGGTAT ACGAATATGT GGTCCGTAGA TTTTTAGCCT GTTGCTCGAA AGATGCTGTA GGTCACCAGA CAACAGCCAC TTTGCAATGG GGTAATGAAA CATTCACAGC CAGTGGGTTG ATTGTTACTG AGAAAAACTA TTTGGAAATA TATACATACA AGAAGTGGGA AACTACAAAG CAATTGCCGC CTCTAGAGGA GGGAGAAAAA GTCCGTATCT CCAGTGGCCA AATGAAGGAA GGAGAAACCA GCCCTCCCAA CCACATGACA GAAACAGAAT TGATTGCATT AATGGATGCC AATGGTATCG GAACGGACGC CACTATCGCA GAGCATATTG AGAAAATCAT GCAGCGAGAT TACATCGTCA AACACAAACA AGGTGGAAAG GAATACATTG TTCCTACTCC GTTGGGTATG GGGCTAATTG AAGGATTTGA CCAGATGGAG TTTGACAATA TTTCTCTCTC GAAGCCGTTT TTACGTAAGT TGTTAGAGAA TTCTCTTCAG AAAATCGTGG ATGGCGAGCG CACCAAAGCT GACGTCCTTG AAGAAGTTAA ACAAATCTAT CGACAGGCGT ACGGAGTCAG CTCTCAGAAG 
ATGACGTTGT TGGCTCTGGT ATGTCGACAG ATAATTGCAC AGAATCTGTG A
|
Protein sequence | MKILCVAEKP SIAREVSRIL SGGRFTTRNS RNKFIKNYDF TYNFTSLGIC DVTMTSVVGH ITNMDFPPAY QWGRCVPGRL FDVEVIEKVT KQDVFDNISN EARTASRLMI WTDCDREGEF IGFEIYKAAF KGNSAIQVGD IWRSQFSHLE RSHIIDAASH PRLLDMNSVN AVACRMEIDF RVGTSFTRLL TDCLKQNRII EKGGLASYGT CQFPTLGFVV DRYKRVKSFI PEKFWYIAVD IRKQNQKSSF AWTKGRFFDR MFVTQLYQDC LQTEEGTITN VESKRTTNWR PLPLTTVELQ KDCARFFRMS AKAALDAAER LYNKGFLSYP RTETDSFPAT MDFAGVIAKQ TGDGRWGAYA NLLMANGHEM PRNGSHDDKA HPAIHPVNYV AIDSLTSADE KKVYEYVVRR FLACCSKDAV GHQTTATLQW GNETFTASGL IVTEKNYLEI YTYKKWETTK QLPPLEEGEK VRISSGQMKE GETSPPNHMT ETELIALMDA NGIGTDATIA EHIEKIMQRD YIVKHKQGGK EYIVPTPLGM GLIEGFDQME FDNISLSKPF LRKLLENSLQ KIVDGERTKA DVLEEVKQIY RQAYGVSSQK MTLLALVCRQ IIAQNL
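The relationship between the gene and protein sequences above can be sketched in a few lines. This is an illustrative example, not the database's own method: the record leaves the Translation table field blank, so the sketch assumes the standard genetic code (NCBI table 1), and the helper names `clean`, `gc_percent`, and `translate` are hypothetical. The sequences are listed in space-separated blocks of ten, so whitespace is stripped before any calculation.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
# Using this table is an assumption; the record does not state one.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
STANDARD_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str, table: dict = STANDARD_TABLE) -> str:
    """Translate an in-frame CDS, dropping one trailing stop codon if present."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(table[s[i:i + 3]] for i in range(0, len(s), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence listed above.
print(translate("ATGAAGATCC TATGTGTGGC CGAAAAGCCG"))  # MKILCVAEKP
```

Under this assumed table, the first ten codons of the listed gene sequence give the first ten residues of the listed protein sequence. Passing a different codon dictionary as `table` would let the same function be used with another genetic code.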
|
| |