Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_46944 |
Symbol | TOP1 |
ID | 4839471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 213595 |
End bp | 215937 |
Gene Length | 2343 bp |
Protein Length | 780 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640390786 |
Product | DNA topoisomerase I |
Protein accession | XP_001385059 |
Protein GI | 150865726 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3569] Topoisomerase IB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000227493 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCCT CTGAAGACGA AATCGTTCTT TCGAAGAGAG TCAAAAAAAC TTCCAAAAAG AACGGATCCA TGACCAGCAC TGCTGCCAGT GAACTCGACG ATGATCTCCC ATTGTCACAG CGTAACAACG GTGTTGTCAA ATCCAAAGAC ACATCTGTAG ATGAAGACTA CGAAGAGCCA ATCGCAGAAA AGGTGAACAG AAAACGTAAG AGTGAAAATG GCTCTTCTAC AGTGCCTAAG AAAACAAAGA AGGTCAAAAC CGAGACAGAT GCATCAGCGA AGAAGTCAGA TAAAGAACCG AAACAAAAAA GGGAAACAAA ACCAAAGAAA GAAACAAAGT CCAAAAATGC TGCAGTGAAA GCAGAGAAGG ATGAAGACGT TCCTACTTCG CAAAACGAAG AGAAGGATGA AGATGAAGAT GAAGGCTATA AATGGTGGGA AGCTGAAGAC GTTGATGGAG TTCAGAAATG GGAAACTTTG GAACACAATG GTGTTCTTTT CCCACCTGAG TATGAACCTC TCCCTCTGCA TGTGAAGTTG TACTATGATG GGAAACCAGT GAAGTTGTCC TTAGAAGCTG AAGAAGTCGC TGGATTCTAT GGTGCCATGT TGGAAACAGA TCATGCCAAA AACCCTGTTT TCCAAAAGAA CTTCTTCGGT GACTTCTTAG ACGTAATCAA GGAAACTAAT GGTTCTGATG TTGAAATCAA AGACTTTGAA AAACTCGACT TCTCCAAGAT ATTCGCTCAC TTTGAGAAAC TCAGAGAGGA GAAAAAGCTT CTCACGAAGG ATCAAAAGAA AGCCATGAAG GAAGAAAAGG AGAGAATTGA AGAACCATAC AAGACTTGTT TATTCAACGG TCACAAGGAA CTCGTAGGTA ATTTCAGAGT AGAACCTCCA GGTTTGTTCA GAGGTAGAGG AGCCCATCCT AGAACTGGTA AGTTGAAGAG AAGGGTCTAC CCTGAGATGG TTACTTTGAA CATTGGAGCT GGTGCTAAGA TACCTGAAGC TCCTCCGGGT CATAGCTGGG GTGAAATCAA GAACGATAAC ACCGTTACTT GGTTGGCTAT GTGGAGAGAA AACATCGCCG ATTCATTCAA GTATGTTAGA TTTGCTGCCT CGTCTTCCAT TAAGGGTGTT TCTGATTTCA AAAAGTTTGA AACGGCCAGA AGATTACGTA GTCATGTAGA TGCCATCAGG AAGGACTACA CTAAGATGTT GAAGAGCGAA TTAATGCAAG ATAGACAGAT GGCATCTGCA ATTTATCTTA TTGATGTGTT TGCATTGAGA GCTGGTGGCG AAAAGGGTGA CGATGAAGCG GACACTGTTG GGTGTTGTTC TTTAAGATAT GAGCATATTA CCTTGAAACC CCCTAACAAG GTTATCTTCG ACTTCTTGGG TAAGGATTCC ATCAGATTCT ACCAAGAAGT TGAAGTTGAC AAGCAAGTGT TCAAGAACTT GAGGATCTTC AAGAAAGCGC CTAAACAACC CGGTGATGAC TTGTTTGATA GAATCAATCC TACGATGTTG AACAAGCAAT TGCAGAATTA CATGAAAGGC TTGACAGCTA AAGTTTTCCG TACCTATAAT GCCTCGAAGA CAATGCAAGA TCAGTTGGAT TTGATTCCAA ATGAAGGCAC AGTAGCCGAA AAAGTTGTGA AGTTCAATGC TGCTAATAGA ACTGTTGCTA TCTTGTGTAA TCACCAGCGT ACGGTAAGTA AAGGACATGG CAGTTCTGTT CAGAAAATCA ATGACAAGTT AAAGGAGTTG ATGTGGCAGA AAATAAGATT GAAGAGAATG ATACTTGTTT TAGAACCAAA ATTGAAGAAT AAGCAGCTGC ATTATTTTTG TGAAATCGAT GATCTTGCAA AGGAAGATGA AGAGCACATT CATCACACAG TAATTGCTAG ACAAAGAGAA CAGGTCTTGA AGAAGATGCA AAGAGATAAT GAAAAACTAA AATTGGAAAA GCAGGAGATT TTGACTGAAA AATCAGATGA AATCAAAGAA AAGATGGCCA AGATTGATGA TCTTGAGAAG GAATACAAGG CTGAATTGAA TGGCGCAAAA CCAGAAGTAA AGAAGAATCT CACTGTGGAG AAGTTGCAGC AGCAGGTTGA AGTGATTGAA AACAGAATTG TTACCACGAC TCTTCAATTG AAAGATAAGG AAGACAATTC TGAAGTTTCC TTAGGTACAT CCAAGATGAA CTATATCGAT CCAAGATTAA CGGTGATGTT TTCGAAGAAG TTCGATGTTC CCATCGAGAA ACTCTTCACC AAGACCTTGC GTGACAAATT CAAATGGGCC ATCGAATCAG CAGATGAAAA CTGGAGATTC TAA
|
Protein sequence | MSSSEDEIVL SKRVKKTSKK NGSMTSTAAS ELDDDLPLSQ RNNGVVKSKD TSVDEDYEEP IAEKVNRKRK SENGSSTVPK KTKKVKTETD ASAKKSDKEP KQKRETKPKK ETKSKNAAVK AEKDEDVPTS QNEEKDEDED EGYKWWEAED VDGVQKWETL EHNGVLFPPE YEPLPSHVKL YYDGKPVKLS LEAEEVAGFY GAMLETDHAK NPVFQKNFFG DFLDVIKETN GSDVEIKDFE KLDFSKIFAH FEKLREEKKL LTKDQKKAMK EEKERIEEPY KTCLFNGHKE LVGNFRVEPP GLFRGRGAHP RTGKLKRRVY PEMVTLNIGA GAKIPEAPPG HSWGEIKNDN TVTWLAMWRE NIADSFKYVR FAASSSIKGV SDFKKFETAR RLRSHVDAIR KDYTKMLKSE LMQDRQMASA IYLIDVFALR AGGEKGDDEA DTVGCCSLRY EHITLKPPNK VIFDFLGKDS IRFYQEVEVD KQVFKNLRIF KKAPKQPGDD LFDRINPTML NKQLQNYMKG LTAKVFRTYN ASKTMQDQLD LIPNEGTVAE KVVKFNAANR TVAILCNHQR TVSKGHGSSV QKINDKLKEL MWQKIRLKRM ILVLEPKLKN KQSHYFCEID DLAKEDEEHI HHTVIARQRE QVLKKMQRDN EKLKLEKQEI LTEKSDEIKE KMAKIDDLEK EYKAELNGAK PEVKKNLTVE KLQQQVEVIE NRIVTTTLQL KDKEDNSEVS LGTSKMNYID PRLTVMFSKK FDVPIEKLFT KTLRDKFKWA IESADENWRF
|
| |