Gene PICST_46944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_46944 
SymbolTOP1 
ID4839471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp213595 
End bp215937 
Gene Length2343 bp 
Protein Length780 aa 
Translation table12 
GC content40% 
IMG OID640390786 
ProductDNA topoisomerase I 
Protein accessionXP_001385059 
Protein GI150865726 
COG category[L] Replication, recombination and repair 
COG ID[COG3569] Topoisomerase IB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000227493 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCCT CTGAAGACGA AATCGTTCTT TCGAAGAGAG TCAAAAAAAC TTCCAAAAAG 
AACGGATCCA TGACCAGCAC TGCTGCCAGT GAACTCGACG ATGATCTCCC ATTGTCACAG
CGTAACAACG GTGTTGTCAA ATCCAAAGAC ACATCTGTAG ATGAAGACTA CGAAGAGCCA
ATCGCAGAAA AGGTGAACAG AAAACGTAAG AGTGAAAATG GCTCTTCTAC AGTGCCTAAG
AAAACAAAGA AGGTCAAAAC CGAGACAGAT GCATCAGCGA AGAAGTCAGA TAAAGAACCG
AAACAAAAAA GGGAAACAAA ACCAAAGAAA GAAACAAAGT CCAAAAATGC TGCAGTGAAA
GCAGAGAAGG ATGAAGACGT TCCTACTTCG CAAAACGAAG AGAAGGATGA AGATGAAGAT
GAAGGCTATA AATGGTGGGA AGCTGAAGAC GTTGATGGAG TTCAGAAATG GGAAACTTTG
GAACACAATG GTGTTCTTTT CCCACCTGAG TATGAACCTC TCCCTCTGCA TGTGAAGTTG
TACTATGATG GGAAACCAGT GAAGTTGTCC TTAGAAGCTG AAGAAGTCGC TGGATTCTAT
GGTGCCATGT TGGAAACAGA TCATGCCAAA AACCCTGTTT TCCAAAAGAA CTTCTTCGGT
GACTTCTTAG ACGTAATCAA GGAAACTAAT GGTTCTGATG TTGAAATCAA AGACTTTGAA
AAACTCGACT TCTCCAAGAT ATTCGCTCAC TTTGAGAAAC TCAGAGAGGA GAAAAAGCTT
CTCACGAAGG ATCAAAAGAA AGCCATGAAG GAAGAAAAGG AGAGAATTGA AGAACCATAC
AAGACTTGTT TATTCAACGG TCACAAGGAA CTCGTAGGTA ATTTCAGAGT AGAACCTCCA
GGTTTGTTCA GAGGTAGAGG AGCCCATCCT AGAACTGGTA AGTTGAAGAG AAGGGTCTAC
CCTGAGATGG TTACTTTGAA CATTGGAGCT GGTGCTAAGA TACCTGAAGC TCCTCCGGGT
CATAGCTGGG GTGAAATCAA GAACGATAAC ACCGTTACTT GGTTGGCTAT GTGGAGAGAA
AACATCGCCG ATTCATTCAA GTATGTTAGA TTTGCTGCCT CGTCTTCCAT TAAGGGTGTT
TCTGATTTCA AAAAGTTTGA AACGGCCAGA AGATTACGTA GTCATGTAGA TGCCATCAGG
AAGGACTACA CTAAGATGTT GAAGAGCGAA TTAATGCAAG ATAGACAGAT GGCATCTGCA
ATTTATCTTA TTGATGTGTT TGCATTGAGA GCTGGTGGCG AAAAGGGTGA CGATGAAGCG
GACACTGTTG GGTGTTGTTC TTTAAGATAT GAGCATATTA CCTTGAAACC CCCTAACAAG
GTTATCTTCG ACTTCTTGGG TAAGGATTCC ATCAGATTCT ACCAAGAAGT TGAAGTTGAC
AAGCAAGTGT TCAAGAACTT GAGGATCTTC AAGAAAGCGC CTAAACAACC CGGTGATGAC
TTGTTTGATA GAATCAATCC TACGATGTTG AACAAGCAAT TGCAGAATTA CATGAAAGGC
TTGACAGCTA AAGTTTTCCG TACCTATAAT GCCTCGAAGA CAATGCAAGA TCAGTTGGAT
TTGATTCCAA ATGAAGGCAC AGTAGCCGAA AAAGTTGTGA AGTTCAATGC TGCTAATAGA
ACTGTTGCTA TCTTGTGTAA TCACCAGCGT ACGGTAAGTA AAGGACATGG CAGTTCTGTT
CAGAAAATCA ATGACAAGTT AAAGGAGTTG ATGTGGCAGA AAATAAGATT GAAGAGAATG
ATACTTGTTT TAGAACCAAA ATTGAAGAAT AAGCAGCTGC ATTATTTTTG TGAAATCGAT
GATCTTGCAA AGGAAGATGA AGAGCACATT CATCACACAG TAATTGCTAG ACAAAGAGAA
CAGGTCTTGA AGAAGATGCA AAGAGATAAT GAAAAACTAA AATTGGAAAA GCAGGAGATT
TTGACTGAAA AATCAGATGA AATCAAAGAA AAGATGGCCA AGATTGATGA TCTTGAGAAG
GAATACAAGG CTGAATTGAA TGGCGCAAAA CCAGAAGTAA AGAAGAATCT CACTGTGGAG
AAGTTGCAGC AGCAGGTTGA AGTGATTGAA AACAGAATTG TTACCACGAC TCTTCAATTG
AAAGATAAGG AAGACAATTC TGAAGTTTCC TTAGGTACAT CCAAGATGAA CTATATCGAT
CCAAGATTAA CGGTGATGTT TTCGAAGAAG TTCGATGTTC CCATCGAGAA ACTCTTCACC
AAGACCTTGC GTGACAAATT CAAATGGGCC ATCGAATCAG CAGATGAAAA CTGGAGATTC
TAA
 
Protein sequence
MSSSEDEIVL SKRVKKTSKK NGSMTSTAAS ELDDDLPLSQ RNNGVVKSKD TSVDEDYEEP 
IAEKVNRKRK SENGSSTVPK KTKKVKTETD ASAKKSDKEP KQKRETKPKK ETKSKNAAVK
AEKDEDVPTS QNEEKDEDED EGYKWWEAED VDGVQKWETL EHNGVLFPPE YEPLPSHVKL
YYDGKPVKLS LEAEEVAGFY GAMLETDHAK NPVFQKNFFG DFLDVIKETN GSDVEIKDFE
KLDFSKIFAH FEKLREEKKL LTKDQKKAMK EEKERIEEPY KTCLFNGHKE LVGNFRVEPP
GLFRGRGAHP RTGKLKRRVY PEMVTLNIGA GAKIPEAPPG HSWGEIKNDN TVTWLAMWRE
NIADSFKYVR FAASSSIKGV SDFKKFETAR RLRSHVDAIR KDYTKMLKSE LMQDRQMASA
IYLIDVFALR AGGEKGDDEA DTVGCCSLRY EHITLKPPNK VIFDFLGKDS IRFYQEVEVD
KQVFKNLRIF KKAPKQPGDD LFDRINPTML NKQLQNYMKG LTAKVFRTYN ASKTMQDQLD
LIPNEGTVAE KVVKFNAANR TVAILCNHQR TVSKGHGSSV QKINDKLKEL MWQKIRLKRM
ILVLEPKLKN KQSHYFCEID DLAKEDEEHI HHTVIARQRE QVLKKMQRDN EKLKLEKQEI
LTEKSDEIKE KMAKIDDLEK EYKAELNGAK PEVKKNLTVE KLQQQVEVIE NRIVTTTLQL
KDKEDNSEVS LGTSKMNYID PRLTVMFSKK FDVPIEKLFT KTLRDKFKWA IESADENWRF