Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31531 |
Symbol | |
ID | 4838374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 1021260 |
End bp | 1022504 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 12 |
GC content | 36% |
IMG OID | 640389689 |
Product | predicted protein |
Protein accession | XP_001384151 |
Protein GI | 150865082 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1697] DNA topoisomerase VI, subunit A |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.288827 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0593553 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTCG AAGTAGCTTT TGGCTGCAAA TACAGTTATA ACAACACAGT CATGAAGCAC CACCTAAAGT TCATGGCAAT AAAGGCCGAG TACTCCTGTG AGGCGTCCTT CAGTTTAGCA GTATGCTTGA AAAGTGAAAG TAGTGGAGAC GTCCAAATCT TCCATTCACA AGAAAATCAT AACAAATCAA AAGAGAACAT GATCTTGACT TCCAAGTGGA TGGACATTGT ACAAACAATA AATTCTGAAC GTGAGTTCGT TCTTTCGTAT GGTAGCAGAA ATGGTTCGAA GATACACTTT TTAAGGCAGT TTGCAGACCT GGATATATTG GTCCAAAGAT TTACTGCCAC ATTAAAGGTG TTGAAGATCC TACTTCTACA GGCACAATCA AATTCTAAGA AATCAACAAC AATAAGAGAT ATTTACTATC AAGATGTTGA AGCCTTTCAC TGGAAACAAA GATATTGCAA TGAAATTTTG CACCTGATTG TTGTCGATTC CTTGGGTTTG AGTTTGGAAC ACAATTTCAG CATATACCCT TCTCAAAAGG GTCTTGTGTA TGGTGATTTT GCAATACAGT CTAATGAAGG AACTATATTT CAAATGAGCT ATTCTGAAGA ACCAGTTTTA ATTCCTCTAC ACACCAAGTT TGAACACATC TTACCAAATG AGGAAAGTCA CTATGCTATA GTCATTTTAG AAAAAGAAGC TGTTTTCCAA TCCTTTTGCT ATTACATCAA GACCAGATAT ACGTTGAAAG ACAATTTCGT ACCTGACAAC TTAATTGTGG TGACTGGAAA GGGGTTTCCA GATAATTTGA CCAAGAAATT TGTCAATATC CTTGCAAATA CTGCATTCAC CAATTCAGTA ACTCTAGGCT TTTTCGACTC TGACGTTTAT GGAATAAATA TCTGCAAAAA TTACCAAGAT GAAATTGCTA CGGAATCTAA ATCAAGAGAT ATCTATGCTG GGGTCTATTT GATGGACTAC ATTGCTGGAT GGAGTGACAT TACGGCTAGA GAAAGGATAC TAATAATGAG CACAATTACG AAGATAACTA CGGTGTATCC CACTATTCAA AATAAAAGAT TTCACAGAGA GTTAACGAGA GGATTGTGGT TGTCCAAGAA ATGTGAAATG AACGTATACC AAGGTGATGC AGACCAATCA GAAGGAATTT CATCAATTGC GATCAATGAA TACATACTAT CTCAAATCAA TTCCCACAAA AAAGTGATCA AATAA
|
Protein sequence | MKFEVAFGCK YSYNNTVMKH HLKFMAIKAE YSCEASFSLA VCLKSESSGD VQIFHSQENH NKSKENMILT SKWMDIVQTI NSEREFVLSY GSRNGSKIHF LRQFADSDIL VQRFTATLKV LKILLLQAQS NSKKSTTIRD IYYQDVEAFH WKQRYCNEIL HSIVVDSLGL SLEHNFSIYP SQKGLVYGDF AIQSNEGTIF QMSYSEEPVL IPLHTKFEHI LPNEESHYAI VILEKEAVFQ SFCYYIKTRY TLKDNFVPDN LIVVTGKGFP DNLTKKFVNI LANTAFTNSV TLGFFDSDVY GINICKNYQD EIATESKSRD IYAGVYLMDY IAGWSDITAR ERILIMSTIT KITTVYPTIQ NKRFHRELTR GLWLSKKCEM NVYQGDADQS EGISSIAINE YILSQINSHK KVIK
|
| |