Gene Information |
Locus tag | PICST_641 |
Symbol | SMC3 |
ID | 4838830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 1555970 |
End bp | 1559002 |
Gene Length | 3033 bp |
Protein Length | 1011 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640390145 |
Product | chromosome condensation and segregation protein |
Protein accession | XP_001384253 |
Protein GI | 150865152 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG1196] Chromosome segregation ATPases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.15922 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACATCA AGAAGATCAT CATCCAGGGC TTCAAGACGT ACAAGAATGC CACGGTTATA GATCTCGTGT CTCCACACCA CAATGTTGTG GTGGGACGAA ATGGGTCTGG AAAATCCAAT TTTTTCGCCG CGATCAGATT CGTATTATCC GACGACTACA CCCATATGGG CCGGGAAGAA AGACAGGCGT TGATCCACGA GGGCTCCGGA ACTGTGATGT CAGCATATGT AGAGATTGTA TTCGACAATC GCGATGGCAG AATTCCTCTT AACAGAAACG AAGTAGTAAT CAGAAGAACT ATCGGTTTGA AGAAGGACGA TTACGCGTTG GATGGAAAAT CAGCTACGCG TTCTGATATC CTCAATCTTT TGGAAAGCGC AGGGTTTTCG AGATCAAACC CCTACTATAT AGTTCCGCAG GGAAGGATTA CCAGTTTGAC CAATGCCAAG GATTCGGATA GACTCGTCTT GTTGAAAGAA GTCAGTGGAG CCACAGTGTT TGAAAACAAG CTCAAGGAAT CCGAAAAGGA AATGAATAAC TCCACTTATA AAAAGCAGCG TATTGATGAA ACTCTTGCTT CCATCGACGA AAGATTGTCG GATTTACAGA TCGAATCAGC CGACTTGAAG AAATTTCAAA GTTTGGATAA ATCCAAGAAG ATCTTGGAAT ATAATTTGTT TGACCGTGAG TTCACGGACT TGAAGACCTC CATAGACGAA ACTGACGAAA CATACAACGA ACTTTTGACA GAATCGCAAC AAGATCTCCA GGATCTAGAT AATCGTGAAA AGTTATGTCA GCAGTTATCT GACACAATTA ACGACTTAAA GATCTCTATA AAGGTATCGC AATTGAATAA GGAGCAGTCT GATTTGGATT ACAACCAGAT GTTGAAAATA GTAGCAGAGA AGGAAGTCAA ACTAAGCGAC TTGAAATCTA CGCTCCATTC GTCGCGTCAT AATGTGGAAG AAGTGAACCG CCAAATTGTA AAGTACAGAC AGGTGATTGC AGAACACGAA TCCAAGGTAT CTAGCTTGAA ACCCCAGTTA GATAGCTTGC AAGGAAGAGA ACGTGAATAC AAGGAAAAAT TGGTAGATTT AACTTCTAAA CAAAGAGCTT TATATTCTAA ACAGAATAGA TTTCTGAAGT TCAAGACCAA AAGAGAGAGA GATACGTGGT TAACGACAGA AATATCGACT CTTAAGAAGC AGCTTCAATC TAAGGAAACC GATATCAGTC AGTTGAATTC AGAAATCAAG AACCAGGAAA GTGATATCAG TGGATGGAAT ACTCAGATTG ATAAGTTAAA TGGACAACTC CACGACGGAT CCCATTCCAA TTCAATCCAC AAATTGAAGA CCACTGTAGG TGATTTGAAA CTGCAAATCA ATGAGCTCAA CGATAGGCGT AAGCTCTTGT GGCGTGATGA AATTAGATTC AGAAGTATCT ATGATTCCAT CAATAATGAC TTAAACAATG CCAATAACAT GGTCAATCTG ACTATGGATA GAGCACAGGC CCAAGGACTT GCTGCTGTGA AAACTATTGC TGTCAACTTG AACTTGACTG AGAACGTTTA TGGTCCATTG GCTTCGTTAT TCAGTGTCAG CGACAAATAC AAGGTAGCTG CTGAAGTAAT TGCAGGAAAC TCTTTGTTCC ATGTAGTTGT TGACAATGAT AACACGGCAT CGTTGATAAT GAACGAGCTT GCAAGATCCA AAGCAGGAAG AGTAACTTTC ATTCCGCTCA ATAGAATTGA CTTCTCACCA ATTGAGTTCC CAGATAGCAA TGAGCATCAA 
TGTATTCCAT TGATAAAGAA ATTGAAATTC AATGAAAACG TTAGTAAAGC TATTCATCAA GTTTTTGGCA AGACCATCGT TGTAGGTGAC TTACCAACAG GAGCAGAATT AGTTAGATCG TACAATGTTT CTGCTATCAC ATTGGATGGT GACAGAGCTG ATAGAAGAGG AGTCTTGACA GGTGGTTTCA GAGATTATAA AAGGTCGAGA ATAGACGCAT TGAAAATACA AGCAAAAAAG AAGGTTGACT TAGAAAAGAT TGAATCTGAA TTGCAAGAAT GTGTCAAGGA AATTGAGCTG GTCAATCAGA ATATTACATC TTTGAATAAT GAATATCAAC TCAGTGTCCG TGATTTGGAC CGATTGCAGC AGGGACAAGA ACCAATTAAG ATTGAATTAT CTCAATTGTC CAACAAGAAG TTTAATGTCG AACAAGAATT GAATAGTCTA AGATACAACG TTAGCAACGC CCAAGCAACA AGAAGTACGT TGTTGATCAA AATTAAACAG CATGAAGGTG AATTGGATAA TAATTTCACT CAATCATTGA GCGATGAGGA ACTTCAAACC TTGGAGGACT TAACTAGTCA AATAAAAGAC GTGGAAGCGC AATTGGACAA GGTCGTCACT CAGTTGTCTG ATTCAGAAAC CCAGATTTCT GCCTTGGATT CGGCGATGCT TAATGATTAC AAGCCAACTT TGTCGAAGCT TCTTAAACAG AGTACTCTGT TTGGTGATAG TAATACAAAC GACGAAGAAG TAAGGTCACT TGAAAAGGAA ATTAAAAATT TGCAAGTGGA ATTGCATTCT ATTCAGATAA GAAATGAATC TGCCACACAA GAATTCGATA GAATCAGTAA GGAAATCGCA GATAGTGAAA ATTCATTGAA GAAAGCAAAT GCTCAACAGC TTATTCTAAT CAAGAAATTG GAGAAGTTCT CGAAGTCTTC CGAACAAATC TTGAACAGAA AGGCGATTTT GACAAACAGA CGTGAAGAAA TTCATAAAAA GATCAAGGAA TTGGGTGTTT TACCTGAAGA AGCTTTTCAG GCGTCGAACT ACGATCAATA CAACTCAGAT CAATTGTTAG AAAAGTTGAA CAAAGTCAAT GAAGACTTAT CCAAATACTC TCATATTAAC AAAAAGGCAA TGGAACAGTA CAATACGTTC ACGAAACAGA GAGACGATTT GGTCAAAAGA AGGGAGGAAT TGGATACTTC TCGTGAGTCC ATC
|
Protein sequence | MHIKKIIIQG FKTYKNATVI DLVSPHHNVV VGRNGSGKSN FFAAIRFVLS DDYTHMGREE RQALIHEGSG TVMSAYVEIV FDNRDGRIPL NRNEVVIRRT IGLKKDDYAL DGKSATRSDI LNLLESAGFS RSNPYYIVPQ GRITSLTNAK DSDRLVLLKE VSGATVFENK LKESEKEMNN STYKKQRIDE TLASIDERLS DLQIESADLK KFQSLDKSKK ILEYNLFDRE FTDLKTSIDE TDETYNELLT ESQQDLQDLD NREKLCQQLS DTINDLKISI KVSQLNKEQS DLDYNQMLKI VAEKEVKLSD LKSTLHSSRH NVEEVNRQIV KYRQVIAEHE SKVSSLKPQL DSLQGREREY KEKLVDLTSK QRALYSKQNR FSKFKTKRER DTWLTTEIST LKKQLQSKET DISQLNSEIK NQESDISGWN TQIDKLNGQL HDGSHSNSIH KLKTTVGDLK SQINELNDRR KLLWRDEIRF RSIYDSINND LNNANNMVNS TMDRAQAQGL AAVKTIAVNL NLTENVYGPL ASLFSVSDKY KVAAEVIAGN SLFHVVVDND NTASLIMNEL ARSKAGRVTF IPLNRIDFSP IEFPDSNEHQ CIPLIKKLKF NENVSKAIHQ VFGKTIVVGD LPTGAELVRS YNVSAITLDG DRADRRGVLT GGFRDYKRSR IDALKIQAKK KVDLEKIESE LQECVKEIES VNQNITSLNN EYQLSVRDLD RLQQGQEPIK IELSQLSNKK FNVEQELNSL RYNVSNAQAT RSTLLIKIKQ HEGELDNNFT QSLSDEELQT LEDLTSQIKD VEAQLDKVVT QLSDSETQIS ALDSAMLNDY KPTLSKLLKQ STSFGDSNTN DEEVRSLEKE IKNLQVELHS IQIRNESATQ EFDRISKEIA DSENSLKKAN AQQLILIKKL EKFSKSSEQI LNRKAILTNR REEIHKKIKE LGVLPEEAFQ ASNYDQYNSD QLLEKLNKVN EDLSKYSHIN KKAMEQYNTF TKQRDDLVKR REELDTSRES I
|
| |
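The length fields in this record can be cross-checked against its coordinates and against translation table 12. The following is a minimal sketch rather than part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that translation table 12 refers to the NCBI alternative yeast nuclear code, whose amino-acid assignments differ from the standard code only in reading CTG as serine. The helper name `translate_table12` is hypothetical.

```python
# Sketch: sanity-check the record's length fields and translate with table 12.
BASES = "TCAG"

# Amino acids for the 64 codons in TCAG order under NCBI translation table 12
# (alternative yeast nuclear code). Assumption: only CTG differs from the
# standard code, being read as S (serine) instead of L (leucine).
TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TO_AA = {
    b1 + b2 + b3: TABLE_12[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_table12(dna):
    """Translate a DNA string (spaces allowed) codon by codon with table 12."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TO_AA[seq[i:i + 3]] for i in range(0, usable, 3))


# Values copied from the record above.
start_bp, end_bp = 1555970, 1559002
gene_length = end_bp - start_bp + 1   # inclusive coordinates (assumed)
protein_length = gene_length // 3     # codons in the listed sequence

# First three 10-base blocks of the gene sequence as listed.
first_block = "ATGCACATCA AGAAGATCAT CATCCAGGGC"

print(gene_length, protein_length, translate_table12(first_block))
# prints: 3033 1011 MHIKKIIIQG
```

The printed values agree with the Gene Length and Protein Length fields and with the first ten residues of the protein sequence. The listed 3033 bp correspond to exactly 1011 codons, and the last codon shown (ATC) encodes the final residue I.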