Gene PICST_641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_641 
SymbolSMC3 
ID4838830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1555970 
End bp1559002 
Gene Length3033 bp 
Protein Length1011 aa 
Translation table12 
GC content39% 
IMG OID640390145 
Productchromosome condensation and segregation protein 
Protein accessionXP_001384253 
Protein GI150865152 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG1196] Chromosome segregation ATPases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.15922 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACATCA AGAAGATCAT CATCCAGGGC TTCAAGACGT ACAAGAATGC CACGGTTATA 
GATCTCGTGT CTCCACACCA CAATGTTGTG GTGGGACGAA ATGGGTCTGG AAAATCCAAT
TTTTTCGCCG CGATCAGATT CGTATTATCC GACGACTACA CCCATATGGG CCGGGAAGAA
AGACAGGCGT TGATCCACGA GGGCTCCGGA ACTGTGATGT CAGCATATGT AGAGATTGTA
TTCGACAATC GCGATGGCAG AATTCCTCTT AACAGAAACG AAGTAGTAAT CAGAAGAACT
ATCGGTTTGA AGAAGGACGA TTACGCGTTG GATGGAAAAT CAGCTACGCG TTCTGATATC
CTCAATCTTT TGGAAAGCGC AGGGTTTTCG AGATCAAACC CCTACTATAT AGTTCCGCAG
GGAAGGATTA CCAGTTTGAC CAATGCCAAG GATTCGGATA GACTCGTCTT GTTGAAAGAA
GTCAGTGGAG CCACAGTGTT TGAAAACAAG CTCAAGGAAT CCGAAAAGGA AATGAATAAC
TCCACTTATA AAAAGCAGCG TATTGATGAA ACTCTTGCTT CCATCGACGA AAGATTGTCG
GATTTACAGA TCGAATCAGC CGACTTGAAG AAATTTCAAA GTTTGGATAA ATCCAAGAAG
ATCTTGGAAT ATAATTTGTT TGACCGTGAG TTCACGGACT TGAAGACCTC CATAGACGAA
ACTGACGAAA CATACAACGA ACTTTTGACA GAATCGCAAC AAGATCTCCA GGATCTAGAT
AATCGTGAAA AGTTATGTCA GCAGTTATCT GACACAATTA ACGACTTAAA GATCTCTATA
AAGGTATCGC AATTGAATAA GGAGCAGTCT GATTTGGATT ACAACCAGAT GTTGAAAATA
GTAGCAGAGA AGGAAGTCAA ACTAAGCGAC TTGAAATCTA CGCTCCATTC GTCGCGTCAT
AATGTGGAAG AAGTGAACCG CCAAATTGTA AAGTACAGAC AGGTGATTGC AGAACACGAA
TCCAAGGTAT CTAGCTTGAA ACCCCAGTTA GATAGCTTGC AAGGAAGAGA ACGTGAATAC
AAGGAAAAAT TGGTAGATTT AACTTCTAAA CAAAGAGCTT TATATTCTAA ACAGAATAGA
TTTCTGAAGT TCAAGACCAA AAGAGAGAGA GATACGTGGT TAACGACAGA AATATCGACT
CTTAAGAAGC AGCTTCAATC TAAGGAAACC GATATCAGTC AGTTGAATTC AGAAATCAAG
AACCAGGAAA GTGATATCAG TGGATGGAAT ACTCAGATTG ATAAGTTAAA TGGACAACTC
CACGACGGAT CCCATTCCAA TTCAATCCAC AAATTGAAGA CCACTGTAGG TGATTTGAAA
CTGCAAATCA ATGAGCTCAA CGATAGGCGT AAGCTCTTGT GGCGTGATGA AATTAGATTC
AGAAGTATCT ATGATTCCAT CAATAATGAC TTAAACAATG CCAATAACAT GGTCAATCTG
ACTATGGATA GAGCACAGGC CCAAGGACTT GCTGCTGTGA AAACTATTGC TGTCAACTTG
AACTTGACTG AGAACGTTTA TGGTCCATTG GCTTCGTTAT TCAGTGTCAG CGACAAATAC
AAGGTAGCTG CTGAAGTAAT TGCAGGAAAC TCTTTGTTCC ATGTAGTTGT TGACAATGAT
AACACGGCAT CGTTGATAAT GAACGAGCTT GCAAGATCCA AAGCAGGAAG AGTAACTTTC
ATTCCGCTCA ATAGAATTGA CTTCTCACCA ATTGAGTTCC CAGATAGCAA TGAGCATCAA
TGTATTCCAT TGATAAAGAA ATTGAAATTC AATGAAAACG TTAGTAAAGC TATTCATCAA
GTTTTTGGCA AGACCATCGT TGTAGGTGAC TTACCAACAG GAGCAGAATT AGTTAGATCG
TACAATGTTT CTGCTATCAC ATTGGATGGT GACAGAGCTG ATAGAAGAGG AGTCTTGACA
GGTGGTTTCA GAGATTATAA AAGGTCGAGA ATAGACGCAT TGAAAATACA AGCAAAAAAG
AAGGTTGACT TAGAAAAGAT TGAATCTGAA TTGCAAGAAT GTGTCAAGGA AATTGAGCTG
GTCAATCAGA ATATTACATC TTTGAATAAT GAATATCAAC TCAGTGTCCG TGATTTGGAC
CGATTGCAGC AGGGACAAGA ACCAATTAAG ATTGAATTAT CTCAATTGTC CAACAAGAAG
TTTAATGTCG AACAAGAATT GAATAGTCTA AGATACAACG TTAGCAACGC CCAAGCAACA
AGAAGTACGT TGTTGATCAA AATTAAACAG CATGAAGGTG AATTGGATAA TAATTTCACT
CAATCATTGA GCGATGAGGA ACTTCAAACC TTGGAGGACT TAACTAGTCA AATAAAAGAC
GTGGAAGCGC AATTGGACAA GGTCGTCACT CAGTTGTCTG ATTCAGAAAC CCAGATTTCT
GCCTTGGATT CGGCGATGCT TAATGATTAC AAGCCAACTT TGTCGAAGCT TCTTAAACAG
AGTACTCTGT TTGGTGATAG TAATACAAAC GACGAAGAAG TAAGGTCACT TGAAAAGGAA
ATTAAAAATT TGCAAGTGGA ATTGCATTCT ATTCAGATAA GAAATGAATC TGCCACACAA
GAATTCGATA GAATCAGTAA GGAAATCGCA GATAGTGAAA ATTCATTGAA GAAAGCAAAT
GCTCAACAGC TTATTCTAAT CAAGAAATTG GAGAAGTTCT CGAAGTCTTC CGAACAAATC
TTGAACAGAA AGGCGATTTT GACAAACAGA CGTGAAGAAA TTCATAAAAA GATCAAGGAA
TTGGGTGTTT TACCTGAAGA AGCTTTTCAG GCGTCGAACT ACGATCAATA CAACTCAGAT
CAATTGTTAG AAAAGTTGAA CAAAGTCAAT GAAGACTTAT CCAAATACTC TCATATTAAC
AAAAAGGCAA TGGAACAGTA CAATACGTTC ACGAAACAGA GAGACGATTT GGTCAAAAGA
AGGGAGGAAT TGGATACTTC TCGTGAGTCC ATC
 
Protein sequence
MHIKKIIIQG FKTYKNATVI DLVSPHHNVV VGRNGSGKSN FFAAIRFVLS DDYTHMGREE 
RQALIHEGSG TVMSAYVEIV FDNRDGRIPL NRNEVVIRRT IGLKKDDYAL DGKSATRSDI
LNLLESAGFS RSNPYYIVPQ GRITSLTNAK DSDRLVLLKE VSGATVFENK LKESEKEMNN
STYKKQRIDE TLASIDERLS DLQIESADLK KFQSLDKSKK ILEYNLFDRE FTDLKTSIDE
TDETYNELLT ESQQDLQDLD NREKLCQQLS DTINDLKISI KVSQLNKEQS DLDYNQMLKI
VAEKEVKLSD LKSTLHSSRH NVEEVNRQIV KYRQVIAEHE SKVSSLKPQL DSLQGREREY
KEKLVDLTSK QRALYSKQNR FSKFKTKRER DTWLTTEIST LKKQLQSKET DISQLNSEIK
NQESDISGWN TQIDKLNGQL HDGSHSNSIH KLKTTVGDLK SQINELNDRR KLLWRDEIRF
RSIYDSINND LNNANNMVNS TMDRAQAQGL AAVKTIAVNL NLTENVYGPL ASLFSVSDKY
KVAAEVIAGN SLFHVVVDND NTASLIMNEL ARSKAGRVTF IPLNRIDFSP IEFPDSNEHQ
CIPLIKKLKF NENVSKAIHQ VFGKTIVVGD LPTGAELVRS YNVSAITLDG DRADRRGVLT
GGFRDYKRSR IDALKIQAKK KVDLEKIESE LQECVKEIES VNQNITSLNN EYQLSVRDLD
RLQQGQEPIK IELSQLSNKK FNVEQELNSL RYNVSNAQAT RSTLLIKIKQ HEGELDNNFT
QSLSDEELQT LEDLTSQIKD VEAQLDKVVT QLSDSETQIS ALDSAMLNDY KPTLSKLLKQ
STSFGDSNTN DEEVRSLEKE IKNLQVELHS IQIRNESATQ EFDRISKEIA DSENSLKKAN
AQQLILIKKL EKFSKSSEQI LNRKAILTNR REEIHKKIKE LGVLPEEAFQ ASNYDQYNSD
QLLEKLNKVN EDLSKYSHIN KKAMEQYNTF TKQRDDLVKR REELDTSRES I