Gene PICST_64147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_64147 
SymbolSMC1 
ID4841167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp661198 
End bp664920 
Gene Length3723 bp 
Protein Length1240 aa 
Translation table12 
GC content39% 
IMG OID640392482 
ProductStructural maintenance of chromosome protein 1 (sister chromatid cohesion complex Cohesin, subunit SMC1) 
Protein accessionXP_001386531 
Protein GI150866808 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG1196] Chromosome segregation ATPases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.471232 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAGGC TAATCGGCCT TGAGTTACAC AACTTTAAGT CGTACCGAGG AACAACAAAA 
ATCGGATTTG GTAGCTCGTT TTTCACCTCC ATCATTGGCC CAAATGGTGC TGGAAAATCA
AATTTGATGG ATGCAATTTC GTTTGTATTA GGAGTCAGAT CCTCTCATTT GAGATCGCAA
AACTTGAAAG ACTTGATTTA CCGAGGAAGA AGAACAAACG GCAACAGCGA TCTTTCTGTA
GACGAATTGG AACAGGATCC TAATAGAGCC CATGTGACTG CTATCTATGA AAAAGACGAT
GGAGAAATCG TCAAGTTCAA GAGAACAATT TCGTCCTCAG GAAACAGCGA ATACCGCGTC
AACGACGTGT CTGTCACGTC ATTAAACTAT TCATTGGTCT TAAAAGCTGA AAACATTCTT
ATCAAAGCGC GCAACTTCTT GGTATTCCAA GGCGATGTAG AACAGATAGC ATCGCAGTCG
CCAACTGATC TCACAAAACT CATTGAGAAT ATCAGTGGTT CCAACGAATT TACCAAAGAA
TACGAAAGCT TGAAGGAAGA ATATGAAAGA GCCAGAGAGT TCTCGAATTC CGTATTCTCA
CGTAAAAGAA ACTTAAACTC TGAATCAAGA CAGTATAAGG AGCAGTTGAT AGAGCAAAGG
CAGTTCGAAG AAAGATTAAT GGAAAAAAAT GAGACCATAA AGAGAATCAA CTTATACCAT
ATTTATCACA ATGAGCGCAA GCACTTCCAA ATTCAAGAAG AAATTGATGC CAAGACTGCT
GAACTCAAAG AATTGAAAAA AGGTCTCTCT TCCGAACAGA AGCAGTTCAA AACGATCTCT
GCCGATTATT CCAAGAAGGT ATTGGAATCC AAAAAGCACA CTAAGAAATT GGAACAGGTT
GCAACGCAAA TTGAAAGCGC AAAGAGAGAT CTCATACCAA TGCAAGCTAA CAAGAGAGCA
ATGACTTCGA AGGCGAACTC AACGAAGACT AAAATTGAAG ATTTGCAAGC AGATTTGAAA
AGACAGAAGG CGCTGGCAAG CTCAGTTCAA AAACAATTAG ACCAAGCTCA GAAGTTATTT
GCTGACTTTG AAAATAAAAT TGCTTCGTCA ACTTCACTTT CAATTTCGCC TGAAGGACAA
AAGGAATATC TGAAGTTGAG ATCGCAATTC TTATCTAGAG GTGGATCTGC TTTTGAGGAA
GATATATCTA TTCTTCTTAA CGAGAAAGAC TCATTATTGG CTGCTATCAC CGGTTTGGAG
AACCAAAGAG CTAATTCCGC AACTAGAATT AACGAGTTAC AGTCGACTAT AAATTCAGAA
TTGAAATCCA GTCTTGCGGA CATCAACACT GAAATAAACG ATGTTTTATC TAGAAAGCAA
GAGAAAGTCG ATGCCAGAAG TGCGTTGATC AAGTCCAAAG AAGAATTCCA ATACCAAGAA
CTACAATTGA AATCTCAATT GAGAGACGTC TTGATCAAGT TGGATGAAAT ATCATCGCAA
CAAAGAGAAT CCAACAAGCA AAAGAAGTTA AGAGAGAATG TAGCCATGTT GAAGAGGCTT
TTTCCTGAAG GAGCAATCAA AGGAATAGTG TACGAGTTGG TACGCCCTTC AGAACAGAAG
TTCGAAAGTG CTGTTCAAAC GGTTCTAGGC AGAAACATTG ACAGCGTTAT CGTTCAAACA
ACCAGCGTGG CATACAAGTG TATTGAAATT TTAAAGGAAA GAAGAGCTGG TGTAGTTACG
TTTATTCCGT TGGATTCCAT TCAAAGCGAG CCGATAAATT TGAACTATTT GAGATCTATC
CATGAATCTG CCCAACCGGG TATTGATATC CTTAAATACG ACGACAAGTC TTTGGAGCAA
GCAATTAACT ACATTGCTGG CGATGCTTTA GTTGTCAAAG ATATTAATCT CGCCAGAAAC
TTGAAGTGGG ACTCGCACCA CAAGTTGGAA AACAGAATCA TATCTCTCAA TGGGTCCGTC
ATTCACAAGT CTGGTTTAAT GACAGGAGGA CAACAGAGCC AGAAGAGTAG CGCCTCTTTG
ACCTGGGATA GAGAAGAATG GATATCATTG AACTCAGTAA AGGACGAACT AACTACAAGA
CTCTCTACAT TGCAAGAGAA CAAACCAAAG GAACTTGAAA TCAATCTCTT GGCTGACGAA
ATCAGCCTGT TGGACGATGG ATTACCTGTG CTAAGAAACC AAAAGATGAG CACCGAACGT
ACAATTAGAG ACAGAGAAGC CGAAGTGAAG TTCCAGACAG AACTCCAAAA GAGTTTTGAT
GATTCTATTA ATAGCAAGAA AGCAAAGTTA GTAAAACTCG ATCAGAAGAC TGATGAGATT
CGCAACAAAA CAGCTTCTTT GAAGAATGAG ATTTATTCCG AGTTCTGTTA TGATTATGGA
TTTTCAAATG GTATCGATGA TTATGAAAAC TTGCATGGCG CAACATTAAG AGTAAGAGTC
AAGGAAAGGG CTCAATATTC GAAGGCCATA GCCACTTTTA GTAACAAGTT AAAGTTCGAG
AATGAAAGAG TGAACGAAAC TATACAAAGA GAAGAATCCC TTAAATCACA ATTATTGGAA
CTAGAGGAAA ACACTCTGAC TGTAATGTCT GAAATTCAGC TTGTTGAAAG TAAGATAGAT
AATCTCGAGG CAGAACTTGA GGTATTGGAA CACGAGCAAC TGAATCAAAA CAAGGAGCTT
CAATCAAACT TGAAGAAATC TAAAACCTTT GAAGCGTCCG TCGCGGAGTT GGAATCCAAT
ATATCTACAT TGAACAAAGG GATACTTTCT TTAGAGGAGC AGTTGTTGAA GATTGATACT
GAGAGAGTTA ACATCTTGAA GAATTGTAAG ATAGAAAATG TCAATATTCC GTTGAAAGAT
GGGCTCTTGG ATTCTATTTC GATTGGTGAG ACTTCGGATA ACTTGGTGAA GGAGATCTAC
GATATCGAGA TAGATTATTC CAATTTGGAT GAATCATTAC GAAGAACATA TAGTGCCAAA
CTAGAAGCCG AACTTCAAAC TAAGTTGGAA GAGATTATCG AGCAATTGGA GAGATTGACA
CCAAATGCAA AAGCAGTGGA TAGATTGAAG GAAGCTGAAG CGAAACTTAG AAATTTTGAT
AAAGAGCATA CACTTGCAAG ACAAAAGGAA CGTAAAGTGT ATGACAAATT CCAGGAGGTT
CGTGAAAAGA GATACCAGAC TTTTATGGAA GCATTCAATC ATATTTCTTC CAAAATAGAT
TCGATCTACA AAGAGCTCAC TAAGTTCCCT GCTTCTCCTT TGGGTGGTGC TGCCTATTTG
ACATTAGAGG ATGATGAATA TCCATACAAT TCTGGTATCA AATACCACGC TATGCCACCT
ATGAAAAGAT TCAGAGACAT GGAATTACTT TCAGGTGGTG AAAAGACGAT GGCTGCACTT
GCATTACTTT TTGCTATTCA TTCATATCAA CCATCTCCCT TTTTTGTACT TGATGAAGTT
GATGCTGCCC TTGATAATGC TAATGTTAGC AAAATTGCTA ACTATATTAG GAAATATGCT
GGACCTAACT ACCAGTTCAT TGTTATATCT TTGAAGAACT CGTTGTTTGA AAAGTCAGAC
GCTTTAGTGG GTATATATAG AGACCAGAGA CAGAATAGCT CATCGACACT TACATTGGAC
TTGACCGAGT ACTCTGAAGA AGGTTTATCA GTATCCGGCC AAGCAGTTAC TGCTTCTGGC
TAG
 
Protein sequence
MGRLIGLELH NFKSYRGTTK IGFGSSFFTS IIGPNGAGKS NLMDAISFVL GVRSSHLRSQ 
NLKDLIYRGR RTNGNSDLSV DELEQDPNRA HVTAIYEKDD GEIVKFKRTI SSSGNSEYRV
NDVSVTSLNY SLVLKAENIL IKARNFLVFQ GDVEQIASQS PTDLTKLIEN ISGSNEFTKE
YESLKEEYER AREFSNSVFS RKRNLNSESR QYKEQLIEQR QFEERLMEKN ETIKRINLYH
IYHNERKHFQ IQEEIDAKTA ELKELKKGLS SEQKQFKTIS ADYSKKVLES KKHTKKLEQV
ATQIESAKRD LIPMQANKRA MTSKANSTKT KIEDLQADLK RQKASASSVQ KQLDQAQKLF
ADFENKIASS TSLSISPEGQ KEYSKLRSQF LSRGGSAFEE DISILLNEKD SLLAAITGLE
NQRANSATRI NELQSTINSE LKSSLADINT EINDVLSRKQ EKVDARSALI KSKEEFQYQE
LQLKSQLRDV LIKLDEISSQ QRESNKQKKL RENVAMLKRL FPEGAIKGIV YELVRPSEQK
FESAVQTVLG RNIDSVIVQT TSVAYKCIEI LKERRAGVVT FIPLDSIQSE PINLNYLRSI
HESAQPGIDI LKYDDKSLEQ AINYIAGDAL VVKDINLARN LKWDSHHKLE NRIISLNGSV
IHKSGLMTGG QQSQKSSASL TWDREEWISL NSVKDELTTR LSTLQENKPK ELEINLLADE
ISSLDDGLPV LRNQKMSTER TIRDREAEVK FQTELQKSFD DSINSKKAKL VKLDQKTDEI
RNKTASLKNE IYSEFCYDYG FSNGIDDYEN LHGATLRVRV KERAQYSKAI ATFSNKLKFE
NERVNETIQR EESLKSQLLE LEENTSTVMS EIQLVESKID NLEAELEVLE HEQSNQNKEL
QSNLKKSKTF EASVAELESN ISTLNKGILS LEEQLLKIDT ERVNILKNCK IENVNIPLKD
GLLDSISIGE TSDNLVKEIY DIEIDYSNLD ESLRRTYSAK LEAELQTKLE EIIEQLERLT
PNAKAVDRLK EAEAKLRNFD KEHTLARQKE RKVYDKFQEV REKRYQTFME AFNHISSKID
SIYKELTKFP ASPLGGAAYL TLEDDEYPYN SGIKYHAMPP MKRFRDMELL SGGEKTMAAL
ALLFAIHSYQ PSPFFVLDEV DAALDNANVS KIANYIRKYA GPNYQFIVIS LKNSLFEKSD
ALVGIYRDQR QNSSSTLTLD LTEYSEEGLS VSGQAVTASG