Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_64147 |
Symbol | SMC1 |
ID | 4841167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 661198 |
End bp | 664920 |
Gene Length | 3723 bp |
Protein Length | 1240 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640392482 |
Product | Structural maintenance of chromosome protein 1 (sister chromatid cohesion complex Cohesin, subunit SMC1) |
Protein accession | XP_001386531 |
Protein GI | 150866808 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG1196] Chromosome segregation ATPases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.471232 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAGGC TAATCGGCCT TGAGTTACAC AACTTTAAGT CGTACCGAGG AACAACAAAA ATCGGATTTG GTAGCTCGTT TTTCACCTCC ATCATTGGCC CAAATGGTGC TGGAAAATCA AATTTGATGG ATGCAATTTC GTTTGTATTA GGAGTCAGAT CCTCTCATTT GAGATCGCAA AACTTGAAAG ACTTGATTTA CCGAGGAAGA AGAACAAACG GCAACAGCGA TCTTTCTGTA GACGAATTGG AACAGGATCC TAATAGAGCC CATGTGACTG CTATCTATGA AAAAGACGAT GGAGAAATCG TCAAGTTCAA GAGAACAATT TCGTCCTCAG GAAACAGCGA ATACCGCGTC AACGACGTGT CTGTCACGTC ATTAAACTAT TCATTGGTCT TAAAAGCTGA AAACATTCTT ATCAAAGCGC GCAACTTCTT GGTATTCCAA GGCGATGTAG AACAGATAGC ATCGCAGTCG CCAACTGATC TCACAAAACT CATTGAGAAT ATCAGTGGTT CCAACGAATT TACCAAAGAA TACGAAAGCT TGAAGGAAGA ATATGAAAGA GCCAGAGAGT TCTCGAATTC CGTATTCTCA CGTAAAAGAA ACTTAAACTC TGAATCAAGA CAGTATAAGG AGCAGTTGAT AGAGCAAAGG CAGTTCGAAG AAAGATTAAT GGAAAAAAAT GAGACCATAA AGAGAATCAA CTTATACCAT ATTTATCACA ATGAGCGCAA GCACTTCCAA ATTCAAGAAG AAATTGATGC CAAGACTGCT GAACTCAAAG AATTGAAAAA AGGTCTCTCT TCCGAACAGA AGCAGTTCAA AACGATCTCT GCCGATTATT CCAAGAAGGT ATTGGAATCC AAAAAGCACA CTAAGAAATT GGAACAGGTT GCAACGCAAA TTGAAAGCGC AAAGAGAGAT CTCATACCAA TGCAAGCTAA CAAGAGAGCA ATGACTTCGA AGGCGAACTC AACGAAGACT AAAATTGAAG ATTTGCAAGC AGATTTGAAA AGACAGAAGG CGCTGGCAAG CTCAGTTCAA AAACAATTAG ACCAAGCTCA GAAGTTATTT GCTGACTTTG AAAATAAAAT TGCTTCGTCA ACTTCACTTT CAATTTCGCC TGAAGGACAA AAGGAATATC TGAAGTTGAG ATCGCAATTC TTATCTAGAG GTGGATCTGC TTTTGAGGAA GATATATCTA TTCTTCTTAA CGAGAAAGAC TCATTATTGG CTGCTATCAC CGGTTTGGAG AACCAAAGAG CTAATTCCGC AACTAGAATT AACGAGTTAC AGTCGACTAT AAATTCAGAA TTGAAATCCA GTCTTGCGGA CATCAACACT GAAATAAACG ATGTTTTATC TAGAAAGCAA GAGAAAGTCG ATGCCAGAAG TGCGTTGATC AAGTCCAAAG AAGAATTCCA ATACCAAGAA CTACAATTGA AATCTCAATT GAGAGACGTC TTGATCAAGT TGGATGAAAT ATCATCGCAA CAAAGAGAAT CCAACAAGCA AAAGAAGTTA AGAGAGAATG TAGCCATGTT GAAGAGGCTT TTTCCTGAAG GAGCAATCAA AGGAATAGTG TACGAGTTGG TACGCCCTTC AGAACAGAAG TTCGAAAGTG CTGTTCAAAC GGTTCTAGGC AGAAACATTG ACAGCGTTAT CGTTCAAACA ACCAGCGTGG CATACAAGTG TATTGAAATT TTAAAGGAAA GAAGAGCTGG TGTAGTTACG TTTATTCCGT TGGATTCCAT TCAAAGCGAG CCGATAAATT TGAACTATTT GAGATCTATC CATGAATCTG CCCAACCGGG TATTGATATC CTTAAATACG ACGACAAGTC TTTGGAGCAA GCAATTAACT ACATTGCTGG CGATGCTTTA GTTGTCAAAG ATATTAATCT CGCCAGAAAC TTGAAGTGGG ACTCGCACCA CAAGTTGGAA AACAGAATCA TATCTCTCAA TGGGTCCGTC ATTCACAAGT CTGGTTTAAT GACAGGAGGA CAACAGAGCC AGAAGAGTAG CGCCTCTTTG ACCTGGGATA GAGAAGAATG GATATCATTG AACTCAGTAA AGGACGAACT AACTACAAGA CTCTCTACAT TGCAAGAGAA CAAACCAAAG GAACTTGAAA TCAATCTCTT GGCTGACGAA ATCAGCCTGT TGGACGATGG ATTACCTGTG CTAAGAAACC AAAAGATGAG CACCGAACGT ACAATTAGAG ACAGAGAAGC CGAAGTGAAG TTCCAGACAG AACTCCAAAA GAGTTTTGAT GATTCTATTA ATAGCAAGAA AGCAAAGTTA GTAAAACTCG ATCAGAAGAC TGATGAGATT CGCAACAAAA CAGCTTCTTT GAAGAATGAG ATTTATTCCG AGTTCTGTTA TGATTATGGA TTTTCAAATG GTATCGATGA TTATGAAAAC TTGCATGGCG CAACATTAAG AGTAAGAGTC AAGGAAAGGG CTCAATATTC GAAGGCCATA GCCACTTTTA GTAACAAGTT AAAGTTCGAG AATGAAAGAG TGAACGAAAC TATACAAAGA GAAGAATCCC TTAAATCACA ATTATTGGAA CTAGAGGAAA ACACTCTGAC TGTAATGTCT GAAATTCAGC TTGTTGAAAG TAAGATAGAT AATCTCGAGG CAGAACTTGA GGTATTGGAA CACGAGCAAC TGAATCAAAA CAAGGAGCTT CAATCAAACT TGAAGAAATC TAAAACCTTT GAAGCGTCCG TCGCGGAGTT GGAATCCAAT ATATCTACAT TGAACAAAGG GATACTTTCT TTAGAGGAGC AGTTGTTGAA GATTGATACT GAGAGAGTTA ACATCTTGAA GAATTGTAAG ATAGAAAATG TCAATATTCC GTTGAAAGAT GGGCTCTTGG ATTCTATTTC GATTGGTGAG ACTTCGGATA ACTTGGTGAA GGAGATCTAC GATATCGAGA TAGATTATTC CAATTTGGAT GAATCATTAC GAAGAACATA TAGTGCCAAA CTAGAAGCCG AACTTCAAAC TAAGTTGGAA GAGATTATCG AGCAATTGGA GAGATTGACA CCAAATGCAA AAGCAGTGGA TAGATTGAAG GAAGCTGAAG CGAAACTTAG AAATTTTGAT AAAGAGCATA CACTTGCAAG ACAAAAGGAA CGTAAAGTGT ATGACAAATT CCAGGAGGTT CGTGAAAAGA GATACCAGAC TTTTATGGAA GCATTCAATC ATATTTCTTC CAAAATAGAT TCGATCTACA AAGAGCTCAC TAAGTTCCCT GCTTCTCCTT TGGGTGGTGC TGCCTATTTG ACATTAGAGG ATGATGAATA TCCATACAAT TCTGGTATCA AATACCACGC TATGCCACCT ATGAAAAGAT TCAGAGACAT GGAATTACTT TCAGGTGGTG AAAAGACGAT GGCTGCACTT GCATTACTTT TTGCTATTCA TTCATATCAA CCATCTCCCT TTTTTGTACT TGATGAAGTT GATGCTGCCC TTGATAATGC TAATGTTAGC AAAATTGCTA ACTATATTAG GAAATATGCT GGACCTAACT ACCAGTTCAT TGTTATATCT TTGAAGAACT CGTTGTTTGA AAAGTCAGAC GCTTTAGTGG GTATATATAG AGACCAGAGA CAGAATAGCT CATCGACACT TACATTGGAC TTGACCGAGT ACTCTGAAGA AGGTTTATCA GTATCCGGCC AAGCAGTTAC TGCTTCTGGC TAG
|
Protein sequence | MGRLIGLELH NFKSYRGTTK IGFGSSFFTS IIGPNGAGKS NLMDAISFVL GVRSSHLRSQ NLKDLIYRGR RTNGNSDLSV DELEQDPNRA HVTAIYEKDD GEIVKFKRTI SSSGNSEYRV NDVSVTSLNY SLVLKAENIL IKARNFLVFQ GDVEQIASQS PTDLTKLIEN ISGSNEFTKE YESLKEEYER AREFSNSVFS RKRNLNSESR QYKEQLIEQR QFEERLMEKN ETIKRINLYH IYHNERKHFQ IQEEIDAKTA ELKELKKGLS SEQKQFKTIS ADYSKKVLES KKHTKKLEQV ATQIESAKRD LIPMQANKRA MTSKANSTKT KIEDLQADLK RQKASASSVQ KQLDQAQKLF ADFENKIASS TSLSISPEGQ KEYSKLRSQF LSRGGSAFEE DISILLNEKD SLLAAITGLE NQRANSATRI NELQSTINSE LKSSLADINT EINDVLSRKQ EKVDARSALI KSKEEFQYQE LQLKSQLRDV LIKLDEISSQ QRESNKQKKL RENVAMLKRL FPEGAIKGIV YELVRPSEQK FESAVQTVLG RNIDSVIVQT TSVAYKCIEI LKERRAGVVT FIPLDSIQSE PINLNYLRSI HESAQPGIDI LKYDDKSLEQ AINYIAGDAL VVKDINLARN LKWDSHHKLE NRIISLNGSV IHKSGLMTGG QQSQKSSASL TWDREEWISL NSVKDELTTR LSTLQENKPK ELEINLLADE ISSLDDGLPV LRNQKMSTER TIRDREAEVK FQTELQKSFD DSINSKKAKL VKLDQKTDEI RNKTASLKNE IYSEFCYDYG FSNGIDDYEN LHGATLRVRV KERAQYSKAI ATFSNKLKFE NERVNETIQR EESLKSQLLE LEENTSTVMS EIQLVESKID NLEAELEVLE HEQSNQNKEL QSNLKKSKTF EASVAELESN ISTLNKGILS LEEQLLKIDT ERVNILKNCK IENVNIPLKD GLLDSISIGE TSDNLVKEIY DIEIDYSNLD ESLRRTYSAK LEAELQTKLE EIIEQLERLT PNAKAVDRLK EAEAKLRNFD KEHTLARQKE RKVYDKFQEV REKRYQTFME AFNHISSKID SIYKELTKFP ASPLGGAAYL TLEDDEYPYN SGIKYHAMPP MKRFRDMELL SGGEKTMAAL ALLFAIHSYQ PSPFFVLDEV DAALDNANVS KIANYIRKYA GPNYQFIVIS LKNSLFEKSD ALVGIYRDQR QNSSSTLTLD LTEYSEEGLS VSGQAVTASG
|
| |