Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_30460 |
Symbol | SMC5 |
ID | 4837901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 118225 |
End bp | 121506 |
Gene Length | 3282 bp |
Protein Length | 1093 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640389216 |
Product | structural maintenance of chromosomes protein |
Protein accession | XP_001383656 |
Protein GI | 150864715 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG1196] Chromosome segregation ATPases |
TIGRFAM ID | [TIGR03185] DNA sulfur modification protein DndD |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.157057 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGACA TATACCAAGC CATAGGCGAT TTGAGACAAT ATGTTGGTCC CAATGATTCT GAATCTGTGG CTAAGCGTAG GAAAGTACAG AGTTCTCGTG ATTTCCGTCC TGGAAGTCTT ATGAAGTTGA AGTTGACCAA TTTCAACAAC TACGGTAGTG GAGAGTTCAA TTTGTCTCCT TCACTCAACA TGGTTATTGG TCCTAATGGA TCTGGTAAAA GTACTGTAGT TTCTGCGATC TGTTTGGGGT TGGGTGGAAA GATTGATCTT ATTAAAAGAC AGACTCTTTC TTCCATGATT AAGAAAGGAA AGTCCACAGC TTCAACTGAA GTTACAATCA AGAATTTTGA TGGTCAACCT CCTATTTTAG TTAAAAGAGA GTTCACGGCA AAAGAAAACC GTTGGTATAT AAACCACAGG CCGGCCACAG AAGCCAAAGT TAAAGAGTTA AGAGCTAGGT TTAACATCCA GTTGGACAAT TTATGTCACT TTTTGCCGCA AGAGAGGGTA GCTGAATTTG CTGGGATGTC ACAAGAGAAG CTACTTATGG AAACAGAAAG AACTTTAGGT GATGGCCAGT TGTACCGTCT TCATGAGGAT TTGATCAAGA ACGATACTCT GAGACAAGAT GTGACTACAA GAATAGAAGA ACTTGAAGAA AAGTTACTGA AATTCAACGA AGAAAGAAGT AGACTAGAAG CAGATATAAA GAAGCTTGAA GAGTATGAAG GAAAGACTCT AGAGATAGAG CAACACACAA AGATAATTCC GTACGCCCAA CTATCTGATC TCAAAAAACA GAGAGCAGAT TTAAAACGAG AGAGAGACAA AGCAAAATCT AAATTGTCGA AGTTTTTATC TTCCATGGAC CCTTTGAAAG ACCAGCATAA AGAAATAGAA ACCAAAGTTG AAATGGAAAA AGGACTATAT TCCGATATAG ATGACAAGCA AAAGGAAATA CGGTCTAGGT TTATCAATAG AAAGGCAGAT TTATCGAAAA TTAAGGAGGA AATCGGAGGC TTGAAGTCAA CGGTTGAGTC TTTGAAGAGT AAATCGATCA AATTGCAGAA TCAATTAAAG AAACTTGAAG AAAAAAGGCA CGAATTGATT TCACAACGTG ACTTGATAGT ATTACCTGAC AAAGATGAGG TTGAAGGTTA CAGAAAATTG CGAAGGGAAG TGTCTGAAAA GAAGGATGAA ATAGGAAGCA AGATTGAAGA CTTGGAGGAC AAAATTCAAG AAAAGCAATC GTCACGAAAA GAAATCATGA ACAATAAGAA GCGAGTAGAG CAGAGTTTAA ACAGTAAAGA TAGACTAATG GTCCTTTCTC CAAGAGGGGG GCCGCCAAAC TCTTTGAGGG ATGGGGCATA TAATGCACAC AAGTTTCTCA GAGATGAAGC TCAGTTAAAG GACCATTATT TTGAGTCGCC TGTGGTTTGT TGTACTGTAA CAAACAAAAC AATGGCCCCA CACTTGGAAA AGGTGATTGA CAACAATACG CTCTTTTCAA TTACCACCAC TAACAAACAA GATTTCAGCA TGATCTCTTC TTTTCAAAGG AAGATGAAGA TAAATTTCCC AATTAGATTA ACAACCAACC TGGGTACACG AAATCCACGT ATTCCCAAAG AAAGATTAAA GCAATGGGGG TTCGAATGCT ACTTGTCAGA TTTTCTTTCG GGTCCAGGGC CAGTGGTAGA TATGATATAT GATATATCAA AGATTCAGGA TATTCCCGTG AGTAGAAGTG GATTATCAGA AGAGCAGATC GAACGATTGA CAATGCTAGA TGGAAACGGG AGATACCCTT TCAAGAAGTT TATTTCTCAT GATACTCTAT TTGTGTTGAC TAAATCGAAC TATGGTCTGA ATCAAGTTTC TTATACTACG GAAAAGGTTA CTGGATCTAG ATGGTTCGAT TCATCAGGAT TGACTCAAGA AGCTAAAGAT TTCATGAATG GTCAGTTACA GGAATTTAAG GACAGGTACA ACGTTTTAAA AGGCGAAGAA GATGGATATC TTGTTGAGAA GCAAAGTCTC GATTCAGAAA GTCGTAAACT TTCGGCTGAG CTTGAGAAAT ATAAAAATAA AATTCAGCAT TTCACCAATG AAACGAAAAA CAGGGCAAAA ATAGAAGGAA AGTTAACGGC TCTTGACGCG CAAATCAAAA AGACAACAAA GGAATCTACA GAAGACACAA GTGAACAAGT TGACGAAACC GAAGAGAAGA TCAAATCAAA GTATTTGGAT TATTCAAACA AGTTATCGGA GCTTAGTATC ATTGGTAAGG AATCTAGCGA TGTTGCTATT GAGCTAAGTT TGCAATCATT TAGAGTGCTT CAAATCAGAA ACAGAGAGAT CGCGGCCCGG AATCTAATTG CTAAAGTAGA AGAACAACAA GTTTCATTAA GAAAGGAGTA TGAAAGATTG AAAGCAGAAT ATGACCAAAT TAAGAAGGGG GATGCTGTTA AAAAGATCGA AGAACAGAGT GCTTCGTACA CTCCGGAAGA AAGAGTACTA TTGTCTAGGC TTGCCAAGGC ATATATGGAT GCTGGTAACT TCTCAGAACA GGTGATAAGA GACAAAATAC TGCTTTTAGA AGACGAGCGG TCTGTGATGG CTACTGCTGA TGTGAGTTCC ATCGAAAGAT TACGGAGGAC CTTGACTGAA ATTGATTCAC TCGAGAAGAC TCTTCCAAGG TTGAAGGACG ACAAGTCTAA ATTGGACAAA AGAATTAGTG ATATTCAAGA AGCATGGGAA CCAGAGCTAA CTAAGGCAAT TCGGAATATT TCATTAGCAT TTAACAAGCG ATTCTCCAGA GTCGCAAGCG ATGGACAAGT AGAATTAGCC AAAGCAGAAA GATTCAAAGA TTGGAAATTG CAAATTCTAG TTAAATTTAG ACAAGAGTCA GAGCTAAAGG TTTTGGACCA CCAATCACAA TCTGGAGGGG AAAGAGCAGT AACAACTATC TTTTTCATGA TGTCTTTGCT GGGATTAACA AATTCTCCAT TTAGAGTTGT GGATGAAATC AATCAAGGGA TGGATCGAAA GAATGAAAAG ATGGCTCACA GGTACTTGGT GGACACGGCC TGCCATAGTT TGAGTTCACA GTATTTCTTG GTCACTCCCA AGCTTTTGAC AGGTCTCTAT TATCATCCTG AAATGGCAGT TCATTGCATT TATTCTGGTC CTTTGGTCGA CGGAACAGAT AGAGGAAATA AGGAACCTGA TTTCATGGAC TTCAAAGCAA ATTCATTAGC GCAGAAATCT ATATATACAT GA
|
Protein sequence | MTDIYQAIGD LRQYVGPNDS ESVAKRRKVQ SSRDFRPGSL MKLKLTNFNN YGSGEFNLSP SLNMVIGPNG SGKSTVVSAI CLGLGGKIDL IKRQTLSSMI KKGKSTASTE VTIKNFDGQP PILVKREFTA KENRWYINHR PATEAKVKEL RARFNIQLDN LCHFLPQERV AEFAGMSQEK LLMETERTLG DGQLYRLHED LIKNDTSRQD VTTRIEELEE KLSKFNEERS RLEADIKKLE EYEGKTLEIE QHTKIIPYAQ LSDLKKQRAD LKRERDKAKS KLSKFLSSMD PLKDQHKEIE TKVEMEKGLY SDIDDKQKEI RSRFINRKAD LSKIKEEIGG LKSTVESLKS KSIKLQNQLK KLEEKRHELI SQRDLIVLPD KDEVEGYRKL RREVSEKKDE IGSKIEDLED KIQEKQSSRK EIMNNKKRVE QSLNSKDRLM VLSPRGGPPN SLRDGAYNAH KFLRDEAQLK DHYFESPVVC CTVTNKTMAP HLEKVIDNNT LFSITTTNKQ DFSMISSFQR KMKINFPIRL TTNSGTRNPR IPKERLKQWG FECYLSDFLS GPGPVVDMIY DISKIQDIPV SRSGLSEEQI ERLTMLDGNG RYPFKKFISH DTLFVLTKSN YGSNQVSYTT EKVTGSRWFD SSGLTQEAKD FMNGQLQEFK DRYNVLKGEE DGYLVEKQSL DSESRKLSAE LEKYKNKIQH FTNETKNRAK IEGKLTALDA QIKKTTKEST EDTSEQVDET EEKIKSKYLD YSNKLSELSI IGKESSDVAI ELSLQSFRVL QIRNREIAAR NLIAKVEEQQ VSLRKEYERL KAEYDQIKKG DAVKKIEEQS ASYTPEERVL LSRLAKAYMD AGNFSEQVIR DKISLLEDER SVMATADVSS IERLRRTLTE IDSLEKTLPR LKDDKSKLDK RISDIQEAWE PELTKAIRNI SLAFNKRFSR VASDGQVELA KAERFKDWKL QILVKFRQES ELKVLDHQSQ SGGERAVTTI FFMMSLSGLT NSPFRVVDEI NQGMDRKNEK MAHRYLVDTA CHSLSSQYFL VTPKLLTGLY YHPEMAVHCI YSGPLVDGTD RGNKEPDFMD FKANSLAQKS IYT
|
| |