Gene Information |
Locus tag | PICST_47485 |
Symbol | MSI1 |
ID | 4839664 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 1255271 |
End bp | 1256560 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640390979 |
Product | chromatin assembly complex, subunit 3 |
Protein accession | XP_001385247 |
Protein GI | 150865861 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
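The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, that the CDS is unspliced (as stated in the record), and that it ends with a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a region given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the record above.
gene_length = span_length(1255271, 1256560)
protein_length = protein_length_from_cds(gene_length)
print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1290 bp) and Protein Length (429 aa) fields.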
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.884795 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.485977 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCCAG AAGATAGCGA ACAGACTTAT ATTGACGATT TTACCCAGAG AAATTACAGA ATATGGAAGA AGAACACACC TTTTCTCTAC GACTACCTTC TGACAAATTC ACTCTTGTGG CCGTCGCTAA CAGTTCAGTT CTTTCCTGAT CGGACTGATG GACAAATAGA AAGCGGAACT TCTAAAACTT CATCTGAAGA TTCTGATAAT ATATATTTCC AAAGACTACT TCATGGTACA TTTAGTTTGG GCCTGTCAGT GGATAGTATC CAGATTCTCC AGGTTCCTGT TTTTGCTGAC TTGAATCGCA ATCTCCGTAT TGACCGACTT GATTTCAATC TGGAAAAGCA GGAATTCGAA TTAGCTACTT CTGTCAACAA TAAATTCAAG GTGCTTCAAA AGATAAACCA TATGGGAGAT GTTAACAAGG TGAGGTATAT GCCCCAGAAA CCAAACATCA TCGCCAGCGC CAACAATATG GGCGACTTGG CGATATACGA AAGAACAAAA CACAAAAGCT TCAAGAACTC GCTCATAGAC GATACCGACC TAAATAAGGT CCAGGTATAT CTCAAGAACA GTAACTCCGC AGACGTAGAA GGTACCGATA TCTTTGCTAT CGATTGGAAC AAACAAAAGG AAGGTACTAT TGTATCAGCC AGTATGAACG GCGAGATAAA TCTATATGAC ATTCGAAGCA ATTTTGTAAA GGATAAGTCT GTTGTTAATG AATCCTGGTA CTACCACAAT GAGAGCAGTA CAGGTGTCAA CGATATCGAA TGGCTCCCTC AACATGACTC CCTATTTAGT GCTGTAGATG ATGCCGGTTT CATTTCTTTG TTTGACACGA GAGAAGAAAG CAAACTAGTT CACCGTTACA GACTGTCTGA AGTTGGAGTT AACAGTATCA GTGTCAACCC TGGAATTTCT CATTGCATAG CTACTGGTGA TAGCAACGGT CTGATCCACG TCTACGATAT AAGAGGTATT GGAAGCGAAA TGAACCCTAT CTACTCGATT CAAGAACAAA CTGAATCTAT CACACAGCTT AAATGGCATC CACGGTACCA TAATGTGTTG GGTTCGTCTT CCACAGATCA TCTGGTAAAA TTGTTTGATT TGGAAAACTC TAGTTCTCTT TTGTTTGCAC ATGCTGGCCA TATGTTAGGA GTAAACGACT TTGACTGGTC TCACCATGAT GACTGGATGG TAGCCAGTGT TTCTGATGAT AACTCCTTGC ATGTATGGAA ACCATCGCAC ACGATCACAA GAAAGTATAA CAGTAGATAA
|
Protein sequence | MSPEDSEQTY IDDFTQRNYR IWKKNTPFLY DYLSTNSLLW PSLTVQFFPD RTDGQIESGT SKTSSEDSDN IYFQRLLHGT FSLGSSVDSI QILQVPVFAD LNRNLRIDRL DFNSEKQEFE LATSVNNKFK VLQKINHMGD VNKVRYMPQK PNIIASANNM GDLAIYERTK HKSFKNSLID DTDLNKVQVY LKNSNSADVE GTDIFAIDWN KQKEGTIVSA SMNGEINLYD IRSNFVKDKS VVNESWYYHN ESSTGVNDIE WLPQHDSLFS AVDDAGFISL FDTREESKLV HRYRSSEVGV NSISVNPGIS HCIATGDSNG SIHVYDIRGI GSEMNPIYSI QEQTESITQL KWHPRYHNVL GSSSTDHSVK LFDLENSSSL LFAHAGHMLG VNDFDWSHHD DWMVASVSDD NSLHVWKPSH TITRKYNSR
|
| |
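The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on a 30-nucleotide fragment copied from the gene sequence above (positions 91 to 120, codons 31 to 40). It is a minimal, dependency-free illustration, not the method used to produce the record: the codon table is built from the standard code with the single CTG change, alternative start codons are ignored, and all function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, amino acids listed in TCAG codon order (TTT, TTC, TTA, ...).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg_as_serine: bool) -> dict:
    """Return a codon -> amino acid map; optionally read CTG as Ser (table 12)."""
    codons = ("".join(triplet) for triplet in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(nucleotides: str, table: dict) -> str:
    """Translate in frame 1, ignoring spaces and any trailing partial codon."""
    seq = nucleotides.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


# Fragment copied from the gene sequence above, keeping its 10-base grouping.
fragment = "GACTACCTTC TGACAAATTC ACTCTTGTGG"

print(translate(fragment, codon_table(ctg_as_serine=False)))  # standard code
print(translate(fragment, codon_table(ctg_as_serine=True)))   # table 12
```

With the standard code the fragment translates to DYLLTNSLLW; with CTG read as serine it gives DYLSTNSLLW, which matches residues 31 to 40 of the protein sequence in the record.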