Gene Information
Locus tag | PICST_67950 |
Symbol | MCM1 |
ID | 4839352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 1545473 |
End bp | 1547016 |
Gene Length | 1544 bp |
Protein Length | 243 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640390667 |
Product | transcription factor of the MADS box family |
Protein accession | XP_001385295 |
Protein GI | 126137543 |
COG category | [K] Transcription |
COG ID | [COG5068] Regulator of arginine metabolism and related MADS box-containing transcription factors |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0903461 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence
Gene sequence | CGCTTATATA ACACAGCGAC TCAATTACAA GTAACTAGTA AGTATGCCGT GGCTGATGGC GAGCTCGTGG CCACACGGAT GTGGCCGGCA GCATTAACAG CATAGAACCG TCGAATTGAT ACAATCGGTG TGACCGGTAG CACTGGGTTT ACGGGAGGTT TGTCCATATT ATTATCACGA AGCCCTGGAA TGTTGCATTC TCAGCGAATT GTCCCTTTGC ACTGGAGCGT AGCGATGGAT CACATCGTAT TATCGGATGC AAGGTTGAGA CGAAGCTGAT GATGGCGTAG CTCACGTTGA TATTGGAATG GGATTGATAG CGACATTGAT GCTGTAGTAA TTGCTATTTC AGGGTCCGAT AACGACGATA AACAAATACA GATCAATTTC TGGAGTCCAT ACCAATATTT CTATAGCAAA ACATAGTGAC TATCATCATG TCGTTACATC ATGCTTCGTA ACGTTGCTAT CATTTTTCAT TCGTGTCATC TATCATTTCT GTTATCATTC TAATGCTGTT GTTGCAATCA TATACAAGTA AATACTAACA TTATTACCAG AATGTCCGAA GTAACCGAAG CCAAGTTGGA GAACTCCGAC CAACAATTCG AAAACATCGG TGCTAACGGC GCTGCCAACA AGGGCGGTGC CGATGTTGCT GGCGACGATG ACGACGACGA CGAAGCCGGC GGTGGTGGAA AGGCCCAGAA GGAAAGAAGA AAGATCGAAA TCAAATTCAT CCAGGACAAG TCTAGAAGAC ACATCACCTT TTCCAAGAGA AAGGCCGGAA TCATGAAGAA AGCGTACGAG TTGTCGGTGT TGACAGGAAC CCAAGTATTA TTGTTGGTTG TGTCTGAAAC CGGTTTGGTT TACACCTTTA CCACCCCTAA GTTGCAGCCA TTGGTGACCA AGTCTGAAGG TAAAAACTTG ATTCAAGCAT GCTTGAATGC TCCTGAAGAA GGTTTGGGCG ACGACCAAGG TGACCAGTCA GATGGAAACT CCGGCGAGTC GCCAGATCAA AGCCCGGCTC CTCCTCAGCA ACAACCCCCA CAACACCAGC AAGTACAACA GCAACAGGCC ATTGCTCACC AGCAACAGGT GCGCCATCAG CAACAGCCAC AACAGCAATT GCCTCCTGGT GCACACATAC CCCACGGAAT TCCATATCCT AACGCCGGTC ATCCACAACA GCCAGGCATT CCTATCCCAC CCAATGCCTA CGGAGACAGT TACCAACAGT ACTTCTCCAA CATGCAAAAC AGCAACATGC CCAACCAGCA ACAATATCAA TGATGACTAT TTATTAGACA AACAGTGAAC GAACCCGGAT TCCATTGTAC AGTATTATTA TTATTCTCCT ACCTATTATT ATTTTTGCTT GTTGAGGTGT CAGTTTATGT TACGTTGATT GAGGTGAGTG TGCTGAGTTG TGTCGACGAG AAGAGTTGAT TGTGATTGTT ATTGTATTCT TCCAGCGTAT TTAACAGTAC GTAAATATTT AGTTCTTCTA TTCTTTTTAT AAAACAGTGC TTTCCATTCA TGAT
Protein sequence | MSEVTEAKLE NSDQQFENIG ANGAANKGGA DVAGDDDDDD EAGGGGKAQK ERRKIEIKFI QDKSRRHITF SKRKAGIMKK AYELSVLTGT QVLLLVVSET GLVYTFTTPK LQPLVTKSEG KNLIQACLNA PEEGLGDDQG DQSDGNSGES PDQSPAPPQQ QPPQHQQVQQ QQAIAHQQQV RHQQQPQQQL PPGAHIPHGI PYPNAGHPQQ PGIPIPPNAY GDSYQQYFSN MQNSNMPNQQ QYQ