Gene Information |
Locus tag | PICST_28367 |
Symbol | MLP2 |
ID | 4851144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 1038085 |
End bp | 1039371 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | |
GC content | 40% |
IMG OID | 640392852 |
Product | myosin-like protein |
Protein accession | XP_001387843 |
Protein GI | 126274130 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.525338 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0889168 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGTC TCATAGACGA CTTGGTGAAT ATCTGTTGGA GTCGAATCTC ACGCAGCTCT CTGTCTGATT CAATATTTGT CTCTCAGGTA TTAGGACTTC TTTCCGAGAT AGAATCTACG CTTGGGGTCA ATTCTCTTTT GAAAAATGAA GAGCTCAAAT TATTGAAGCA GATGATTCAG GCGACGCCTC TGATGCGCTT ACACAAGAAA GAGTTCCAAG AGTTTATCAT GCGGTTGGTG AAATATCCCA ACTTTGAAGT CTTTCTCTAC GAGCGATGTA GAATCTCGAT GGACGATTTG CGTAGAATCA TGAATGTTCC GTTTAAAGGC CCAAATCCAC CAACTCTTTC TCCGCTTGCT CCACGAGAAA TCAAAAATAC AGCTAATGAG GCAAGAATAT CACATTCTCG TTACTTTGAT CACAAAGAAA ATGTATCTCC CAACCACACC AACAAACAAT TGAAATCTCC TCCTGAATCA CCCAGCCTTG ACTACAGATA CACTAAATTG CAATCAGAAC TTAATTTCAA AGATGAACAG CTCAGAACGA AAGAAAGCGA GTACACAAGG GCGAACCTAG AGTATAGAAA GCTAGTGGAT ACCAACTCAA CACAACTTAA AAGGATACGA GATCTCGAAA GTGAAGTCAG TTCCATCAAC AAATATGTCC AGTCTTTGGA AGAACAGCTT TCAAGACAGC TGGGAGATAG AAACTCGAAT TCTTTGGCTC TGAAAATTAA AGATAGGGAC AGAACTATAC GTAGTCTTGA GCAATTAAGC AACGAATACA GAAACGAACT CAAGAACTTA GAAGAAGATA AATTGAAATC CGAGAATTCA CTTGCGGAAT TGGTCACTAG TCTACGGGAG CAGGATAACT TGATCAAAAA TCTACAATTG AAGCTTTCGC TAACCGGAGA ATCATTGAAA ATACAGTCTC AGAAAGCAGA CCCGGTTCGA GTTAACTCAC AACTTCAAGA CTTTCTACTA AACTTACCAT TCCTCAAACA GTATTACTAC TTCTACAAGT ATAAAAACAA CACACGCAGA TTGTTTATTG TGAACATGTT TGCGATGATA CTAGCGACCA TCATAGTGTT GCATGTTGCG GAATGTGTAC TATATTTCTC CATCTGGTTT TTCACTTCGA AACCAAACTC TTCCATGTAC TTATATAACA ACTTTGACAA CGAGTGGTAC AGCACCGAAT CCACCTTTGT TTGGTGGAAA GAAATAGAAA CCTTAGAATA CTTCGTGTCC ACGATCAGTG AATGGTTCAC TACATAG
|
Protein sequence | MSSLIDDLVN ICWSRISRSS LSDSIFVSQV LGLLSEIEST LGVNSLLKNE ELKLLKQMIQ ATPLMRLHKK EFQEFIMRLV KYPNFEVFLY ERCRISMDDL RRIMNVPFKG PNPPTLSPLA PREIKNTANE ARISHSRYFD HKENVSPNHT NKQLKSPPES PSLDYRYTKL QSELNFKDEQ LRTKESEYTR ANLEYRKLVD TNSTQLKRIR DLESEVSSIN KYVQSLEEQL SRQLGDRNSN SLALKIKDRD RTIRSLEQLS NEYRNELKNL EEDKLKSENS LAELVTSLRE QDNLIKNLQL KLSLTGESLK IQSQKADPVR VNSQLQDFLL NLPFLKQYYY FYKYKNNTRR LFIVNMFAMI LATIIVLHVA ECVLYFSIWF FTSKPNSSMY LYNNFDNEWY STESTFVWWK EIETLEYFVS TISEWFTT
|
| |
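The length fields in the record above are mutually consistent, and the check can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence ends in TAG). The function names are hypothetical. A GC helper is included that ignores the spaces separating the 10-base blocks.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon.

    Assumes the gene length includes the stop codon.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Values taken from the record: Start bp 1038085, End bp 1039371.
length = gene_length(1038085, 1039371)
protein = expected_protein_length(length)
print(length, protein)  # 1287 428
```

Passing the full "Gene sequence" value to `gc_percent` would give a figure to compare with the reported 40% GC content; the record does not say how that value was rounded.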