Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_53620 |
Symbol | MLS1.1 |
ID | 4851826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2927124 |
End bp | 2928779 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | |
GC content | 48% |
IMG OID | 640393534 |
Product | Malate synthase, glyoxysomal (MASY) |
Protein accession | XP_001386893 |
Protein GI | 126275718 |
COG category | [C] Energy production and conversion |
COG ID | [COG2225] Malate synthase |
TIGRFAM ID | [TIGR01344] malate synthase A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTCTC CATTCCCACA AACCGTCGAC AAAGTCAAGG GCCTCAAGAT TTTGGGTCCC CTCCCTGAAG GAGCCAAACA CATCTTCAAT GTTGAAGCTT TGGCTTTCGT TGCTACTTTG CACCGTTCGT TCAATGCCCG TAGAAAGGAA TTGTTAGCCA ACAGAAAGGA AGCCCAGAGG TTGAGAGACT CCGGTAAGTT GCCTGACTTT TTGCCAGAAA CTGCCTACAT CAGAGATGAT GCCACCTGGA CCGGACCTCC ATTGGCCAAG GGCTTGCAGG ACAGAAGAGT TGAAATCACC GGTCCTACTG ACAGAAAGAT GGTGATCAAT GCCTTGAACT CCAACGTTGC TACTTACATG GCAGATTTCG AAGACTCGTT GACTCCAGCT TGGAACAACT TGGTCGAAGG TCAGGTTAAC TTGTACGACG GTGTCAGAAG AAACTTGACT TTTGAAACCA ACGGCAAGAA GTACGCCTTG AACCTTGACC CCAAGAGACA CATCCCAACC TTGATTGTCA GACCCAGAGG CTGGCACTTG GAAGAAAAGC ATGTCACTGT GGACGGCGAG CCAGTTTCGG GTGGTATTTT TGACTTTGCC ATCTACTTCT ACAACAACGC TGTTGAATCC TTGAAGCAGG GCTTTGGACC ATACTTCTAC TTGCCAAAGA TGGAACACCA CTTGGAAGCC AAGTTGTGGA ATGACATCTT CAACTACTCC CAGGACTACA TTGGTATTCC AAGAGGTACC ATCAGAGCCT CCGTTTTGAT TGAGACTTTG CCAGCTGCCT TCCAGATGGA CGAAATCATC TACCAATTAA GACAGCACAT TGCCGGCTTG AACTGTGGTA GATGGGACTA CATCTTCTCA TACATCAAGT CATTGAGAAA CCACCCTGAA TTCATCTTGC CAGACAGATC CCAAGTGACT ATGGCTGCTC CATTTATGTC ATCTTACGTA AAGTTGTTGG TGCACACATG TCATAAGAGA CAAGTACATG CCTTAGGTGG TATGGCTGCG CAGATTCCAA TCAAGGACGA TCCAGAAAGA AACAGATTGG CTTTGGAAAA CGTTGCCAGA GACAAGTTGA GAGAAGTGAC CACTGGTTGC GACTCCTGTT GGGTTGCTCA CCCAGCCTTG GTTCCTGTCG TGTTGAAGGT CTTCAACGAG CACATGAAGG GTCCTAACCA GATCAATGTT CCACCAAAGA CTCCATACAA GCCTGTTACT GCAAGAGACT TGTTGTCGCC ATTTGTTCCA GGTGCCAAGA TCACCGAACA GGGTATCAGA GCCAACATCA TCATTGGTCT CTCGTACATT GAAGCCTGGT TAAGAAACGT CGGCTGTGTT CCTATCAACT ACTTGATGGA AGATGCTGCC ACTGCAGAAG TGTCTAGAAC TCAAATTTGG CAATGGGTTA CCCACGGTGC TACTACAGAC ACCGGAGTCA CCGTCACCAA GCCTTATGTG CAAAAGTTGT TGAAGGAAGA ATACACCAAG TTGGCCCAGG CTGCCAAGCC AGGCAACAAG TTCAAGCCAG CTTTGGCCTA CTTTGCCCCA GAAGCTTCTG CTGACAGGTA CTCTGATTTC CTTACGACTT TGATCTACGA CGACGTCACC ACCATCGGCA GAGCCTTGCC AGGTGAAAAA TTGTAG
|
Protein sequence | MSSPFPQTVD KVKGLKILGP LPEGAKHIFN VEALAFVATL HRSFNARRKE LLANRKEAQR LRDSGKLPDF LPETAYIRDD ATWTGPPLAK GLQDRRVEIT GPTDRKMVIN ALNSNVATYM ADFEDSLTPA WNNLVEGQVN LYDGVRRNLT FETNGKKYAL NLDPKRHIPT LIVRPRGWHL EEKHVTVDGE PVSGGIFDFA IYFYNNAVES LKQGFGPYFY LPKMEHHLEA KLWNDIFNYS QDYIGIPRGT IRASVLIETL PAAFQMDEII YQLRQHIAGL NCGRWDYIFS YIKSLRNHPE FILPDRSQVT MAAPFMSSYV KLLVHTCHKR QVHALGGMAA QIPIKDDPER NRLALENVAR DKLREVTTGC DSCWVAHPAL VPVVLKVFNE HMKGPNQINV PPKTPYKPVT ARDLLSPFVP GAKITEQGIR ANIIIGLSYI EAWLRNVGCV PINYLMEDAA TAEVSRTQIW QWVTHGATTD TGVTVTKPYV QKLLKEEYTK LAQAAKPGNK FKPALAYFAP EASADRYSDF LTTLIYDDVT TIGRALPGEK L
|
| |