Gene Information |
Locus tag | PICST_76901 |
Symbol | MET2 |
ID | 4837851 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 301516 |
End bp | 303493 |
Gene Length | 1978 bp |
Protein Length | 477 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640389166 |
Product | homoserine O-acetyltransferase |
Protein accession | XP_001383339 |
Protein GI | 126133629 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2021] Homoserine acetyltransferase |
TIGRFAM ID | [TIGR01392] homoserine O-acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AGACTGCAGT ATCATCTGAG CAAACCACAT AAGCGATTTT CACTCGTATT GCATCACTTT CTGCTACTTT CCGGATCCAA CTAGCGAATT AGGCAGTGAC ACCAAGGAAC GACTTACCGT ATCGGGCCCA CCCCCAATTT TTCAGTTATT AATTCTAGGA TCCACAGTTT CCTAAAAGGC GAAACCCAAA AGGCGATACT AATTTACTAT TGCAGCCAGG GATTATTTCG AACGATCAAA TCGTGCACAA TACAACCTAC TGCCAGAAAA AAGTTTGACT GATATTTTCA TTATATCCAT GTTCTCTACT AGCAAGCATT TGGTACTCTA CTGATACTCC ACGAGGTACT TCTATTACAA AAGAACAATA TAAATCGGCT CAAGAGTCGC ATCATTGATC CACTCACCCG CATTCCATCA TCCGACCCGA ACCGCAAAGT AGAATGACTA CCTTTAACAA TCAGGTGTAT GAAGACATCA CTGCTGAGCA GAAAAGAACG AATTCGTACG CCAGTCTCGT TCCTGGCCAG ACCATTGTAG AAATCCCCTC CTACACGCTT GAGTGTGGAG AGGTCTTGTC ACGTTTCCCT GTAGCCTACA AGACGTGGGG TAGATTAAAC GAAGAGAAAG ACAACGTCAT TTTGATCTGC CATGCCTTGA CTGGATCCTC TGATGTCCAG GACTGGTGGG GACCCTTGCT TGGTACTGGC AAGACGTTTG ATCCCTCACG CTACTTCATC ATCTGCATCA ATTTCCTCGG TTCACCCTAC GGCTCGGCTT CTCCTGTCAG TATTGACAAG TCTACAGGTA AGCCGTATGG ACCCTCTTTC CCATTGGTCA CAGTAAAAGA CGACATCGGT ATCCAGAAGT TGATCTTGGA CTCGTTGAGT GTCAGATCGA TCGCTTGTGT CATTGGTGGA TCTATGGGAG GCATGCTTGC CTTGGAGTAC TCTGCTACAT ACAACAAGAC AAACTACGTT AGGTCTATAA TAGCGCTTGC CACCTCTGCT CGAGCCTCAG CCTGGTGTAT CTCATGGAAC GAAACCCAGA GACAGTGTAT CTTTAGCGAT CCGTTCTACG ATGACGGCTA TTACTACGAG AACAATGGCA TCAAGCCTGA CTCGGGGCTT AGCGCTGCGA GAATGGCTGC GCTCTTAACG TATCGTTCAA GAAACTCTTT TGAAACCAGA TTTGGCCGTA AGTTGCCCAA CAAGATCGGC TCAGTCGTAA CTCCTGTAGA TGACGAGGAA AAAAGAAAGC GAGATGAAGA AGAACAGGGC ATCAGATACC CTAAGACAAA AGACGAAGAG AATTGGTTGT TACACAACGA AGGCTCCAGA TCGTCCAGAT CGTCATTGAA CAGAACTTCA TCGCAAAGTA GCATCAACAT GACCAGCAAG CCCCAAACTT ACTTCACGGC TCAGTCGTAC TTGAGATACC AAGGTAACAA GTTTATTACT CGTTTTGACG CCAACTGTTA CATATCTATA ACCAGAAAGC TTGACACCCA CGACATCACC CGAAGCCGAA TCTCGGTACA AGACACGGTG GAAGACCCGT TGCCTAGTTT CTTGCAGAGC TTACAACAGC CACATTTGAT TATAGGTATC CAATCAGATG GGTTGTTTAC CTACGGTGAA CAGCAGCTCT TGGGTGAGAA TATCCCTGAC TCGAGCTTGA AGAAGCTCGA CTCTCCAGAA GGGCATGATG CCTTCTTGTT GGAGTTCGAA TTGATCAACA ACTACTGCTT GAAGTTCTTA CAAGACAAGT TGCCTGAATT CTACGACGTG ACATCTGGAA 
AGTACCAGCC GTTTGAGAAC TGGACCGAGT TCGTGGACAG CACTGACAAT GGAGGTAACT CTGTGTTTGG TGAGGCTGAG AAGAACATTA CCAATTGGTA GACTAATGGC ATGAAAAATG TATATCTGTT AATAAGAAGT AAATATCTAA TACATGGAAA TAAAAAGTAT ATATTTAA
|
Protein sequence | MTTFNNQVYE DITAEQKRTN SYASLVPGQT IVEIPSYTLE CGEVLSRFPV AYKTWGRLNE EKDNVILICH ALTGSSDVQD WWGPLLGTGK TFDPSRYFII CINFLGSPYG SASPVSIDKS TGKPYGPSFP LVTVKDDIGI QKLILDSLSV RSIACVIGGS MGGMLALEYS ATYNKTNYVR SIIALATSAR ASAWCISWNE TQRQCIFSDP FYDDGYYYEN NGIKPDSGLS AARMAALLTY RSRNSFETRF GRKLPNKIGS EKRKRDEEEQ GIRYPKTKDE ENWLLHNEGS RSSRSSLNRT SSQSSINMTS KPQTYFTAQS YLRYQGNKFI TRFDANCYIS ITRKLDTHDI TRSRISVQDT VEDPLPSFLQ SLQQPHLIIG IQSDGLFTYG EQQLLGENIP DSSLKKLDSP EGHDAFLLEF ELINNYCLKF LQDKLPEFYD VTSGKYQPFE NWTEFVDSTD NGGNSVFGEA EKNITNW
|
| |
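The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive (which matches the listed Gene Length), and that the coding sequence includes one stop codon. The helper names `gene_length`, `coding_length` and `gc_percent` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode a protein, assuming one stop codon."""
    return (protein_aa + 1) * 3


def gc_percent(sequence: str) -> float:
    """Percent G+C, ignoring the spaces between 10-base groups."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Values copied from the Gene Information fields of this record.
    print(gene_length(301516, 303493))  # 1978, matching "Gene Length"
    print(coding_length(477))           # 1434 nt for a 477 aa protein
```

Under these assumptions the 477 aa protein accounts for 1434 of the 1978 bp; the remaining bases would be non-coding sequence, which is consistent with the gene being flagged as spliced. Passing the Gene sequence field to `gc_percent` gives a value to compare with the listed GC content.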