Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_65920 |
Symbol | |
ID | 4840286 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 681450 |
End bp | 683240 |
Gene Length | 1791 bp |
Protein Length | 576 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640391601 |
Product | predicted protein |
Protein accession | XP_001385482 |
Protein GI | 126137918 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0024] Methionine aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAAGA TGCAACTTGC TGTCCACCAA GAGGACGCTG ACATTTTGTT GAAGGAAAAG AACGTTCTCA ACGATCCTGT TTTGGACAAA TACAGAGTTT CTGGCCAGAT TGCTCAGACT GGGTTACAAT ATATTGCTTC TTTAATCAAT GATTCGTACC ATTTAGGCAA ATACCCTCAA CCCTTGACCG TCCAGGAGTT GTGTATTTTG GGAGACTCGT TTTTGACCAA GTTGTTGTCC CGTGTGTATA ATAATGTCAT CAGAGAGAAG GGAATTGCCC AGCCTACCTC CATCGAAGTC AATGAGTTGG TGGCTGGTTT TGCTCCAGAA GTTGATGATG AAGGTGCCTA CACTTTCGTA GCTGGAGATG TTGTTACCAT CTCGTTGGGT GTCCAGATCG ACGGTTATAC TGCCAATGTG GCGCATACCG TGGTCATCTA CCCTGCTGGC GTAGAAGTGA ATAACGAAAT CAAGCCCACT GGACCTTTGT TGGGAGGTAA GGCTGACGCC ATCTGTGCTA CCCATATTGC CACCGAGACC ATTGTTGCTT TGTTGGGATT GGCTTTATCA CCAGAAAAGA TTCCTGCCCA ACTCAAGATC AACGGTAGCG CCACTATCAC CGGAGGTCAC ATCCGTGCTC TTGTGGACTC GGTTGCTGAG TCGTTCAACT GTGTGGTTTT GCCAGGATCC AAGGTCAGAA GAGTAAGAAG ATTCTTGTCG GGACAAGCTG AGGGTATTGT TGCCGAACGT GATTTCAAGG GTGTCGTTTG GGACGAATCT CACCAGGAAC AGAAATTGTT ACAGAAGAGT ACCATAAGTA ATAGCACAGA TTTGATCATC CAAACAAACA ACAGCAACAC TAGCACATCA ACCAACACCT CCAGCGCCAT TCCAACAGAT GATTTTGTTG TTTTGGCTGG TGAAGTGTAC CAAATCGACA TGAGATTGGC TTCTTTACAG GAGTTCGAAG GCGAGGCTGG TTTGATCACC ACCGAGGAAA TCGACCATTT CACTGGCAAG AACCACAAGA ATGAATTCAA CTGTAAGAGC ACTATCCACG TTCGTGATTT CGCAGTGACT CACCAGTTGA AGTTAAAGAC TTCTCGTAGA TTGTTAGGTG AAGTCGACAA GAGATTCTCG GTTTACCCAT TCAAGTTATC ATACACCTGC AAGCATTTCC CAGTCAAGTT AGAAAATGAC AATGTCCAAG AACAATTGGC ACAGATTAAG TCCGAATTGA AGACCAACAA ATTGGGGTTG TCCGAGTTGT CCAATAGACA TTTAATAAAA TCAAAGCCAG TGCAAGTCAC AAAGTTCATA CCCTTGGACA AGATCTTGCT TTCAGCTAAC CCTACTGGTA AACACGCAAT TGACATGAGC AAGCCTGTTT TGCCAGGTAT GGAAATCCCC TTGCCAAACT TGGGAGTCTC GTCTTTGAAG TTGAAGGCTT TGTTGAAGCA CGCCAAGCCT ATTGCTAACG TCAGAGAATC TACTACTGTT GTTCTCAACA ACGTCAAGAA CGAAGTTGTT CGTTTGACCG GCGGCTCTAA GACTACTACG CCAAGCTGGG TCCACTCTCA ATACAAGTTG GGAGGTGCCT ACGTCCAGTC CATTGAACAA ATTGTGCAAT TGAGCAAAGA CAAGAGATTT GGTATCAAGG TCAAAGAATG CCAGCCATAC AACTTGAGCA AGACTGTTGG CCAGGCTGCC GAGACCATGG AGTTGGATTA GATAGCGAAG TAGACGTTGT AAAATAGCAT TTTTGAATAG ACAAGAGATA AAAGTTACGT C
|
Protein sequence | MSKMQLAVHQ EDADILLKEK NVLNDPVLDK YRVSGQIAQT GLQYIASLIN DSYHLGKYPQ PLTVQELCIL GDSFLTKLLS RVYNNVIREK GIAQPTSIEV NELVAGFAPE VDDEGAYTFV AGDVVTISLG VQIDGYTANV AHTVVIYPAG VEVNNEIKPT GPLLGGKADA ICATHIATET IVALLGLALS PEKIPAQLKI NGSATITGGH IRALVDSVAE SFNCVVLPGS KVRRVRRFLS GQAEGIVAER DFKGVVWDES HQEQKLLQKS TISNSTDLII QTNNSNTSTS TNTSSAIPTD DFVVLAGEVY QIDMRLASLQ EFEGEAGLIT TEEIDHFTGK NHKNEFNCKS TIHVRDFAVT HQLKLKTSRR LLGEVDKRFS VYPFKLSYTC KHFPVKLEND NVQEQLAQIK SELKTNKLGL SELSNRHLIK SKPVQVTKFI PLDKILLSAN PTGKHAIDMS KPVLPGMEIP LPNLGVSSLK LKALLKHAKP IANVRESTTV VLNNVKNEVV RLTGGSKTTT PSWVHSQYKL GGAYVQSIEQ IVQLSKDKRF GIKVKECQPY NLSKTVGQAA ETMELD
|
| |