Gene Information |
Locus tag | PICST_30558 |
Symbol | |
ID | 4837976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 379292 |
End bp | 381100 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640389291 |
Product | predicted protein |
Protein accession | XP_001383701 |
Protein GI | 150864740 |
COG category | [R] General function prediction only |
COG ID | [COG0661] Predicted unusual protein kinase |
TIGRFAM ID | |
|
|
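The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the reported Gene Length), and that the unspliced CDS ends in a single stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(379292, 381100)
print(length)                     # 1809
print(protein_length_aa(length))  # 602
```

Both results agree with the Gene Length (1809 bp) and Protein Length (602 aa) fields in this record.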
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0158618 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCTAT TGAAATGCAA CTTTCCACAT CATGTTTTTC TTCTACAGTT ACGCAAAAAG ATGGTTTTGA CGACACGTAT AGGCACAGCT TGTGCTGTGA CCCGTCACAG TGTCGGACCG ATGGCAGTCG GTTTCCGAGG GATGAGTCCC CGTATCTCGT CATCTCTCAC AAGACGTTTC TCAACCTTTG CCAGATCTGG CCCCAAGCCT AGATCTTTCA AAAGGATGTT CACAGTAGGT GTAGCACTTG GCTTAGGAGG CTCGGCATTA TACTACACTA ACGACAATGT CCGTCACGTG GTTCTTACGG CAGAAAGGGT CACGGTGGTG ACAGTTGCTG TCTTCAGGTG TTTCAACATG TACTTACAAA CGCTTGGGAA GGACTACGAG TCCCGTGAAG AGCGCTCCAA GGCGTTAAAG AAGACCCACA AGCGAGCAGC TGATATCACC TTGAGAGCCT TAGAGACGAA TGGAGGTATC TACATCAAGT TGGGTCAGCA TATCACTGCC CTTACCTATT TATTGCCTAG AGAATGGACA GATACAATGA TACCCTTACA AGATCAGTGT CCTCGTTCGT CCATGGAGGA GATCGAGAAA ATGTTTGTCA ACGATTTGGG TACCTCAGTG AATGATATGT TTTCAGAGTT TGATCCAGAA CCGGTCGGTG TAGCCTCTTT GGCACAAGTC CATATAGCAA CGCTCAGAGA GTCGGGCGAA AAAGTGGCTG TCAAGATCCA GCACCCTTCC TTGCAAGAAT TTGTTCCTCT TGATGTTTTC TTGACGAAGA CCGTGTTTGA GTTGATGTAC AAGTTTTTCC CAGAATACCC CCTCACCTGG CTTGGTGAAG AAATGCAGAG CTCTATCTTT ATCGAATTGG ACTTTACAAA AGAAGCAGAA AATGCTCAAA AGACGGCTGC TTACTTCAAG GACCTCAAGA GAGAGACTGC TTTGAAAGTT CCTAAGATTG TCGAAGCTCA GCAAAGAATA TTGATTATGG AATATGTAGG AGGTGCCAGA TTAGACAACC TCGATTATTT ACGAGAACAC AACATTTCTG CCGCCGATGT TTCGTCGTGT TTGTCACACA TATTCAACGA CATGATCTTC ACGCCAGGCG TAGGATTACA CTGTGATCCT CATGGAGGTA ACTTGGCTAT TAGAGCTCTA GAAAACTCGG AATACAAGAA CGGCCATAAC TTTGAAATCA TCTTATATGA CCATGGTTTG TACAGAGATA TACCGGTTCA GATGAAACGT GATTACTCTC ACTTCTGGTT GGCTGTTCTT GATAGTGATG TGCCCAAAAT GAGAGAGTAC GCTGAGAAGT TTGCTGGAAT ACAAGGCGAT CAAAAGTTCA GAATCTTTGT CTCTGCCATC ACCGGAAGAG CTCCAGAAAA TGCATTGAAC GACATCAAGA AACTGAGATC CGAAGAAGAA ATTAGGATCA TCCAAAACGA GTTGAACTAC CTGGAAGGTG TTCTTGAAGA CTTGATGGAT ATCTTAAGTT CTATGCCGAG AATGGTGTTG TTGATTCTCA AGACAAATGA CTTAACAAGG AGCTTGGATG AGAATTTGAA CAACCCCTTG GGACCTGAGA GAACCTTTTT GATTTTGGCA AATTACTGCG CTCGTGTAGT TTACGACGAA GAAAACGAGA AAATAGCCAA GGAGTACAAA TCGTATTCGG TAAAAAAATT TGTTCATTAC TTGACCAACT GGTGGTCCTA CCACAAAAGA ACTAGCCAAT TGTTCCTTTA TGACTTTATC GTCATGCTAC GCAACGCTAG AAGGAGACTC 
TCCTTGTAA
|
Protein sequence | MALLKCNFPH HVFLLQLRKK MVLTTRIGTA CAVTRHSVGP MAVGFRGMSP RISSSLTRRF STFARSGPKP RSFKRMFTVG VALGLGGSAL YYTNDNVRHV VLTAERVTVV TVAVFRCFNM YLQTLGKDYE SREERSKALK KTHKRAADIT LRALETNGGI YIKLGQHITA LTYLLPREWT DTMIPLQDQC PRSSMEEIEK MFVNDLGTSV NDMFSEFDPE PVGVASLAQV HIATLRESGE KVAVKIQHPS LQEFVPLDVF LTKTVFELMY KFFPEYPLTW LGEEMQSSIF IELDFTKEAE NAQKTAAYFK DLKRETALKV PKIVEAQQRI LIMEYVGGAR LDNLDYLREH NISAADVSSC LSHIFNDMIF TPGVGLHCDP HGGNLAIRAL ENSEYKNGHN FEIILYDHGL YRDIPVQMKR DYSHFWLAVL DSDVPKMREY AEKFAGIQGD QKFRIFVSAI TGRAPENALN DIKKSRSEEE IRIIQNELNY SEGVLEDLMD ILSSMPRMVL LILKTNDLTR SLDENLNNPL GPERTFLILA NYCARVVYDE ENEKIAKEYK SYSVKKFVHY LTNWWSYHKR TSQLFLYDFI VMLRNARRRL SL
|
| |
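The record lists translation table 12 (the NCBI alternative yeast nuclear code), in which CTG is read as serine rather than leucine. The following is a minimal sketch of how that single reassignment affects translation of the gene sequence above. The helper names `build_codon_table` and `translate` are hypothetical, and alternative start codons are not modelled.

```python
BASES = "TCAG"

# Standard genetic code in NCBI TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
STANDARD_AA = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_codon_table(overrides=None):
    """Return a codon -> amino acid dict, optionally with reassigned codons."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AA))
    table.update(overrides or {})
    return table


STANDARD = build_codon_table()
# Table 12 differs from the standard code in one amino acid assignment: CTG -> Ser.
TABLE_12 = build_codon_table({"CTG": "S"})


def translate(dna, table):
    """Translate DNA codon by codon; whitespace and trailing partial codons are ignored."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence in this record.
print(translate("ATGGCTCTAT TGAAATGCAA CTTTCCACAT", TABLE_12))  # MALLKCNFPH

# A CTG-containing fragment from the same gene (AAA CTG AGA TCC).
print(translate("AAACTGAGATCC", TABLE_12))  # KSRS
print(translate("AAACTGAGATCC", STANDARD))  # KLRS
```

The table 12 result for the second fragment matches the `KSRS` stretch in the protein sequence listed above, whereas the standard code would give `KLRS`.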