Gene Information |
Locus tag | PICST_69865 |
Symbol | |
ID | 4837444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 2133916 |
End bp | 2137063 |
Gene Length | 3148 bp |
Protein Length | 1021 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640388759 |
Product | predicted protein |
Protein accession | XP_001382635 |
Protein GI | 150863974 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | [TIGR01408] ubiquitin-activating enzyme E1 |
|
|
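The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the protein is encoded by one contiguous reading frame ending in a single stop codon, which fits the "Is gene spliced | No" entry. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2133916
end_bp = 2137063
protein_length_aa = 1021

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one contiguous reading frame plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Bases in the reported span not accounted for by the reading frame.
flanking_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, flanking_bp)  # 3148 3066 82
```

Under these assumptions the 3148 bp span holds 3066 bp of reading frame, leaving 82 bp outside it.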
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.91244 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CATGGCCGAC AGCATGCTTG TAGACTCGCC TGTACCTCAG GAAATCGACG AGGGCCTTTA CTCTCGTCAA TTGTACGTTT TGGGCAAAGA AGCCATGTTG AAGATGCAGA ACGCATCTGT GTTAATTATA GGCTTGAAAG GCTTGGGTGT AGAAATCGCC AAAAACGTTG CTCTTGCCGG AGTCAAGCTG TTGAGCTTGT ACGATCCCGA GCCCGTAGCC ATCTCTGACT TGTCGTCGCA GTTCTTTTTG GACGAATCGG CTGTTGGCAG CAACCGTGCT GAAGTCACTG CTCCAAGACT CGCTGAACTC AATGCCTATG TTCCCATCTC CGTCATTACT GACTTGTCTG AGGCAACTTT GTCAAACTAC AAATGTATTG TTGCCACCAA CTTGCTGTTA GAAGAACAAG TCCGAATCAA CACCTTTACA CACGAACGGG ACATTGGTTT TATTGCCGCC GACAATCGAG GCTTGTTTGG CCAGTTGTTT GTAGATTTCG GATCCAGTTT CACGATTATC GACCAAACGG GCGAAGAACC ACATACGGGG ATTGTTTCCG ACATTGAGGC TGACGGAACT GTCACCATGC TTGACGATAA CAGACACGGC TTGGAAGATG GAGACTACGT CAAGTTTACG GAAGTCGAAG GAATGCCCAA GTTGAATGAT GGAAACCCTC ACAAGATTGA AGTTTTGGGC CCATATGCCT TTAGAATTAA TATTGATGAA TCTTACGGCA AGTATGTCAA GAACGGTTTA TATACGCAAG TCAAGGTTCC CAAAGAAATC CACTTTGAGC TGCTTTCATC GCAATTGGCT AATCCTGAGT ACATCATTTC CGACTACGCC AAGTTCGATA GACCTCCCCA GTTACATTTA GGCTTCCAGG CTTTACAAGC TTTCCAGACG AGACACTCCG GGAAATTGCC CAGACCATAC AACTCTGAGG ATGCTTCGGA ATTGTTCCAA TTATCCCAAG AAATCGCAGT ACAATTTCCT ACCATCTTGG GAGAAGCTTC TGTAGACGAG AAGTTGATCA AGGAATTGGC CCACCAGGCT ACTGGGGACA TTCCAGGAAT TGTAGCCTTC TTTGGTGGTT TAATTGCCCA GGAAGTGTTG AAGAACTGTT CTTCTAAGTT TGGCCCAGTC AAGCAATGGT TGTACTTCGA TTCGTTGGAG TCGTTACCTT CCAACGAAGA GTTTCCCAGA ACTGCTGAGA CAACCAAACC ACTTGGTTCG AGATACGACG GCCAGATTGC CGTCTTTGGT AAAGCTTTCC AGGAAAAGAT CGCCAATTTG AAAGTCTTCC TTGTTGGTTC TGGTGCCATT GGTTGTGAAA TGTTAAAGAA CTGGGCCATG ATGGGGTTGG GCAGTGGACC TGATGGTAAG ATTATAATCA CTGACAACGA CTCAATTGAA AAGTCCAACT TGAACCGTCA ATTCCTTTTC AGACCAAAGG ACGTTGGTAA GAACAAGTCT GATGTAGCTG CCGTAGCTGT TCAGCACATG AATCCAGACT TGACGGGTAA GATCGAGTCA AAACTTGACA AAGTGGGCCA CGAGACTGAA AATATCTTCG ACGATGATTT CTGGAAAGGT TTGGATTTTG TTACCAACGC TTTGGATAAC GTCGAAGCCA GAACCTATGT AGACCGTCGT TGTATCTTCT ACAAAAAGCC TTTGTTGGAG TCGGGAACAT TGGGAACTAA AGGTAACACG CAGGTAGTTA TCCCTAACTT GACTGAATCA TACTCTTCTT CACAAGATCC ACCTGAAAAG TCCATTCCCT TATGTACTTT 
GAGATCGTTC CCAAATAAGA TCGACCACAC AATTGCTTGG GCCAAGTCAT TGTTCCAGGG TTATTTCTTT GACTCTCCTG AAAGTGTCAA CTTATACTTG AGTCAACCAA ATTATGTAGA ACAGACTTTA AAGCAGAACC CGGACATCAA GGGCACGTTG GAAAACATCT CCAACTTGTT GAATAAACGG CCATACAGCT TCGACGATTG CATCAAGTGG GCACGTTTGG AATTCGAAAC CAAGTTTAAT GGTGACATCA AGCAACTTTT GTACAACTTC CCAAAGGATG CAAAGACGTC TACTGGTGCT CCGTTCTGGT CCGGTCCAAA GAGGGCTCCA GAGCCCTTGG TATTTGACAT CAACAACTCT GACCATTTCA ACTTTGTAGT TGGTGGAGCC AACCTCTTGG CTTACATCTA TGGCTTGAAG GCTCCTGTTG CTAGTTTCGA CGATTACAAG AAGGTCATTG ATCTGACTAA AATTCCAGAG TTCACTCCTA GATCAGGTAT TAATATTGCT GCAACTGAGA ACGAAGCCGA AGAACAGGAA AAGACTTTGG CTGGATCGAT CGACGACAAC GAAATCAGAA AGATCGCCGC TTCTTTACCT GAGCCAAGCA CTTTAGCCGG CTATAGATTA AATCCTATTG AATTCGAAAA GGACGACGAC ACCAACCATC ATATTGAGTT CATTTCGGCT GCTTCTAACT GTAGAGCTTT GAACTATGCC ATCGAGACTG CCGATGCCCA CAAGACTAAG TTCATTGCTG GTAAGATTAT CCCAGCCATT GCCACCACCA CAGCATTGGT GACTGGTTTG GTCTGCCTTG AGTTATACAA GGTTGTTGAC GGAAAGGACG ACATTGAGCA ATACAAGAAT GGTTTCATCA ATTTGGCCTT ACCTTTCATT GGATTCTCGG AACCAATCAA GTCAGCTCGT GGAAAGTACA ACAAGAAGGA TTTCGACCAG ATCTGGGATA GATTCGACTT AGAAGGTAAC ATCACCTTGC AGCAGTTGCT TGACCACTTC AAGGAAAAGG AGGGTTTGGA AATCTCGATG TTGTCATATG GAGTGTCTTT ATTATATGCC TCATTCTTCC CACCCAAGAA AATCAAGGAA AGATTATCAT TGGACTTGAC GACTTTAATC AAACAAGTCA GCAAGAAGGA CATCCCAGAC CACGTGAACT ACTTGATCTT TGAAGTTTGC TGCGACGACG AAGAAGGTGA AGATGTTGAA GTTCCTTACA TCTGCGTGAG TATCTAGATT AAATGATTTA TAAAGTTACG ATTACAATAG AAATAACATA AAGATATTAT ATCTCGATTT TGAATTATAG CGACCATT
|
Protein sequence | MADSMLVDSP VPQEIDEGLY SRQLYVLGKE AMLKMQNASV LIIGLKGLGV EIAKNVALAG VKSLSLYDPE PVAISDLSSQ FFLDESAVGS NRAEVTAPRL AELNAYVPIS VITDLSEATL SNYKCIVATN LSLEEQVRIN TFTHERDIGF IAADNRGLFG QLFVDFGSSF TIIDQTGEEP HTGIVSDIEA DGTVTMLDDN RHGLEDGDYV KFTEVEGMPK LNDGNPHKIE VLGPYAFRIN IDESYGKYVK NGLYTQVKVP KEIHFESLSS QLANPEYIIS DYAKFDRPPQ LHLGFQALQA FQTRHSGKLP RPYNSEDASE LFQLSQEIAV QFPTILGEAS VDEKLIKELA HQATGDIPGI VAFFGGLIAQ EVLKNCSSKF GPVKQWLYFD SLESLPSNEE FPRTAETTKP LGSRYDGQIA VFGKAFQEKI ANLKVFLVGS GAIGCEMLKN WAMMGLGSGP DGKIIITDND SIEKSNLNRQ FLFRPKDVGK NKSDVAAVAV QHMNPDLTGK IESKLDKVGH ETENIFDDDF WKGLDFVTNA LDNVEARTYV DRRCIFYKKP LLESGTLGTK GNTQVVIPNL TESYSSSQDP PEKSIPLCTL RSFPNKIDHT IAWAKSLFQG YFFDSPESVN LYLSQPNYVE QTLKQNPDIK GTLENISNLL NKRPYSFDDC IKWARLEFET KFNGDIKQLL YNFPKDAKTS TGAPFWSGPK RAPEPLVFDI NNSDHFNFVV GGANLLAYIY GLKAPVASFD DYKKVIDSTK IPEFTPRSGI NIAATENEAE EQEKTLAGSI DDNEIRKIAA SLPEPSTLAG YRLNPIEFEK DDDTNHHIEF ISAASNCRAL NYAIETADAH KTKFIAGKII PAIATTTALV TGLVCLELYK VVDGKDDIEQ YKNGFINLAL PFIGFSEPIK SARGKYNKKD FDQIWDRFDL EGNITLQQLL DHFKEKEGLE ISMLSYGVSL LYASFFPPKK IKERLSLDLT TLIKQVSKKD IPDHVNYLIF EVCCDDEEGE DVEVPYICVS I
|
| |
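The "Translation table | 12" entry matters when reading the two sequences side by side: in NCBI translation table 12 (the alternative yeast nuclear code) the codon CTG encodes serine rather than leucine. The sketch below illustrates this on a short excerpt of the gene sequence above (grouping spaces removed) that corresponds to residues 61 to 66 of the protein, VKSLSL. The codon maps are deliberately partial and cover only the codons in the excerpt; the names `STANDARD`, `TABLE_12` and `translate` are hypothetical helpers, not part of any library.

```python
# Partial codon maps: only the codons that occur in the excerpt.
STANDARD = {"GTC": "V", "AAG": "K", "CTG": "L", "TTG": "L", "AGC": "S"}

# Table 12 differs here in one respect: CTG is read as serine.
TABLE_12 = {**STANDARD, "CTG": "S"}


def translate(dna, codon_map):
    """Translate an in-frame DNA string codon by codon."""
    return "".join(codon_map[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Excerpt from the gene sequence above, already in frame.
excerpt = "GTCAAGCTGTTGAGCTTG"

print(translate(excerpt, TABLE_12))  # VKSLSL, as in the protein sequence
print(translate(excerpt, STANDARD))  # VKLLSL under the standard code
```

Translating this record with the standard code would therefore place a leucine at position 63 where the listed protein has a serine.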