Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_59020 |
Symbol | |
ID | 4838505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 1659973 |
End bp | 1662351 |
Gene Length | 2379 bp |
Protein Length | 792 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640389820 |
Product | predicted protein |
Protein accession | XP_001384270 |
Protein GI | 150865166 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.200991 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTCTCCACTA AGTTTGTGGA CTACTGGAGA TCAAAGAACA CACGCCAACA ACATACGTTG CTAGCCCTGG TCTCAATTGC GGCTTCGCTC TTCTACTACG TGCTTCTGCC GTCACTTCCT TGGAACCGCA GTACTATGTT CGCGAAAAGA CCTGACAAAT ACACTACGGG CCTAATCAAC CTACGGAACG ATTGCTTTGC CAACTCGTCC GTCCAAGCCT ATCTGGCTCT TCCTGGACTT ACCGACTACT TGAACAAGTT CATCACCAGT TTCAATGAGC TTTCGGCTTT TGCGAAGTCG AAAAATATCG ACATCAACAA TGTAATCCAC CACCAGAGAC TCAGTACGAA AAATGGAGAT TCCAGTGCAG CCTCAACTTC AAAGTTCAAA AATACTATAT CCAAGTTTGA CATCTCGCTC CATATTGGGC TCGCCGACAT CATCAAGAAA CTCCAGGAAA CCCAGATGGC ATCACGAACT ATATCCGTAT GGACTTTTCT CCACGTACTA GAGGGTATCT TCAATGCAAA AATATCTCGA TCACAACATG ATGCCCATGA ATTGACTCAG CTTATCAACG AGACGTTAGA GAATGAAAAT ATCAAAATCA AAAACTTGCA CAAGTACATC AAGACCAATC TCCACACCAT TCTCGGACTG AGGGACACTC CTTCACCGAG AGACTACTCG ACATTGGACA AGATCCAGGT GCCAGAGTTT CCCTTCTCGG GGTTAACTTT GAGCCAGCTC AAGTGTCTCA AGTGTCTTGG GGTATCTACA CCGAACTTTG CTCCGTTCTT GATGATCACA TTGCACACGC CACAAAAACT GTCTAGTGAC ATCGAAGACA TGTTGAACGA CAACAAGACA GAGTCCATAG AAGGCTACCA ATGCCTCAAG TGTAGGATCG TCGCGATAAT AAATAACGAA AACCACATGA AGCGTACCAT TCCTGATCAA GATGTCAAGC ACATCAACGA ACTCAAGAAG TTGAACAACA ACTCCAAACT TTGTATCAAT GACGATTTGC CAAAAGAATT GGAAGACTTC ATCCGCGATT ATAACGTAGG CGGAGTTAAT ATTTCCCAGA TCACTTCGAC AGTGTTTAGG CATACGCAGA TCTTGAAGCC TCCCAAAATA TTCGGTGTGC ACCTCTCTCG TTCAAGCTTC AACGGAGGCA ATGCCACAAG GAATCCCTGC AAAGTTTCGT TTAAAGAGCA CTTGACGTTG TCTATAGGTA AGGAGTACCA CGAGCAGTTA AGACAATTCC AACACCAGGC AGAGGAAGAG GAGGAAAAGC AGTTGGAGTC GAAAATCGAA GTCACGGCTG CACATGTCTT AACCCGCGAT GTAAACGATA TGGAAGACGA AGACGTCCAA AGAGAAGATG TCGATGTTAA AGGAACAGAA GATGTAGATG TTGAAGCTAA TGTAGTCGAT AATGGCACTT CTACAGATGA TGCTGGAGAA GAAAACGATC TAGAAGACAA CGACACTTCG TCTACTTCAA CTGAAGAATC GATGCAGCCC TCAGTTAGTA CCACGGCCAC GATGCAGAAC AGTTCAATTA CCAATGCCTC CGACAAGTCT CGTACCATAA ACAGCGCTCC AATTTCAGAC GACCAATCAG AAAAGTTACG GGACCACTTC AAAAAGTTCA AGTTCAACGA AAATGACGTT TACAAGTACA GACTCAAAGC CATGATCAAG CACCAAGGCT CGCACACCCA AGGTCACTAC GAGTGCTACA AGAAGAAGCC TTTGTTTGTC AAGGATAAGG ACGGGAACAT ATTCAAATTG TTCCCCGAGA TTATTGACGA TTTTAATGGT GACACAACAT ACGATGTAGT TCCGGCTACC TCTTCTGAGC TTGCATCAGT GTCCATGAGG ACTTCCTCAG AAGGAACCAA ATCATCTATT AACACGGGTC ATTCTTCTTT GGACAAACGT AGATCATCTA GCAATAGCTC TATGGGCTCC AAGGATGGAA ACGTTCGTCG TAGACTCTCC ACTATGATGG GCCGTCGTCC ATCTGTGTTC CAAGCTGATC CGGAGGAAGC CGGTATTCAA GAGATTGTCA ACTCAGGGTC GGCTACTCCA GCAGAGTTGT TAGTAGATGA GCCTCGCGAG TACTTCTCAG CTGAGTTGGC CCTGGCTGCT ATTAGCAAAT CGGTCCACGA TTCGCAGAAT AGCCAACATT CGGATAAGGT CAAGATGAAG AAGATTCCTT CTGCCATAAA ACAACCCTAC TGGAGAATCA GCGATTCCAA AGTGACTGAG GTAAGTCGAA GCACAATGAT GCTTGAAATG ACAAGTGTCT ACATGTTGTA CTACGAGAGA GTCGACCGCA AACAAATCAA ACATTCCCAA ATAGTTTAG
|
Protein sequence | VSTKFVDYWR SKNTRQQHTL LASVSIAASL FYYVLSPSLP WNRSTMFAKR PDKYTTGLIN LRNDCFANSS VQAYSALPGL TDYLNKFITS FNELSAFAKS KNIDINNVIH HQRLSTKNGD SSAASTSKFK NTISKFDISL HIGLADIIKK LQETQMASRT ISVWTFLHVL EGIFNAKISR SQHDAHELTQ LINETLENEN IKIKNLHKYI KTNLHTILGS RDTPSPRDYS TLDKIQVPEF PFSGLTLSQL KCLKCLGVST PNFAPFLMIT LHTPQKSSSD IEDMLNDNKT ESIEGYQCLK CRIVAIINNE NHMKRTIPDQ DVKHINELKK LNNNSKLCIN DDLPKELEDF IRDYNVGGVN ISQITSTVFR HTQILKPPKI FGVHLSRSSF NGGNATRNPC KVSFKEHLTL SIGKEYHEQL RQFQHQAEEE EEKQLESKIE VTAAHVLTRD VNDMEDEDVQ REDVDVKGTE DVDVEANVVD NGTSTDDAGE ENDLEDNDTS STSTEESMQP SVSTTATMQN SSITNASDKS RTINSAPISD DQSEKLRDHF KKFKFNENDV YKYRLKAMIK HQGSHTQGHY ECYKKKPLFV KDKDGNIFKL FPEIIDDFNG DTTYDVVPAT SSELASVSMR TSSEGTKSSI NTGHSSLDKR RSSSNSSMGS KDGNVRRRLS TMMGRRPSVF QADPEEAGIQ EIVNSGSATP AELLVDEPRE YFSAELASAA ISKSVHDSQN SQHSDKVKMK KIPSAIKQPY WRISDSKVTE VSRSTMMLEM TSVYMLYYER VDRKQIKHSQ IV
|
| |