Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_56697 |
Symbol | |
ID | 4838162 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 1610751 |
End bp | 1612061 |
Gene Length | 1311 bp |
Protein Length | 427 aa |
Translation table | 12 |
GC content | 47% |
IMG OID | 640389477 |
Product | predicted protein |
Protein accession | XP_001383930 |
Protein GI | 150864919 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000378422 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AATCACCAAA TTCACCAAAA CCATCAGAAC CAGCAACAAC AGCAGTATCT GCAGATACCA CTCGGAAAGT CCGAGTTTGA GCTCACGGAG TATGATCTCA AGTCGCGCGA CTCGAAGTAC CGTAGATGGA CGCCTAAGAT GGACCAGTTT CTCATCAAGT TGCTTTCAGA TGTGGTGCAC AGCTATCCCA AGGGAGCTGA GGCAGAGATG ACGAAAAAGG CATGGGCCTA TGTCACGGGC CAGTTGCGTG CAGCCAACCC AGAAACAGTC TATTCCACTT ATACCAAATA CTCGTGCCAG CAGCATTTGC TCAATGTGAA TCATCACCGA TATAAGATTT GGTACTACTT GATGCTTCAC CAGAAAAACG CACCTGCTAC CAGTTACGCA TACCGGTGGA ATCCAGAATT GGGCCGGTTC CAAGTTATCG ACAATGCTAA CAGTACGTTG ATTCTTGATG AAAGACAAGT CAAGTCGTTG TTGTATAGCG ATTCGCTTCT GCTTCCACAT CTCCAGTCGT TTAACAAAGG CAACTTGATT GTTAACGACT TCTTCTTGAG CGACAACTTG CGCTACATGT CAGTTTACCA TAATGAGGTT TTGCCGTTGC TCATCAGGCT AGATCCCAAG TACGCTGAAG GGTTGGGAGA TCTTTACGCG GACATCCCCA AGTTCGACTA CCAGGAAGCC AGTCTTGAGT ACTTCAAGCC TTTGGTTCCA GCCAGAGCTC ACAAGATGGC ACCCGTAAAT CTGGCTGTTC AGGTTCAACA GGTGCAACAG TCTGTTGTCA AGAAGAGAAC CCATTCGGAT ATACCTTCTG ATATTTCGCT TCCATTTTCC AAGTCTCTCG GCTCATTGAC GGATGATACC GATCCAGACG TCAGTGGACA GCAACAACTG GTCCCAGACG AAGACTCAGT AGATCCAGCT CTCAAAAGGT CTAGAAACTC ACTCCAAGAC ACAACGTCCA ACACAATGGA TTTCGAGAAT GCTTTGGCAA CTGCAGCCAT CGCAGCCATC AACTCTCCAC CTGTAACTAA TGGAAGAGAC TCACTTCCTT TCTACATCAA GGACCGGAAG TGGTTCAACA GATTGCTCAA TCTCCACGAG TCGGGTCTCA TAGGTGTACA GGAAGTGCTT ACCGTCTGTG AAGGTGTCAG AGACGGCAAG ATCCCCTTGT TCATGCTCAA TGTTCTAGAC CAATCGTACT ACCCTACTCG AAACAATACC GGTTTGTCTG AAGAGTTGCC TGATGATGAG ACTGCTAAAA GAATCAGAGA GTTCATGCTA CCAATGGTAT ATAATTCGTG A
|
Protein sequence | NHQIHQNHQN QQQQQYSQIP LGKSEFELTE YDLKSRDSKY RRWTPKMDQF LIKLLSDVVH SYPKGAEAEM TKKAWAYVTG QLRAANPETV YSTYTKYSCQ QHLLNVNHHR YKIWYYLMLH QKNAPATSYA YRWNPELGRF QVIDNANSTL ILDERQVKSL LYSDSLSLPH LQSFNKGNLI VNDFFLSDNL RYMSVYHNEV LPLLIRLDPK YAEGLGDLYA DIPKFDYQEA SLEYFKPLVP ARAHKMAPVQ QSVVKKRTHS DIPSDISLPF SKSLGSLTDD TDPDVSGQQQ SVPDEDSVDP ALKRSRNSLQ DTTSNTMDFE NALATAAIAA INSPPVTNGR DSLPFYIKDR KWFNRLLNLH ESGLIGVQEV LTVCEGVRDG KIPLFMLNVL DQSYYPTRNN TGLSEELPDD ETAKRIREFM LPMVYNS
|
| |