Gene Information |
Locus tag | PICST_73005 |
Symbol | |
ID | 4840289 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 709462 |
End bp | 711900 |
Gene Length | 2439 bp |
Protein Length | 589 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640391604 |
Product | predicted protein |
Protein accession | XP_001385490 |
Protein GI | 150866025 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
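The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which matches the reported gene length) and that the coding sequence ends with one stop codon. The helper names `span_length` and `coding_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 709462
END_BP = 711900
GENE_LENGTH_BP = 2439
PROTEIN_LENGTH_AA = 589


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein (3 per residue, plus an optional stop codon)."""
    return protein_aa * 3 + (3 if include_stop else 0)


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)
# Bases within the gene span that do not encode the protein:
# introns plus any untranslated regions included in the span.
noncoding = gene_len - cds_len

print(gene_len, cds_len, noncoding)  # prints: 2439 1770 669
```

The 669 bases left over are consistent with the record flagging the gene as spliced, although the arithmetic alone cannot tell introns apart from untranslated flanking sequence.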
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.792014 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CACCAGAGTA TCGTTCTAAA CCATTGAATA GCTTAGAGTG TCAGCCATGT CATCGTCCGG CTCGTACTCG CTCCAGAGCA TCAACCCCAA ACTCCTCCGG AGAATCTGTA AGTATTGAGC CAGTATGAGA TCTCGTCTAA TGAAGAGCTA GATGTAACGT CAAACGTTTA TCTTGTCGTA ACAGCAAATA TCAAACGTAG CAAATCAATC ATATCTCAAT CAATCATATC TCAATCCTAC TTATCAGAGA GATCAATCAT GGTGTTGAAT TTGAGAAACA TTCTTGATGA TATTCACTTA ATCAGTATCA GTTCCATCAA TGGACCATTC CATATCTACA GCACTATACC AGTCATTATG TTATTCTTCT ATATCATTTT GAACATTATC AATCGCATCG TAATAATCAA ATGCCAAGTT CCGGAACCAT CTTCTAGTAT TGAAATCTTA ACGATCATCA TTCTACATAT TCGGGGGCCC CCCGTTATCA CCCACTCATA TCGTGATATT GTCTTTACTG ATTACTCATT ATCATTCTAT CATTGAATTC TATGATGATC ACTATTTTCG CTGTAAAAAG TCTCATTCAT TCTTGAAATC TCATCTACCA TGCTATTCAA CATACTAACT TATTCAGACC GTGCGTGCAG ACCTTCGAAC TCCGAGCCCA ACTTGGCTCT CAACTTGGAG ATCTGTGATT ACGTGAATGC CAAACAGGGC TCTATTCCTC GTGAAGCTGC CATAGCAATC GTCAAGTTGA TCTCTCAAAG AGATGCCCAA ACTTCAGAGT TAGCCATCTC CTTGTTGGAC AACTTGGTCA AGAACTGTGG ATATCCGTTC CATTTGCAGA TCTCCCGTAA GGAGTTCTTG AACGAATTGG TCAAGCGGTT CCCAGAAAGA CCTCCCATAC GTTATACGCG AGTCCAGAGA CTCATCTTGG CCCAGATCGA GGAATGGTAC CAGACTATCT GTAGAACTTC CAAGTACAAA GACGACTTTG GCTACATCAA GGACATGCAC CGTTTGTTGA GCAATAAAGG ATATATCTTT CCCGAGGTCA AAGTCGAGGA CGCAGCTGTT TTGAACCCTT CTGATAATTT GAAGTCATTA GACGATATCC AGAAGGAAGA GGCTGTTGTA CACAGTGCCA AATTGCAAGA GATGATCAGA AGAGGTAGAC CTCAAGACTT GCAAGAGGCC AACAAGTTGA TGAAGATCAT GGCTGGTTTC AAGGACGACA ACGTCGCCGA GAATAAGAAA CAGCTTACCG ACGATGTAGC ACGTTTGAGA AGAAAAGTCG AGATCTTGGC GGAGATGTTG AACACGATCC TGAGCTCCAA CAGCAAGATT GAAGACTCCA ACGAGGCTAT TGTAGAATTG TATTCATCTG TGAAGAGTTC TCAGCCGATT GTCACCAAAA TCATCGAAAA TGACAACGGT GATGAAGAAT ATGTCCAGGA ACTCTTGGGC TTGAACGACA ACATCAACTT GGTTATTAAT AAGTTCCAGT TGTTGAAGAA TGGCAAATTA GACGAAGCTT CTCAGATTAA AGTTTCCAGT GGTTCTGGCG CAGGATCCAA TGCAGCCGAA GTTAACTTGA TCGATTTCGA CGATGACGAC ACTCCCGTAG GCTCCAACCA AGCCGAAGAC CAGGGCTACA ACGACTTGTT GAGCGACTTA TCGAACTTGG CATTCACATC TAACGATGCC TCTAAGAGTA ACAACAGCAG TAGTAACACT GCCAACATCA ACCTCTTTGG TGCTGGAGGA TCAATTGCTT TAGGTGATTT ATCCAACAAC 
TCTACCCCAG CCCCAACTCT TCAACACCCA CAGCCCACCA CTAACGCCAG TGGAAACGCA GGCAACTCCT TAGACTTATT GGGCGACTTG AACTCTCCAT CTCAGCAGTT ACAATCCAAC ACCTCAGCAC AATTAGATCC GTTTGGCTTA AATTTCCCCA GCTCGACTCC AAGCCAGGTA CAGAACCAAT TAGGTCAATT CTCTGGTTCG ACTATTGCTA TCTCACTGTC TTCTGTGTTG AAAGTCGAAG TGAGTGTAGC TCCCAATTCT TCCAACCAGT TCTTCCAGGG TAGAGCTTTG TTCAGCAACG TTCAAGCACA GACTATCTCT AACTTCAAGT TTCTCATTGC TGTTCCCAAG TCTTGCAAAT TAGACTTGAG ACCTCAAACA GGCGATACAA TCTACGGCTT TACCAACAAC AGTATATCGC AGGATTTCAC TATCGAAAAC TCTTTGGACA AGCAGTTGAA GATAAAGTGG AAGGCCGAAT ACACCATTGG TGGTGAAACC AAGGATGAGA CTGGGGTTAG TGTATTAAAC AATTAGTATG CTATGTATTG TTACGTCGTT ATTTTATTCG TTTATTCGTT CATATTCTTT TAAAACTAGA CAATACATTT TATAATTGTA CTTTCTATA
|
Protein sequence | MSSSGSYSLQ SINPKLLRRI YRACRPSNSE PNLALNLEIC DYVNAKQGSI PREAAIAIVK LISQRDAQTS ELAISLLDNL VKNCGYPFHL QISRKEFLNE LVKRFPERPP IRYTRVQRLI LAQIEEWYQT ICRTSKYKDD FGYIKDMHRL LSNKGYIFPE VKVEDAAVLN PSDNLKSLDD IQKEEAVVHS AKLQEMIRRG RPQDLQEANK LMKIMAGFKD DNVAENKKQL TDDVARLRRK VEILAEMLNT ISSSNSKIED SNEAIVELYS SVKSSQPIVT KIIENDNGDE EYVQELLGLN DNINLVINKF QLLKNGKLDE ASQIKVSSGS GAGSNAAEVN LIDFDDDDTP VGSNQAEDQG YNDLLSDLSN LAFTSNDASK SNNSSSNTAN INLFGAGGSI ALGDLSNNST PAPTLQHPQP TTNASGNAGN SLDLLGDLNS PSQQLQSNTS AQLDPFGLNF PSSTPSQVQN QLGQFSGSTI AISSSSVLKV EVSVAPNSSN QFFQGRALFS NVQAQTISNF KFLIAVPKSC KLDLRPQTGD TIYGFTNNSI SQDFTIENSL DKQLKIKWKA EYTIGGETKD ETGVSVLNN
|
| |
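The sequences above are displayed in space-separated blocks of ten characters, so they need to be rejoined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The function names are hypothetical, and the demonstration uses only the first two blocks of the gene sequence rather than the full record.

```python
def join_blocks(display: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(display.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two display blocks of the gene sequence, used here as a small demo input.
demo = join_blocks("CACCAGAGTA TCGTTCTAAA")

print(len(demo))                   # prints: 20
print(f"{gc_fraction(demo):.2f}")  # prints: 0.40
```

Applying the same two functions to the complete gene sequence should give a value close to the reported 42%, depending on how the database rounds.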