Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_47344 |
Symbol | |
ID | 4839497 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 1304137 |
End bp | 1306044 |
Gene Length | 1908 bp |
Protein Length | 594 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640390812 |
Product | predicted protein |
Protein accession | XP_001384940 |
Protein GI | 150865634 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.649263 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTTCAGAGAG AGCAATGGAA GATTCTTAAA TCCACAATCC ACAGCCAAAT TTTCCGAGCA GATAGAAACA ATATCAAAGA AGTTATCAAG GAGTTGTTCC AGGTGAATCT TATCCGAGGA CAAGGCATTG TTGTAAGAGA AGTGATGAAG GCTGTTGATG ATCGATATGT AGCAGTATAT TCGAGTATTA TTGCAGTGCT AAACCTGAAG ATACCCGAAA TTGGCGAGCT TCTTGTCCAA CGATTAGTGC TTCAATTCAG AAAATACTAT AGGGCCAACA ATAAGCAATA TTCTCTTGGT TCGGCACTAT TCATTTGCTA TTTGATCAAC CAACAGGTGG TTAGTGAAAT CTTGATTCTC CAGATTGTGC AATTGCTCTT GGAGAGTCCT ACCGACGACG ACATAGACAT AGCCATAAAC TGCTTGATGG TAGTTGGAAA GTATTTAAGT ATAAACTCAG CGGTGGCGAA CAATATGATT TACTCTCGGT TGAGAGACAT TTTACATGAC AATAGGAACA TTAACGAGAA GAGCCGTGAA GCGATCCAGG ACCTTTTTCA AATACGCAAA ACAAACTTCA AACAGTATCA GATTGTAGAA AAGAAGTTAG ATTTGGTGGA AAATGAAGAT AAGGAGACCC ATATAATAGA GCTTGGAGAG GAGATACATT CGCGAAACGA ACTTAATATC TTTACAGTAG ATGAAGAGTA CGATGACAAC GAGAAAGAAT ACGATGAGAT ACGAAAAGAG ATTCTTGATG ACAAAAAAGA GGAAGAGGAA CTGAAGGAAG AGGAAAAAGA ATCAGAAGTC GTGAAATTAG ACATTGAAGA CATGACCCAA TCTGAACTAC TACAATACCA GAAAACAGTA TATTTAACAG TGATGTCTTC GCTTTCTTCA GACGAAGCAG TGCATAAGCT CTTGAAGCTT TCGTTCTCAA AGTCGAACTC CGATAAGCTC AATACGAACG AAATTCTTGC TGACATGATT GTGAAGTGTT GTTCACAGGA AAAAACTTAC TCCAAGTTCT TTGGGGTTAT AGGAGAGAAA CTCTGTTCTA GAAATAAAAC CTGGCAGTCC ATCTTTGTAC AATTGTTCAA GAAATACTAC GAGACGATAA ATCTGTTTGA AACGAATGCC TTGAGAAATA TAGGTAAGTT CTTTGGCCAT TTGTTCGCTT CAGACAAGTT AGCTATAGAT CAAGCATGGA ATGATATTAA GCTTACTGAA GAATCCACCA ATGCCGCTAG TAGAATCATG CTAAAGTTCA TTTTTCAAGA AATGATCGAA GAGTTGGGTA CCAATGAACT CAAGGAACGA CTTATTAATG ACGACTACAT AAGAGACAAA ATTACAGGCA TCTTTCCTGT TGTGGACGTG ACATGGAAAG ATGCTGAACA TCTCCGTTTT TCAATAAACT ACTTTACAGC CATTGGACTA GGAGTGCTAA CTGAAGATAT GAGAGACATC TTAAAGGATC TTCCTCCTCC GCCGGCTGTT GAGGAAGAAA GAGGAAGATC CAAATCTAGG GCGATGAGAT CACGTAGTGA ATCACGCAGC TATTCAAGAA GCGGAAGCTA CTCAAGAAGC TATTCAAGAA GCTACTCCAG AAGTAGAAGC AACTCTAGAA ATTCCAGAAG CTCTAGAAGT AGAAGCTACT CGAGATCTCG GTCCAGATCT CGTTCACCAT CTCGTTCTCG CAGCTATTCG CGGTCTCCGT CAAGGTCCCG TAGCTACTCT AGATCCCGCT CTCGCAGCGG CTCCAGAAGC ATCTCATACT CCAGATCTCC CAGCCCCGAT GCCAGAAATC ACTCGCGGTC ACGAACTCCA TCCAAAAACC GTTATCAGAC CAAAAGAGCC AACTCGAGGG AGCTGTCTCA TGTCCCTTAC AAACGGAACA AATACTGA
|
Protein sequence | FQREQWKILK STIHSQIFRA DRNNIKEVIK ELFQVNLIRG QGIVVREVMK AVDDRYVAVY SSIIAVLNSK IPEIGELLVQ RLVLQFRKYY RANNKQYSLG SALFICYLIN QQVVSEILIL QIVQLLLESP TDDDIDIAIN CLMVVGKYLS INSAVANNMI YSRLRDILHD NRNINEKSRE AIQDLFQIRK TNFKQYQIVE KKLDLVENED KETHIIELGE EIHSRNELNI FTVDEEYDDN EKEYDEIRKE ILDDKKEEEE SKEEEKESEV VKLDIEDMTQ SELLQYQKTV YLTVMSSLSS DEAVHKLLKL SFSKSNSDKL NTNEILADMI VKCCSQEKTY SKFFGVIGEK LCSRNKTWQS IFVQLFKKYY ETINSFETNA LRNIGKFFGH LFASDKLAID QAWNDIKLTE ESTNAASRIM LKFIFQEMIE ELGTNELKER LINDDYIRDK ITGIFPVVDV TWKDAEHLRF SINYFTAIGL GVLTEDMRDI LKDLPPPPAV EEERGRSKSR AMRSRSESRS YSRSGSYSRS YSRSYSRSRS NSRNSRSSRS RSYSRSRPDA RNHSRSRTPS KNRYQTKRAN SRESSHVPYK RNKY
|
| |