Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_85034 |
Symbol | |
ID | 4840737 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 535407 |
End bp | 536514 |
Gene Length | 1108 bp |
Protein Length | 361 aa |
Translation table | 12 |
GC content | 49% |
IMG OID | 640392052 |
Product | predicted protein |
Protein accession | XP_001386122 |
Protein GI | 150866496 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.737973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.53722 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AATGTCTGAT ACTAACATCG ATATCACTAT CAAGCTGTCT GGAGACACGA AGTACGAGTT GCTGGTGTCG CCTCTGCTCA CTGTATATGA TTTGAAGGAG CTTATCGCAG ATAAAGCCGA CATCCCTGCG GACAGACAGA GACTCATTTA TTCTGGAAAG GTGTTGAAGG ATACTGAAAC TATTGCTTCG TACAAGGTTC AGACTGGACA TACTATTCAT ATGGTGAGAT CTGCTGCACG AGCCACCGGA GCTCCAAGTG CTTCTAATGC TACTGGTACC TCCGGAAATA CAACTTCTGC AAGTGCTACT CCTTCTGGAA GTACTAATAT TACCGGCAAT TCTGCAGCTG GAACCGGAGT CCCTTCCAAT ATCGCTGCTG GACAAGGACT GTTTAACCCT CTTGCGGACT TAACGGGAGC GCGTTATGCT GGCTATGCTC AACTTCCCCT GGCTTCTATG TTTGGCCCAG ATGGAGGTAT GAATGCCATG CCGGATCCGG ATCAGTTGGC TCTGATGATG AACGATCCTA TGGTCCAACA GCAGCTCAAT GCCATGTTGC TGAATCCACA AATGATCGAC TACATGATCA ACCAGAACCC GCAATTGCGT GCTATGGGCC CTCAAGCTAG ACAGATGTTA CAGTCCCCTA TGTTCAGACA AATGATGACG AATCCGGAAA TGATGCGCAT GATGATGAGT ATGGGTCCAA TGATGGGAGG GGCAGGTCCC GGTGCTGGAC AAGGAGCTTC GGCATTTCCA GCTCCTGGGG CTAATCCCAA CGTAGCTGAT ACTTCTACAG ATTCTACTGC TAGTGCTGCT GATACTCCTA CTACTAACGC TACTGCTAAT AACGCTAACG CTGCCGCTGC TGCTAATCCG TTTACATCAT TGTTTCCTGG TGGAGTACCT CCTGTCGATC CGTTTGCGTT GTTTGGAGGT GGTGCACCTG CTCCCGTCGA TAACCGTCCT CCAGAGGAAA GATATGAGAG CCAGTTGAGG CAACTCAACG ATATGGGCTT CTTCGACTTC GACCGCAACG TTGAAGCTCT CAGACGAACA GGTGGCAGTG TCCAGGGTGC TATCGAGTAC TTGTTGTCGA ACAACTAG
|
Protein sequence | MSDTNIDITI KSSGDTKYEL SVSPSLTVYD LKELIADKAD IPADRQRLIY SGKVLKDTET IASYKVQTGH TIHMVRSAAR ATGAPSASNA TGTSGNTTSA SATPSGSTNI TGNSAAGTGV PSNIAAGQGS FNPLADLTGA RYAGYAQLPS ASMFGPDGGM NAMPDPDQLA SMMNDPMVQQ QLNAMLSNPQ MIDYMINQNP QLRAMGPQAR QMLQSPMFRQ MMTNPEMMRM MMSMGPGAGQ GASAFPAPGA NPNVADTSTD STASAADTPT TNATANNANA AAAANPFTSL FPGGVPPVDP FALFGGGAPA PVDNRPPEER YESQLRQLND MGFFDFDRNV EALRRTGGSV QGAIEYLLSN N
|
| |