Gene Information |
Locus tag | PICST_34653 |
Symbol | |
ID | 4851872 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 3059189 |
End bp | 3060559 |
Gene Length | 1371 bp |
Protein Length | 431 aa |
Translation table | |
GC content | 43% |
IMG OID | 640393580 |
Product | predicted protein |
Protein accession | XP_001386915 |
Protein GI | 126275870 |
COG category | |
COG ID | |
TIGRFAM ID | |
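The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene span holds only coding sequence, one stop codon, and any introns (no untranslated regions). The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3059189
end_bp = 3060559
protein_length_aa = 431

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the span contains no UTR and ends with a single stop codon.
# Coding bases needed = 3 * (residues + 1 stop codon).
coding_bp = 3 * (protein_length_aa + 1)

# Bases left over after accounting for the coding sequence.
# Under the assumptions above these would belong to intron(s),
# which fits the "Is gene spliced | Yes" field.
noncoding_bp = gene_length_bp - coding_bp

print("gene length (bp):", gene_length_bp)
print("coding bases (bp):", coding_bp)
print("non-coding bases in span (bp):", noncoding_bp)
```

Under these assumptions the computed span matches the reported gene length of 1371 bp, and the 75 bp not accounted for by the 431 aa protein would correspond to intronic sequence.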
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.236544 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTTCA GAAAGAGAAG TGACATCATC GGAGGCGGTT CATCTGCTGT TCCAGGCAGA GCCCCTCTGG GAATTGGACG AGCCCCCGTC GGTTCTGGAG CAATACCAGG CAGAACGCCT GCGCATCCTA GACTTTCAAT TAATTCTAGA ATACCAGTAA ATCCAAGAAC TCCGAATGCT TCTGTTTCTA GAGATTCTGG ATTTATAAAT GGAAGAACTC AGTCTCCACA GGTTGAAGTT GAAGAAATTT CTGCTTTGAA GAATCCTTCT GTTAGACCAT CCTTAATCTC ATCTCAGCCG ACAGTATCTA CTGGTACCGG TGATTTGGAC AAACTACTTC TTCACCAAGG ATTGCCATTA GGACATTCTT TGCTCGTTGA AGAGTCTGGA ACGACTGATT TCGCATCCGT AATATTACGA GCTTTTGTCT CTCAAGGAAT TATGCACAAC CGGATTAATA AAGACCAGAT TAACTCTCAT GTGATAGCTG TAGGGATTTC CACCCAATGG ACTGCAAACT TACCTGGTTT GTACAAGGGC TCCTCTAAAG ATCAAAAGAA AGCTAAGATC CTTGCCAATG AGTCTAAAGT CAGTGTTTCC AACTTGGCAA CGTCCACTGC TGGTGTGACT TCTAGAGTTG ACAATGACTT GAAAATCGCA TGGAGATATG GAGTGAATAG CAAGCAGAAA TCGGCATCTC CAGAACCTTT TGAAAACAGT GCATATGAAT ACTACATCAA CCAATTCGAT ATCACCCAGA AACTTGCCCC TGGTCCAAAT GCCCAAGATA TTTCGTTTGT TCCTGTAGGT CTTAGTCATA TTCAATTAAT CCAACAGATC CAGAGCATCA TCCAACGTCA TGTCAAGTTA AATCCAGCTA TTGTGATAAG AATCGCTATC CCTGGACTTC TTAATCCTAC AGGCTACAAT CCATTGAGCT CTTCGCCTAC ATTTTTATAT CCGTTTGTTC ACTCCTTGCG AGCCATACTT AGGCAATATA GCCAGAATGT GGTTCTTGTT GCATCGCTAT CTTCAGATCT CTATCCTCGA GATTCGAACG TAGCCCATGT ACTTGAATCG TTGGCCGATT CGTGCATTCA CCTTCAGCCA TTCAACCAAG AGATGACCCA GTTGATCGAA AGAGCCTACA AGAATGAACC ATCCAAGATC CAGCAGGGTC TTGTCAATAT CGTCAAGTTG CCTGTTCTCT CGGAGAAAGG AATGATGATG ATTCATGAAG GAGAATACGC ATTCAAGAAT GGAAGAAAGA AATTCGAAAT AGAAGAGTGG GGCATTCCAG TTGAGGACTC TGAAAAAGAG GAACATACTA CTGCCGAAGG TGGCACTACT AAAAAGAACC TCGACTTCTG A
|
Protein sequence | MSFRKRSDII GGGSSAVPGR APLGIGRAPV GSGAIPGRTP AHPRLSINSR IPVEVEEISA LKNPSVRPSL ISSQPTVSTG TGDLDKLLLH QGLPLGHSLL VEESGTTDFA SVILRAFVSQ GIMHNRINKD QINSHVIAVG ISTQWTANLP GLYKGSSKDQ KKAKILANES KVSVSNLATS TAGVTSRVDN DLKIAWRYGV NSKQKSASPE PFENSAYEYY INQFDITQKL APGPNAQDIS FVPVGLSHIQ LIQQIQSIIQ RHVKLNPAIV IRIAIPGLLN PTGYNPLSSS PTFLYPFVHS LRAILRQYSQ NVVLVASLSS DLYPRDSNVA HVLESLADSC IHLQPFNQEM TQLIERAYKN EPSKIQQGLV NIVKLPVLSE KGMMMIHEGE YAFKNGRKKF EIEEWGIPVE DSEKEEHTTA EGGTTKKNLD F