Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_73033 |
Symbol | |
ID | 4840227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 1514390 |
End bp | 1516205 |
Gene Length | 1816 bp |
Protein Length | 519 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640391542 |
Product | predicted protein |
Protein accession | XP_001385641 |
Protein GI | 150866147 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.365315 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TATGAGGACA CCCAGAGCCT CTATACTGTC ACGGCGCTAA TAGTGACACC CTTGTCGTTG TTTGCTCTCA AGAAAATTGT GCCTATCGCG ACCTCAAGAT CCCGGTCCCA ATCCAAAATC GATGGCTCCA AGATACTCCT GGGGATTTTG GTGGCTCTCA TCATAGTTCT TAACCTTAAT GTGTGGTACT ATCTTGTCAA CAGTATGATT GACTCTCATA ACATTTACAA GTCTGTCTCG GGTCCTTCAT TCAACTTGTT GAACTCGCCT ATTTCTGAAA AGATAGTCCT GTTGGATTAT TCTGACGTCA AAATCCCTGG ATATTCCTTT ATTGAAAAGG GCATCGACGA AATAGAAAAC TTCAAGTGTG GCGATGTTCG CTTCCACGAC GACCATGGAG TCAAGGCTTC CAAGCCACAT AGTCTTGACC AAAAGAGGGA TATGAAAATT ATCAGAGACA AGGCACTTAT AATAAACGGC ACGGGTGAAT ATCCCATTAT CAGAAAATGT TTTCTTGACA AAGCCTGGGA AAAAGAAGAT GTAATTGTCA AAAAGAAATG GTACAAGTTC GCTGGTTCTT CTACTTGGTT GGATAAATAT CAAGTGTTCT TTCTTGTCAG CAGAGTAGCC TACAGTCACC GTTCCCTTCG TAACAAGGCA ACTATCAGTA TCTTGTATGC CCAAGTTTTC AACAAAAATT GGGAAGAAAT ACTCGACTAC CAATTTCCAC AGTCTGATAT TGTGTTCCCT GCCATTCTTC CAGTTAATTT AGATGAAAAT CCTAGAGGAG ATAATGCCTT TTTGGGAGCC GATGATCCTC GTGTAATGTT GAGAAACTAT AACGATACCA CTAGTGGAAC ACAAGAACAA GAGCCAGTCA TAATCTTCAA TACCTATCGT GCCGATCTTG GCTGGAAGAG AGCTATACAT GTGTATCGTC CATTGACAAA TGTTAAAGAA GCCATTCCAA TGAGGTTAGT TGGCATGGAA CCCAGACAGA GAGAAAAGAA CTGGGCTCCT TTCTTTGACG AAGATGCTTC ATCCATTAAT TTCGTGTACA GTTTGAATCC TTTGCGTATT GTCAAATGCG ACTTCAACAA CGGAGCTTGT AACAAGATTT CCGGTGACGA TTTTGAAGAA GATGAAGCCA GACCCTTGAG AGGAGGAACC AACGTTGTCA GAATTCCAGC GTCTTTCCTT CCTAAACATC TTGCCGAAAA GAGAGAATAC TGGTTTGGAA TTGCTCGTTC ACATGACCAC AAATGTGGAT GTATCGAGAG GATTTATCGC CCTCACTCTT TTGTCATTTC CAAAGCCTAT AAAACTGACG ACTACACAAT GGACTACGTC AGTTCATTTG TCGACTTTAA TATCAATACT ATGGCATGGA ATCCGGCGTT AGAGAAAGCG AAGTGTACGG ACAGTAAGAG CGTATTAATT CCCAACTCCA TTGCATACTG GGACGTCATT ACTACAAAGG ACAAGAATGG CAAGGATCAG CTTGAAGACA TAATGGGTGT TACATACTCA GAAGCAGATA TTAACAACCG TCTTATCCAC GTTAAGGGGT TCTTGCAACA TGTCGCTAAG ATCTTCAGTG GTCAGAAAGA AACTGTCGTA AACCACTACG CCCAAGTTGA GACTGCCAGA GAGGAAAATA ATTTGTTAAG TAATTGTGCC ACCTCACTTG CCCAAGAATA CTGTAAGCTG GCTGAAAAGA AGTTCAAATG GGGCTACGAC AAAAACGGCA AAATGAGCAC ATGAATAGAA TAACTTTATA TCGAAGAATA CCGGGAAATA TACCATTTTT TAATCT
|
Protein sequence | MIDSHNIYKS VSGPSFNLLN SPISEKIVSL DYSDVKIPGY SFIEKGIDEI ENFKCGDVRF HDDHGVKASK PHSLDQKRDM KIIRDKALII NGTGEYPIIR KCFLDKAWEK EDVIVKKKWY KFAGSSTWLD KYQVFFLVSR VAYSHRSLRN KATISILYAQ VFNKNWEEIL DYQFPQSDIV FPAILPVNLD ENPRGDNAFL GADDPRVMLR NYNDTTSGTQ EQEPVIIFNT YRADLGWKRA IHVYRPLTNV KEAIPMRLVG MEPRQREKNW APFFDEDASS INFVYSLNPL RIVKCDFNNG ACNKISGDDF EEDEARPLRG GTNVVRIPAS FLPKHLAEKR EYWFGIARSH DHKCGCIERI YRPHSFVISK AYKTDDYTMD YVSSFVDFNI NTMAWNPALE KAKCTDSKSV LIPNSIAYWD VITTKDKNGK DQLEDIMGVT YSEADINNRL IHVKGFLQHV AKIFSGQKET VVNHYAQVET AREENNLLSN CATSLAQEYC KSAEKKFKWG YDKNGKMST
|
| |