Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31932 |
Symbol | |
ID | 4839477 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 251157 |
End bp | 253076 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640390792 |
Product | predicted protein |
Protein accession | XP_001385065 |
Protein GI | 150865730 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTGGA CAAAGAGAGA CTTTCTAGAT CTATCAGCAG ATTCAACTTT TGGTTCCACA GCCTTCAATC TAGGCAAAGA CCCTCTCTTG CACCAATTGG AACAAGTAAG AGAATACAGT GAACAGGATC TTCTAGCCAA TATGATTTAC ATTTTTCTTC TCTTGGTGTT CGCAACGGTG GTACTAGGGG CTGTTTCTTT TGGAGTCAGA GAGTACTACT TGTCAATGAA GAGCAAGTTG GATACAGCAA CTAGCCCTTT TACGCCGCAG GCTGAGGATG AAATTTGGAA TGCTCGTAAT GACCAGATAC AGACAACACC AGCATTACCT GAATCAAACG CTCCTTCTTC CAATTTGAAA CAATTGCCCA TGGAGTCGGA AGAAAGTCAA AATCGACTTT CAGAGAACTC GTTGGCTTCA GGATTCACTC ATTTCACAGA GAACAGAGTC CAGCTGCAAA AAAATCACAC TATAGAAGAA ACAGAATCTA GTAACCAGGG CCTTTCCGGA TCTTTAGAAC AGGTGACGGA GCTTGAGAAA ATCCAAATCT TTCGTGATGA TATCGAATCT CCAGGCTTGA AACACTTTAT GTCTGCTGAT ATTAGCAAGG GAGAGAACTT ATTATCTGAA AAAGATTCGC CGCAAGAGCC ACTGCCACCT CCACCACCTC CGCCTACAAG AGTGAGAACC AAATCCAAGA CCAACAAAAA GACTACTTAT AGTATTAGCC ATCTTCTTAG TCTCAACAGT GACATCCTGA CCAAATACTT AACTCACAGA CTTAAGAGTC GGTCGATCAA GATGTCAAAG ATATTGCGGT ATTCGGCAGA TATTCCTGCC ACAGAGTTTG AAACTGCCTG GAATGCTAAC AAGAGTGTGA ATGATGCCAG AGCAAAGCTT GAGATGAACG AAGCTGGTGA TTCAGACTTC GAGACAGACT CAGACGCAGG CTCTGATTAT GGCGACAAGA ACAAGTGCAC AGGTTGCATA GCTGTTCTTC AGTTATTATC TATAGGAGTG TCTGATCCCA TTTTGGGCAA CCCCAACTAC TTCTTACCAG CCTTTAATGA TTTGATAGAG CATCTTATAG ACACTGGACA AGTATTCCAA GTGCTTGTCT ACAGCGTTCG TTCACATTCA CAGCTACTTC TTCTTGAAAT GATGTTGATC TATTGCTGGT CCCATTGGAA TTACCGTAGC AACGACTCGA GCGCTATTCA ACGCCAGTTT CAAAAGTGGA ACTTGGTAGA GCCTGTTGTT CGCAGCCGAG ACACCATATT CAGGTATCTC TGCAACCTTC AGTTAGGACT AGCTGCCAAT TATACTGAGA TTTCACTCTC GTCCACTCCA TCTTTCATTA ACGAAGGAGA GTTCCGTATC CACTTACTCT TGAGAGAAGC CGTCAAGTCC TTCTGCAACG ATTACATGCA ATTCATCCCC ATGATAGCGC AAGCTGTAGA TGTAGCACAG TTGCACAATA CAAAGGTGCT ATTCTACGAT CTCCTCCATG TTCTTCTCCA GAAGCACAGC CAATGTTGCA TGGCCGAGTA CTTACTGGAA GATAATACTG ATCTCAAGGC TTTGGTTCAA CACGGTCTTT GGCACAAACA CAACCACCGT TTGCACAAGT ACTCGTACCG AATTCTCCAC TTCTTGCTAG AACAAGGCGA CCACAATCTA GTAGAGCTGT GGGAAAGACA AGTAGAAGGA GGCAAGCTCT ACGTTCCTTC CGCTGGTGTT GTCGCTAATG CTACAGTCTC GGCCAATTCT CCGCAACAAC TGCTGGGAAT CAACATGTAT GCCACAGCAC TCCAGAAACC AATTACGCCT GGAGGTTTGA GCTACGGACT TCTTAGTGGG CTTGTCTATG GAACCCAGCG GCATTCGAGT ATGCGCCGTA AGCTTGCAAG AGAGTTGTGA
|
Protein sequence | MTWTKRDFLD LSADSTFGST AFNLGKDPLL HQLEQVREYS EQDLLANMIY IFLLLVFATV VLGAVSFGVR EYYLSMKSKL DTATSPFTPQ AEDEIWNARN DQIQTTPALP ESNAPSSNLK QLPMESEESQ NRLSENSLAS GFTHFTENRV QSQKNHTIEE TESSNQGLSG SLEQVTELEK IQIFRDDIES PGLKHFMSAD ISKGENLLSE KDSPQEPSPP PPPPPTRVRT KSKTNKKTTY SISHLLSLNS DISTKYLTHR LKSRSIKMSK ILRYSADIPA TEFETAWNAN KSVNDARAKL EMNEAGDSDF ETDSDAGSDY GDKNKCTGCI AVLQLLSIGV SDPILGNPNY FLPAFNDLIE HLIDTGQVFQ VLVYSVRSHS QLLLLEMMLI YCWSHWNYRS NDSSAIQRQF QKWNLVEPVV RSRDTIFRYL CNLQLGLAAN YTEISLSSTP SFINEGEFRI HLLLREAVKS FCNDYMQFIP MIAQAVDVAQ LHNTKVLFYD LLHVLLQKHS QCCMAEYLSE DNTDLKALVQ HGLWHKHNHR LHKYSYRILH FLLEQGDHNL VESWERQVEG GKLYVPSAGV VANATVSANS PQQSSGINMY ATALQKPITP GGLSYGLLSG LVYGTQRHSS MRRKLAREL
|
| |