Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_52988 |
Symbol | |
ID | 4851625 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2373750 |
End bp | 2375300 |
Gene Length | 1551 bp |
Protein Length | 512 aa |
Translation table | |
GC content | 37% |
IMG OID | 640393333 |
Product | predicted protein |
Protein accession | XP_001386794 |
Protein GI | 126275080 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0230627 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0756506 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGCAGC AGTTTGATCG GATCCGTATC GACAATGAGC TTGCCACTAT CAAGTTTATA GGGGCTCTTC CTGCTTGGGG CCCGACAACT ACGGCTTTTG GAATAGAATG GGACAGACCC GAAAGAGGCA AGAACAATGG AGAATTGAAT GGAATTTCCT ATTTCAAGAC AGATATTACA GGAGCAGGCT CGTTTATAAA GTCGTCGAAT AAGAAAATCG AATTGAACAG ACAGACTTTT GTCCAGCAGC TTCTTTCCAA TTATGCAGTG GATTCGTATA CTGACCAGAG ACTCCATTTT GGATCCAAGA GAGTCGAAGA GTATGGCTTA GATAAGCTTA ACAAGATCCA TGCTAACTTC CTCAACTTGA CGTCAGTGAC TTTGGATCAT AAATTGATAT ACATGGGCTA CGATGATGAC GAAAAAGATA TTGTTGATAT CTTTTCTAAA CTTGCAAACT TGGCATATTT GGATCTCGGC TTTAATTTGA TCAATGATTT ATCGATTGTT TGGGGTATTA TAGACAGAAT ACCGCTGCTA ACGAAGCTTA TTCTCAACGG AAATCGTTTC TTTGATCTTT CTAAATCGGT AATTATTCCA CATAACTTAC AGAGTCTACA TCTTTCGTCT ACAAATATCA ATGCTTCTCA AATTGCTGAA GGTGTAACAG CTAAATTTCC AAATCTCCAA GAGCTTTATT TGTCTGGGAA CAATTACCAG GACGAAGATG TAGCAAATTT ATGTCTTGAA GATACCTACT TGGATGTCTT GGACTTGTCA CTCAATGCTA TCAGTGTTAT TCCAACGAAT TTGAAGCATA TACGAAGCTT AATACTCTCA GACAATCTTA TAAGAGCTAT TTCTCCAGAT TGCAAAATGG AGGAACTCAA ATCGATTGAC TTAAGACGAA ATCAAATTCA GTCTTTAGAT TTTATTGACA CGCTTTACTT GAACCTTCCT CGCATCTCTG AGTTGAGGAT CAACAATAAT CCCGTATTCG AGAAAATGGG TGTAGAAGAA ATGACTATAC AGTTGATAGC AAGATTTGAA TGTGACGATC ACAAGAGAAG TTCCACCAAG TTATTCAAAT TAAACGGAAG TTTACTTAAT GAAGATGAGA TCAGCAATGC TGAATTATAC TTTATCTCGA AAGTGAAACA AAATGAAGTA AGTTTCAAGA ACGAGAAAAG GTGGAAGAAA CTTGTAGCCA AACATGACAT TGCAGAACAT TTCACTGATT CTCCAAAAAG CAAAAGAACT ACACTTCTGA TGATAGGCAC TCTGCGACTA TTACTTCAAG TGCAAGTAGA GGACAAGATT ATTATTTCGA GATATTTCCT AAATACTTTT ACTGTGTTGC GATTGAAGGG TTTGATATCA AAGCAATTGA ATAATATCTC AGTCCGAAAA TTGCGACTTC ACTACTATGT AAACGAATTT GATAAAGATT CAACGTTGAA GGTCAAGTTC GACATCGACG ACGACATTTC CATTCTCGAT AATTTTGGTT TTCATGAAAA CCAGACCATT TACACTACCA TAGAGCCATA G
|
Protein sequence | MLQQFDRIRI DNELATIKFI GALPAWGPTT TAFGIEWDRP ERGKNNGELN GISYFKTDIT GAGSFIKSSN KKIELNRQTF VQQLLSNYAV DSYTDQRLHF GSKRVEEYGL DKLNKIHANF LNLTSVTLDH KLIYMGYDDD EKDIVDIFSK LANLAYLDLG FNLINDLSIV WGIIDRIPLL TKLILNGNRF FDLSKSVIIP HNLQSLHLSS TNINASQIAE GVTAKFPNLQ ELYLSGNNYQ DEDVANLCLE DTYLDVLDLS LNAISVIPTN LKHIRSLILS DNLIRAISPD CKMEELKSID LRRNQIQSLD FIDTLYLNLP RISELRINNN PVFEKMGVEE MTIQLIARFE CDDHKRSSTK LFKLNGSLLN EDEISNAELY FISKVKQNEV SFKNEKRWKK LVAKHDIAEH FTDSPKSKRT TLLMIGTLRL LLQDKIIISR YFLNTFTVLR LKGLISKQLN NISVRKLRLH YYVNEFDKDS TLKVKFDIDD DISILDNFGF HENQTIYTTI EP
|
| |