Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_37533 |
Symbol | |
ID | 4851543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2092075 |
End bp | 2093532 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | |
GC content | 41% |
IMG OID | 640393251 |
Product | predicted protein |
Protein accession | XP_001388029 |
Protein GI | 126274801 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.143167 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGGGTA AAGGAGTTAC CGTATTAAAT GATCCACGAA ATCGTGTATC TTCTAGTTTT GATGCGAAAA CTGTCCCTAC ATCCGATCCA GAGGTCCGCG CAGAGCTCAG AAAGCTTGGA GAGCCTGTAA CTTATTTTGG AGAAGATGAT TCTGATCGTA GACAGAGACT TATAAAGCTT CTCTCCGAGA AGAAACATAC TAATTTTGAC TTTGACTATG AAATGGAAGA AGATGAATTG TTAGAAAACG AAGAAAGCGA TGAAAACGAG GATGACGAAG ACTTTTATAC ACCTGGAGAA CAGGAATTGT ACACAGCCAG AGAAGAAATA CTCCATTCTT CGTTGGAAAG GGCAAGGAAG AGAATAGAGA AACAACAAAA GAAAACAAAA GACCAGAGCT TTATCAAGTA TCTCAAACAT AGGCGACACA TTAATTCACA ACTAGCCAAA TACGAGCTTT TTGGCACTCA GCTTATACAA GGTAATACTA GAGCTCTATC AGCCGTGAGA TTCTCAAAAG ATAGTGAATT GATAGCGACA GGTTCGTGGG ATGGAGGAAT CTACATTTTA GATTCTGGAG ATTTATCAAC TAAGTTTAAG CTGGCATCTG GGTATCATTC AGAGAAAGTA AGTGCTATAG ACTGGGATGT CTATACAGAC AGCAATCTTC TTGTCTCAGG AGGTAATGAA GGAAACATAA ACTTCTGGAA AGTTAATAAG GAGTCAGAGA CGAAAGTAAT AAAGCCCGTC GTATCTATAA AGGCAGCCCA TGATAATCGT ATTACTAAGA CGTTGTTTCA TCCTAGTGGT AGATTCGTAA CCTCGACATC ATGTGACCAG ACATGGAAGC TTTGGGATGT CAATCGTCCC GAAAATGCAC TATTGCAACA AGAGGGTCAT TCTAAAGAAG TTTTTGCTGG GTCTTTTCAT CCTGACGGCA GTTTATTTGC CTCTGGAGGT TTTGATGCTA TTGGAAGAAT ATGGGATATG CGGTCAGGAA GATCAATAGT TACACTTGAA AGACATATAA AAGGTATTTA CAGTATGGAC TGGTCGCCAA ACGGGTACCA TTTGGCTACA GCTAGTGGAG ACTGCTCGGT GAAGATTTGG GATCTTCGAA AACTCCAACG AGACTTCAAG GAGATATTTT CAATTCCAGT GCATACGAAG CTCGTAAGTG ACGTGCGGTT TTTTAACAGG AGATCTGTGT CTAATGTACT TTCGACCGAA GTTGCAAATG AGAATGGAGA CAATCCTGAA GTTCTCGACT CCGATGGCTC TTTTCTCGTT ACCTCTTCTT TTGATGGACT TGTAAATATC TGGTCAGCTG ACAATTTCAT CAAGGTTAAG ACCCTTAGAG GACACAACGA CAAAGTGATG AGTTGTGATA TTAGTTGCGA TGGAAGTACA ATAGTATCGT CGGGATGGGA CAGAACCGTC AAGTTGTGGA AGAGCTAG
|
Protein sequence | MGGKGVTVLN DPRNRVSSSF DAKTVPTSDP EVRAELRKLG EPVTYFGEDD SDRRQRLIKL LSEKKHTNFD FDYEMEEDEL LENEESDENE DDEDFYTPGE QELYTAREEI LHSSLERARK RIEKQQKKTK DQSFIKYLKH RRHINSQLAK YELFGTQLIQ GNTRALSAVR FSKDSELIAT GSWDGGIYIL DSGDLSTKFK LASGYHSEKV SAIDWDVYTD SNLLVSGGNE GNINFWKVNK ESETKVIKPV VSIKAAHDNR ITKTLFHPSG RFVTSTSCDQ TWKLWDVNRP ENALLQQEGH SKEVFAGSFH PDGSLFASGG FDAIGRIWDM RSGRSIVTLE RHIKGIYSMD WSPNGYHLAT ASGDCSVKIW DLRKLQRDFK EIFSIPVHTK LVSDVRFFNR RSVSNVLSTE VANENGDNPE VLDSDGSFLV TSSFDGLVNI WSADNFIKVK TLRGHNDKVM SCDISCDGST IVSSGWDRTV KLWKS
|
| |