Gene Information |
Locus tag | PICST_57105 |
Symbol | |
ID | 4838166 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 55626 |
End bp | 56706 |
Gene Length | 1081 bp |
Protein Length | 339 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640389481 |
Product | predicted protein |
Protein accession | XP_001383289 |
Protein GI | 150864464 |
COG category | [S] Function unknown |
COG ID | [COG3332] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
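The length fields in the table above can be cross-checked against the coordinates. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive positions and that Protein Length does not count the stop codon. Neither assumption is stated in the record, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Coding nucleotides, assuming one stop codon not counted in protein_aa."""
    return (protein_aa + 1) * 3


gene_len = span_length(55626, 56706)  # Start bp and End bp from the record
cds_len = coding_length(339)          # Protein Length from the record
non_coding = gene_len - cds_len       # bases inside the gene span that are not translated

print(gene_len, cds_len, non_coding)  # prints: 1081 1020 61
```

Under these assumptions the computed span matches the reported Gene Length of 1081 bp, and the 61 bases left over are consistent with the "Is gene spliced | Yes" field, since a short intron would account for them.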
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.669873 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCATTT TGTTATCTAC CACCGAACAT CCAGACTATC CGTTCATCCT TTTGTCCAAC AGAGATGAGT ACTTCACGAG ACCTACTCAA GCTGCCCATT TTAGATCGTT TGATGGTACC ATGAAGATTT TATCTCCTCT TGATATGGCT AGACCTGAAC ATGGCACTTG GATAGGAGTA ACCACATCGG GAAAGGTTGC TGTTCTAGTC AATTATCGTG AGATAGATCA CGCTCGTATG TATTGATTCT ATTTATGATG GCAAATTTTG CGATATGTGG CACTAACAAT TTGTAGACCT GCTAAGTGAG GTATCCAGAG GCATATTACC ATTAGATTAT CTTTGCACGA ATAAACTGGC CGATAAATGG CATAGAACCC TCGAATCGTC GTTGAGCCAC GTTACCAGGG GCAAAGTAGA GTTGAGTCAG ATCGGCGGGT TTTCACTTGT GTATGGACAA TTGTCCATTG ATCCCAAGAC AGGCAAGCTT AACCACTTGA ATATACTAAG CAACAGAGGA GACCATGGCA AGATTCATGC ATCTGCGAAA GATAATTCCA ACGAAGAAAG AGAAGAAAAA GAAGAAGCAG ATGAAGAAGA AGAAGATGAT GATGAAGAGG ACGACTTGCA CGGTGATATA AGTAACAAGA CCACATTTGG CTTATCCAAT TCATTGTACT ACGAACCTTG GAAGAAGGTC AAGCTTGGTG AGGAGCTCTT ACATGAGTTG GTAGAGAAGT CTAAAGAAAT GAAGCTTTCG CAGGAAGCTC TAGTTTCTGA ATGTTTCAAG TTGCTTAGTC ATAATACTTA CGACAAGGAA GTAGCCAAGC AGAAGGATTT TTCCAAAAAG ATTACAGAAC TTAGAAACTC GATATACATC CCCCCATTAG AAACCTACAT TAGCCCCAGT GCCAGATTGC TTACTGCTGG AAAATACTAC GGTACAAGAA CCCAAACCAT CTTGTTGCTT GACAGGTTTG GGTACCTTAA CTACTATGAG AAAAACATCC ATAACAGCGA TGACGTCGAC GACGACAACA TTGTCTTCAA CCACTACAGG TTTAATATCG ATCAACAATA G
|
Protein sequence | MCILLSTTEH PDYPFILLSN RDEYFTRPTQ AAHFRSFDGT MKILSPLDMA RPEHGTWIGV TTSGKVAVLV NYREIDHAHS LSEVSRGILP LDYLCTNKSA DKWHRTLESS LSHVTRGKVE LSQIGGFSLV YGQLSIDPKT GKLNHLNILS NRGDHGKIHA SAKDNSNEER EEKEEADEEE EDDDEEDDLH GDISNKTTFG LSNSLYYEPW KKVKLGEELL HELVEKSKEM KLSQEALVSE CFKLLSHNTY DKEVAKQKDF SKKITELRNS IYIPPLETYI SPSARLLTAG KYYGTRTQTI LLLDRFGYLN YYEKNIHNSD DVDDDNIVFN HYRFNIDQQ
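The record lists Translation table 12, the alternative yeast nuclear code, which reads CTG as serine rather than leucine. The sketch below illustrates the effect on two fragments copied from the gene sequence above. The `translate` helper is hypothetical (not part of any database tool), and the reading frame of the second fragment is inferred from the protein sequence rather than stated in the record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
STANDARD = {"".join(c): aa
            for c, aa in zip(product(BASES, repeat=3), STANDARD_AA)}
# Table 12 as used here: identical except that CTG encodes serine.
TABLE_12 = {**STANDARD, "CTG": "S"}


def translate(dna: str, table: dict) -> str:
    """Translate DNA in frame 0, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence.
first_blocks = "ATGTGCATTT TGTTATCTAC CACCGAACAT"
print(translate(first_blocks, TABLE_12))  # MCILLSTTEH

# Fragment starting at a CTG codon (frame assumed from the protein sequence).
fragment = "CTGCTAAGTGAG GTATCCAGAG GC"
print(translate(fragment, TABLE_12))      # SLSEVSRG
print(translate(fragment, STANDARD))      # LLSEVSRG
```

The first output matches the opening ten residues of the protein sequence, and only the table 12 reading of the second fragment reproduces the `SLSEVSRG` stretch found in the protein. For real work an established library would be a safer choice than this hand-built table.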