Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_35594 |
Symbol | |
ID | 4837953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 1190610 |
End bp | 1191866 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640389268 |
Product | predicted protein |
Protein accession | XP_001383855 |
Protein GI | 150864860 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000667051 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGGTC CTATTAAGAG CAAGAACGGT GACATTTCCA AGGCGCCAAC ACCGCAAAAC ACCCCTGCTT CGGTTACTAA TTCCTATTTG AGATCGCAGC CTCCTACCGT TTCCACGATT GAAGAAACCA ACGAGGAAAA TGTTGGTCAG CAACTCGCTA ACAATCCGGC ACTTTTGCTG ATGATCCAAG GCAAATTGGG GGATCTTGTC GGAGCACAGA GCGGATATAT TGACTCTTTG CCAAAGTCTG TCAAAAAGAG AGTCTGGGGT TTGAAGGCGA TCCAACAACA GCAGATGAAG TTAGAGGCTG AATTCCAGAA GGAGCTCTTG AGTTTGGAGA AGAAATACTT TAAGAAGTAT GAGCCTTTGT ATGCAAGAAG AAAAAAGATC ATCAATGGCG CTGAAGAGCC CACTACTGAG GAGATTGAAG AAGGTGAAGC ATTGGAGGAA AATGACGACG AAGATACCGA AGAAGCAAAG ATCCAGGAAT TGAAGGATTC CAAGGCAGAA GAAGACGATG AAGAAGAAGA AGATGACGAA GAAGCTGCTG CTGGTATTCC CGGCTTCTGG TTGACATCGT TGGAGAACTT ATCAACTGTA TCTGAGACCA TCACAGACAG AGATTCGGAA GTGTTGGAAC ACTTGATAGA CATCAGAATG GAGTACTTGG AAACCCCAGG CTTCGAATTG ATCTTTGAGT TCGAAGAGAA TGAATTCTTC TCTAACCAGA TCTTGACGAA AACTTACCAT TACCAGGCCG AACTCGGTTA CTCTGGAGAC TTTGTCTACG ATCATGCAGA TGGCTGTGAA ATTAACTGGA AGCTGAAGGA GAACAATGTT ACTATCAATA TCGAAAGAAG AAAGCAGAGA AACAAGAACA CCAAGCAGAC CAGAACCATC GAGAAGTTGA CTCCTACAGA ATCCTTCTTC AACTTCTTTG ATCCACCTAA GCCTCCTAAG AGGGATGAAG AAGATGATGA AGAAGAGAAG GACGATGAAG ACGAAGAAGA CGAGGAAGAC GAGGACTTGG ATGCCCGTTT GGAATTGGAC TACCAGTTGG GTGAAGAAAT TAAGGACCGT TTAATCCCCA GAGCCATTGA CTGGTTCACT GGAGATGCTG TTGAGTACAA CTTTCCAGAA GACTTTGACG GACAAGAAGG AGAAGAGTTG GACAGTGAAG AAGACGAAGA TGACGAGGAC GACAGCGAGG ACGAGGGCAA ACCAAAGGAA AACCCTCCAG AATGCAACCA ACAGTAA
|
Protein sequence | MSGPIKSKNG DISKAPTPQN TPASVTNSYL RSQPPTVSTI EETNEENVGQ QLANNPALLS MIQGKLGDLV GAQSGYIDSL PKSVKKRVWG LKAIQQQQMK LEAEFQKELL SLEKKYFKKY EPLYARRKKI INGAEEPTTE EIEEGEALEE NDDEDTEEAK IQELKDSKAE EDDEEEEDDE EAAAGIPGFW LTSLENLSTV SETITDRDSE VLEHLIDIRM EYLETPGFEL IFEFEENEFF SNQILTKTYH YQAELGYSGD FVYDHADGCE INWKSKENNV TINIERRKQR NKNTKQTRTI EKLTPTESFF NFFDPPKPPK RDEEDDEEEK DDEDEEDEED EDLDARLELD YQLGEEIKDR LIPRAIDWFT GDAVEYNFPE DFDGQEGEEL DSEEDEDDED DSEDEGKPKE NPPECNQQ
|
| |