Gene Information |
Locus tag | PICST_40400 |
Symbol | |
ID | 4837597 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 2654561 |
End bp | 2655835 |
Gene Length | 1275 bp |
Protein Length | 371 aa |
Translation table | 12 |
GC content | 48% |
IMG OID | 640388912 |
Product | predicted protein |
Protein accession | XP_001383259 |
Protein GI | 150864444 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGACA CTGAAGTGCT CGACTACGAA AGACAGGAAT CCGATGCCAC AGGTGACGGC CAGCGCGAGG GTGCGGCTGA TATGGAAGTG GAAGTAGAAG CTGAAACGGA GCCAGAAGCT TCTAAACAGA CAACAGTTGA TCACCATGGG TTGCAAACTT CACAGACACT CGAGCCTCAG GAAGAAGTCA AGTACGAAGT GGTGGATGAA CCCAAGTTCC AGGTCACGAG ATCGATTTTC ATTGGCAATT TGCGTCGTCC GCTCAATGCG ATGCATTTCC AGAACTTCTT GAAAGAGCTA GCCAAAGAGG CTGGAGACTA CATCGTGGAA AGAGCCTGGT TGAATAGAAC GAGAACCCAC GGAATTGTTC TTGTAGACAA AGAAGAGGGA GCCAAGTTTC TCCGTGAGAA ACTTCTCGGT ACTATCTACC CACTGGAAGA AGACGACTTC AAGTTGAAGG AAGAATATGA GATTAGAGAA CAGGAACGGT ATGAACAGCA GAAGCTTCAA TATGAAGATG AAATGGAAAA GCTTGATACT GAAGAAGCCA AAGCAGCATT GGAACCTCCA TTGGAGCCTA GAAAATATTC AGTAGAGAGA CATCCTCTCT TTGTGGACTA TATTCCTGTC AAGGCTATCA ACCAATGGAT CTATGAAGAA GATAGAGGAC CCAGAAATGG CAAGTGGAAG ATCGACTACG AGACGAAGGA TGATGAAGTA GTTGCCAGCC ACAGTCTCTT ATCAGGTGAC TTTGTCCCAC GCTACCAAAG AGGTAGAGAC CGCCGTGGCC GTGGCCGTGG AGAAGGTAGA TACAGAGGCT ACAGAGGAGG AGACAGATAC GGTGGGGATC GTTATGAAAG AGACAGATAC GTAGCTGATG ATAGAGATAG AGAAAGAGAC AACAGGGACA GAGACAGGTA CGGTGAAAGA GAAAGATACG GTGGTGACAG AGACAGGTAT GGAGACAGAG ACAGGTACAC GGAGAGAGAC AGATACAGAG GTGGAAACGA TTACAATGGA CACAATGACT ATCCTCCTCC AAGAAGAGGT TACAGGGGAG ACCGTTACGA TCGCGACGGT CCCAGACCAT ATAACGCGGT GCCTCCACCT CGTCAAGACA GCTACTATCC CAGACGTGGC CGGGACAGAG ATGCATATAT TCCTGGTGAC AGGGTGGTAG GCTCTCGTAC CGACACTTAT GAGCCCAAAT ATCGTGACAG ATCTGATAGC AGACAGAGAA GAAACCGTTC AAGATCGAGA TCCAGATCGC CATAA
|
Protein sequence | MSDTEVLDYE RQESDATEAE TEPEASKQTT VDHHGLQTSQ TLEPQEEVKY EVVDEPKFQV TRSIFIGNLR RPLNAMHFQN FLKELAKEAG DYIVERAWLN RTRTHGIVLV DKEEGAKFLR EKLLGTIYPS EEDDFKLKEE YEIREQERYE QQKLQYEDEM EKLDTEEAKA ALEPPLEPRK YSVERHPLFV DYIPVKAINQ WIYEEDRGPR NGKWKIDYET KDDEVVASHS LLSGDFVPRY QRGRDRRGRG RGEGRYRGYR GGDRYGGDRY ERDRYRDRYR GGNDYNGHND YPPPRRGYRG DRYDRDGPRP YNAVPPPRQD SYYPRRGRDR DAYIPGDRVV GSRTDTYEPK YRDRSDSRQR RNRSRSRSRS P
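
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates that difference on a short stretch copied from the gene sequence above (`ATCTACC CACTGGAAGA A`), which appears to line up with `IYPSEE` in the listed protein. The helper names `build_table` and `translate` are hypothetical, the 64-letter strings follow the compact NCBI genetic-code layout (bases in T, C, A, G order), and no intron handling is attempted even though the gene is marked as spliced.

```python
# Minimal sketch: translate a DNA fragment under genetic code 1 or 12.
# Helper names are illustrative only; they do not come from the record.

BASES = "TCAG"  # base order used by the compact NCBI code tables

# Amino acids for the 64 codons, first base varying slowest (TTT, TTC, TTA, ...).
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Table 12 differs from the standard code at one codon: CTG -> S instead of L.
ALT_YEAST = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(amino_acids):
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, amino_acids))


TABLES = {1: build_table(STANDARD), 12: build_table(ALT_YEAST)}


def translate(dna, table=12):
    """Translate DNA in frame 0; spaces are ignored, trailing bases dropped.

    Raises KeyError for codons with ambiguous bases such as N.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(TABLES[table][dna[i:i + 3]] for i in range(0, usable, 3))


fragment = "ATCTACC CACTGGAAGA A"  # copied from the gene sequence above
print(translate(fragment, table=12))  # IYPSEE
print(translate(fragment, table=1))   # IYPLEE
```

Translating the full gene sequence this way would not reproduce the listed protein directly, because the sequence shown still includes intronic bases; the exon boundaries would be needed first.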