Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_42365 |
Symbol | |
ID | 4836996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 1882210 |
End bp | 1883349 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640388311 |
Product | predicted protein |
Protein accession | XP_001383131 |
Protein GI | 150864354 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5160] Protease, Ulp1 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.201544 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACATA TTGTCTTCGA CTCTGCCAAG GGCTTTTGTT ACGACAGTTC TGGTGCGGGC CCAGGTAGGC ATCTGGCTAC CAGATCCAAT GGATTTGATC AGCTACCTTG GGACGACTAC TTGCCTGATG AAGTAGACGA AAGTGAAGAA GAGACACCTG GTAATGACGA CGATTCCAGC ATTCACGCGA CTAGAAAGAA ACGGCTCACT AAACTGGAGA AAAAGGAACA GAGACAAATG AAGAAAATTA ATGCTGCTCG CAATAAGCAT ATCAAGAGAC AGCAAGAGCT ACTGGGTTCC GGAAAAGACA GGGACATGTC TGAACTCGAA ATCTTCAATC CATTCCTAGC CAAAAGCAGT ATAAAATCAA TACATAGTAA TATCCTCAGA ATGGCAGAAC AACCCAAATC AATCGATTTC AAGTTGTTCC AGTACCATTC TATCGCACTT TATAGCTCGG ATCTAGACCA TATTCTTCCT GGTGAGTGGC TCAATGACAA CAATATTTCA CTTATTTTCG AGCTTATTAA CCAGCTCTTC CTCAAGAGTC AAGATCCGGC TAAAAAATTC AACTACCAGG TCCAGATGTT GTACCCATCC TTGGTACAGC TATTTTTGCA TTTCCCAGTC ACCGATGACT TGGAAAATAT TCTTCCTATT AATGAATTGA AGCAGCTGAA GTTCATATTT ATACCGATCA ACTTCATTGA CGACTACGAA GACATTGATT TGGAAGATGT TAATAATGGC GATCACTGGG CACTTGCGCT TTTGTCGATT TTGGAGAATA GACTCTATTT GTACGACTCC ATGGCTATTG ATGGAGACGA ATTTGCATCG CAATCTGAGA CCAATTTGTT GAACGAATTG ATAAAGAGAT TGAAATCGTG TAAAAGCATA TTCAAGGCAG GCGACAAGAC CAAGATAGAT ATCATAAGGA TGAAGTGTGA CCAACAGGAT AACTTTGATG ACTGTGGAGT ATATCTCATT ATGATAGCAT GCTTTTTAGT AAAGCAACTA CTCTTCTCCG ATTCAGCGGA AGGGGCTGTA GACTTGGATA TTGGAAATGT CCGTTTCAAT GCATTAAGTG CAAGGCTCTA TATGATGAAA TTGATTCATA AACTATATAA ATCATTATAG
|
Protein sequence | MPHIVFDSAK GFCYDSSGAG PGRHSATRSN GFDQLPWDDY LPDEVDESEE ETPGNDDDSS IHATRKKRLT KSEKKEQRQM KKINAARNKH IKRQQELSGS GKDRDMSELE IFNPFLAKSS IKSIHSNILR MAEQPKSIDF KLFQYHSIAL YSSDLDHILP GEWLNDNNIS LIFELINQLF LKSQDPAKKF NYQVQMLYPS LVQLFLHFPV TDDLENILPI NELKQSKFIF IPINFIDDYE DIDLEDVNNG DHWALALLSI LENRLYLYDS MAIDGDEFAS QSETNLLNEL IKRLKSCKSI FKAGDKTKID IIRMKCDQQD NFDDCGVYLI MIACFLVKQL LFSDSAEGAV DLDIGNVRFN ALSARLYMMK LIHKLYKSL
|
| |