Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32420 |
Symbol | |
ID | 4839436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 1477701 |
End bp | 1478702 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640390751 |
Product | predicted protein |
Protein accession | XP_001385285 |
Protein GI | 150865891 |
COG category | [R] General function prediction only |
COG ID | [COG0300] Short-chain dehydrogenases of various substrate specificities |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.303596 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCCCT TATTTGTTCT GTTGTTGTTC TATCACTTGA GCAAAAAGAT TCGTGATATC TCCAACCGAC TAGTCGGTAA CTACTTCGAG CCAGAAAAAG ACACCGTACT CATTACAGGG GGATGCTCCG GTTTAGGGAA AGAGCTCGTC AACACGTTTG CCGCTACTAG AGCCAAAGTA GTTGTACTAG ATATTGTCGT ACCAACTGAC GAAGAACAGC CTGAAAATGT TTACTACTAC AAGTGTGATG TTAGCGACAG AAAGCAAGTT CTCCAAGTCC ACAAAACCAT CAAAAAGGAG ATCGGCAATA TTACAGTGCT CATTAACAAC GCTGGAATCA CTACTGGCAA GCCCCTAGTC GATCTCAGCT ACCATGAAAT AGAGAAAACT ATACAGATAA ATTTAATGTC CAGTTTCTAC ACCATCAAGG TATTTCTTCC TTCCATGCTC AGATTGCACA GGGGGTACAT TGTTACTATA GCCTCTGTGT TAGGATACAT GTCCCCGGCT AGATTGAGTG CATACGGAGC ATCAAAATCG GGGCTCATTG CTCTTCACGA GTCTCTTACT TACGAATTGG GACCCCCATC GATGAACCCT ACGGGGGTGA AAACTTTACT AATCTGTCCC GGTCAGTTAA AAACCGCCAT GTTTTCGGGT GTAAATACTC CGTCCTCGTT ATTGGCCCCT GAGCTAGATC CCAAGTATGT CGCCTCTAGT GTTCTTTCAG CTCTAGAGCT TGGTCGCAGA GGCGAAATCA AGCTTCCTCT CTACGGAAAC TTCTTGCCGA TGTTCCGCGC ATTTCCATGG CCCATTGTTG AAGTAGCGAG AGCAATTTCA GGCATCGACC ACAGCATGAA TTCCTTTAAA AACACACTCA CAAAAGTGGC CAGCACTGTC TCTACATTAT CACAATCAGG ATCCAAGAAT AGCTCAGAGA AAGCCTCTTT ACTAGGAGAA GTAGATGTCA GCGAAGGGGT CAGTGAAATA GTAACGGCTT AG
|
Protein sequence | MGPLFVSLLF YHLSKKIRDI SNRLVGNYFE PEKDTVLITG GCSGLGKELV NTFAATRAKV VVLDIVVPTD EEQPENVYYY KCDVSDRKQV LQVHKTIKKE IGNITVLINN AGITTGKPLV DLSYHEIEKT IQINLMSSFY TIKVFLPSML RLHRGYIVTI ASVLGYMSPA RLSAYGASKS GLIALHESLT YELGPPSMNP TGVKTLLICP GQLKTAMFSG VNTPSSLLAP ELDPKYVASS VLSALELGRR GEIKLPLYGN FLPMFRAFPW PIVEVARAIS GIDHSMNSFK NTLTKVASTV STLSQSGSKN SSEKASLLGE VDVSEGVSEI VTA
|
| |