Gene Information |
Locus tag | PICST_52835 |
Symbol | |
ID | 4851483 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 1945283 |
End bp | 1946653 |
Gene Length | 1371 bp |
Protein Length | 429 aa |
Translation table | |
GC content | 43% |
IMG OID | 640393191 |
Product | predicted protein |
Protein accession | XP_001387999 |
Protein GI | 126274611 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATCATGTTAC AGTACAAGAA ATGGCAAACA GAAGATGTAA TTAACTCGTA CTTTGACGAT CAACAGAAAT TCTACGAGAG CTGTGGATTA CCATTCGGAA AGCCTAGTAA AAATACTTTT GCAATAAAAC AGAATTACTA CTGTGTAATT TGCTGTGAGA CCCGTGTCTC CACCCCGGTA TATTCGTTGA CATGTGGCCA TGAGTTTTGT ATCAATTGCT ACTACCACTA CATCAATAAT GAGATCAGCA ACAGTAAACT CATAACCTGT ATAATTCCGG AGTGTCCCTA TACAATTCCA CATAGAGACA TCGACGAGAT AATTCTCGTT GTAGAGCTGG CAAATTCGGT CAAGGTGCGC AAGGCTCTCA GCTCGAACCC GCTCTTGATT GCCACAGCCA AGGTCTACAT CGATTCACAC GAAAACTTCA AGTGGTGCCC GGCCACTGAT TGTACACATT TCACAGAAAT CGTGTCGCCA CGAAGGCTAG AGGAAGATGA AGTGGAAAAT AAGGATAAAA AGCCCATCGA CATCTCCATA GTACCCATTG TAGGATGTGC GGACCATCAC GAGTTTTGCT TTGAGTGTAA TTACGAGAAT CATTTGCCGT GTCCCTGTTG GATTGTACGC TTGTGGATCA AGAAGTGCGA AGACGACTCG GAGACCGCCA ACTGGATAGA CGCAAATACC AATGCCTGTC CCAAATGTCA GGCCTCCATC GAAAAAAACG GAGGGTGTAA CCATATGACA TGTCGAAAAT GTCAATTCAA CTTCTGCTGG ATCTGTTTGG GAGACTGGAA GGACCACAAC AATAGCTACT ACTCGTGTAA CAAATTCAAG CCAGACAGCG AAGATTCAGA GGTGGCTAAT CGTAAAATCA AGAGTAAAGT CTCGTTGCAA AGATATCTTC ATTTCTATAA GAGATTCTCA ATTCACGAAA GCTCAATGCA AGGAGACCAA TCTACACTCT CTAAGTTGCA TGACCTAACC ATGTTATACA TGGAGAACAG AAAAGAACAT GAGACGAACT TGTCTTGGAC AGACATCCAG TTCTTGCCAG ATGCCTTCAT AGCCCTTGCG AATGGACGTA AGACGTTGAA GTGGACATAT TGCTTTGCTT ATTATTTGGC AGATTCCAAT TTCTCTGAGA TCTTTGAAAG TAACCAGGAC TATTTGAACA AGACTGTAGA GGACTTATCA GGAATCTTTC AGAACATGTT GGACAAACAC AACAAGAACA AGGTGGCGTC CATCTTGAAG CATCGTAGTC AAATCATCAA CTTGAGCGAG TTGATCACTT CCAGGAGAAA AATGTTGATC AGCGGAGCAG AAATGAACTT GAAAGAGCAC TTGCTTCGGT TTGAAGCGTG A
|
Protein sequence | IMLQYKKWQT EDVINSYFDD QQKFYESCGL PFGKPSKNTF AIKQNYYCVI CCETRVSTPV YSLTCGHEFC INCYYHYINN EISNSKLITC IIPECPYTIP HRDIDEIILV VELANSVKVR KALSSNPLLI ATAKVYIDSH ENFKWCPATD CTHFTEIDKK PIDISIVPIV GCADHHEFCF ECNYENHLPC PCWIVRLWIK KCEDDSETAN WIDANTNACP KCQASIEKNG GCNHMTCRKC QFNFCWICLG DWKDHNNSYY SCNKFKPDSE DSEVANRKIK SKVSLQRYLH FYKRFSIHES SMQGDQSTLS KLHDLTIKEH ETNLSWTDIQ FLPDAFIALA NGRKTLKWTY CFAYYLADSN FSEIFESNQD YLNKTVEDLS GIFQNMLDKH NKNKVAQIIN LSELITSRRK MLISGAEMNL KEHLLRFEA
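The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive (which agrees with the reported Start bp, End bp, and Gene Length), and GC content is assumed to be the plain share of G and C bases, which may differ from how the database computes or rounds the value it displays.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def min_cds_length(protein_len_aa: int) -> int:
    """Coding bases needed for a protein of this length plus one stop codon."""
    return (protein_len_aa + 1) * 3


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        return 0.0
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
length = gene_length(1945283, 1946653)   # matches the reported Gene Length of 1371 bp
coding = min_cds_length(429)             # 1290 bp of coding sequence for 429 aa
# A spliced gene is expected to span more bases than its coding sequence needs.
has_noncoding_bases = length > coding
```

Here the gene span (1371 bp) exceeds the 1290 bp needed to encode 429 amino acids plus a stop codon, which is consistent with the "Is gene spliced | Yes" field. Passing the Gene sequence string to gc_percent gives a value to compare against the reported GC content.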