Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_67724 |
Symbol | |
ID | 4838430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 1786206 |
End bp | 1787431 |
Gene Length | 1226 bp |
Protein Length | 381 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640389745 |
Product | predicted protein |
Protein accession | XP_001384296 |
Protein GI | 150865184 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.579873 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AACATTGGCG ATGTTGGATG AAGGTGATCA CAGGACAGAA CTTCCTCGCG TATTATCTAG TGCAGATTAC CAGAGAGCTG GAAAGCACCA ATTTGGAGAT GGGTTTAAAT TCCTTCCACT ACAGGAAAGA GCAATCAGGG ATATGGCATC CTCTCACACG GTGGTGAAGA TTGACAAAGG CAAGACGGAA TGTTGCATTC TTCTAATGCT TGCTGACAAG ATCTACATCG ACACCTGGGG AATCAAAGAC CACTGTGTCA GTTTGTTGGT AGTGCCGTGT TGGTCTGAGC TCAACTCCAC TGTTGCTAGA ATTGAACAGG CAGGATTAAA GGTCCGCTAC ATCGAGGATG GGGTCAACGA TCCTCTAGAA TATGACGTTC TTGTCTTGCA ACAGACGGCA AATGATTTTG GGCAAATCCA TGAGCTTACC AAACAAATTC GGAAGAGCGG AGAATGCTCG GCACATGTTC GTCGCGTTAT CATAGAAGAT GCTCACTACT TGCAGATGTC ATCTATAAGC GATAATTCTA TCAAAGATAT ACCGTATGCT TTCATGTCGT CTTTCCTTGC TCCAGAAGCA ACATTGATGG CAAACATGAG CATTGACAGC TACAATGTAG TGGAAGACGA GGACAGACAA ATTCGACGCG TCGAAATGCA AGTTGACGTT AAGGAATCAG AGACTGATGT GACCAACAAT GTAATGCTGT ACCTCGGCAC TTTAGTACTG TACAATTCCT TGATTATTGT CGACAATGAT GCTAGAGTTG AACACTTGAT TGAATTTCTC GACGATCTTG TGCCGTGCAT AGGAATCGTA GGATCGTCAA CACAAGAAAG GAGAGAAAAA GCCAATGAGA TCAAAGAGGG ATTGATGGAT GAAAATGTGT GTGTAGTTGC TACTGCCAAG TCATTGGTAG GTCTTGACTT CGAAGTGGAG GAGGTCATAA TTGCGTACGC GGTCACGTCG GAGATCTCAT TGTTGCTTGC GACCAAGATG ACCGATAGCT TGGTCTATTT GTGCTTGGTA AAGGACAGAA ATAGTGACTA CCAACGAAAA TGTCTCTATA GCATCATGCA AGAATATTTG CTATTGAAGG CTTCAACTTG TGCCAACGAA GGTAGTAATT ACTGCTCCAA TTGTGAGGCT AGTTAGGTAT TTAGAAATAA AGTAGACAAA AAGAGGTCTT TAACATGATC TTCTTGCTGT CACATATTTG GATTTT
|
Protein sequence | MLDEGDHRTE LPRVLSSADY QRAGKHQFGD GFKFLPLQER AIRDMASSHT VVKIDKGKTE CCILLMLADK IYIDTWGIKD HCVSLLVVPC WSELNSTVAR IEQAGLKVRY IEDGVNDPLE YDVLVLQQTA NDFGQIHELT KQIRKSGECS AHVRRVIIED AHYLQMSSIS DNSIKDIPYA FMSSFLAPEA TLMANMSIDS YNVVEDEDRQ IRRVEMQVDV KESETDVTNN VMSYLGTLVS YNSLIIVDND ARVEHLIEFL DDLVPCIGIV GSSTQERREK ANEIKEGLMD ENVCVVATAK SLVGLDFEVE EVIIAYAVTS EISLLLATKM TDSLVYLCLV KDRNSDYQRK CLYSIMQEYL LLKASTCANE GSNYCSNCEA S
|
| |