Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_48462 |
Symbol | |
ID | 4840264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 1444892 |
End bp | 1446010 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 12 |
GC content | 50% |
IMG OID | 640391579 |
Product | predicted protein |
Protein accession | XP_001385974 |
Protein GI | 150866393 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.237219 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGCAC GGCCCCAGAT CCATTCCACT ATCCTTTTCA TTGGTCAGGT GCCCTACGAC TGGGACGAGG CTACCATGAA CTCTGTAGTA TGTGGCTGTG GAAAAGTAGT AGATGTGCGT TTGGGCTTTG AGTATCCTGG CCGTAACAAG GGCTACTGTT TTGTCGAGAT GCAAAATACG GCCGAAGCCG CACGAGCATA TAGTCTTCTT GGCCAGGTGC GGATCATGCA TCCTTCCAAC CGTGGCCAGG TGAAGAACTT GAGAATTGAA TCTTCCAAAG AGGGCTTCAG CAAGTCTACA GTCCGGTCGG AATCCAAGCC GGTAATGGCT CTAGACAGAC AGCACTTGCC CCCATATGTC CAGATACCTC CCGAAATGGC GGCCAATGGT CCACCAGTAG CGCCCAACAA CGTGATACAA ACCACGAACA TGAACCAAGT GAGCCAAAGT CAAAGTCAAA GTATGAACAT GAACCAGAAT CAAAACCAGA ACTACGCACC AGCCTCCGTC AATTCTGCTC CTGGCGAAAC ATCTATGCCG CACAAATATA CCCAGGCTTC GAAGATCTTG CCTCAGCCAG CACAGCTTCC CTTTGGAGTG CCAGACAAAA TCAACGACAC ACTCAGTAAA ATTCCCCCAG CACAACTTGT AGAACTTATA GCTTCTCTTA AGAACATGTT GAGCGGTCCA GATGCCCTGA GAGCATACGA GGTGTTTCTG TTGTCCCCGT ATTTGGCCAC TGCTGCTGCT CAAGCTCTTT TGCTAATGGG ATTTATAGAC GAAGAAGTGA TTAGCGATTC CATGAAATCT GCTTCAGGAA CTCCAGCACC ACAACAGCCA CCTCTTCCAC CTCCTCCGCC ACCTCAACAA CAGTACAATC CACAGCAGCT GTACCAGAAC ACTGGCTATA ATAACTCATA TCCTTCACAT TCTTCTACAC CGCAACCCCC TGCTCATTGG CGTGGATTGC CTCAGAAAGC TATAAGTAAG TTGATGGCGA TGCCGCAAGA CCAAGCTGAC TTGATAGCCC AGGTGTTAAC TCTTCCCCCG GACCAGATCG GCTCGTTGCC TCCAGATAAG CAGGCGATGG TGACGAGCTT GAGATCACAG TACTTGTAA
|
Protein sequence | MSARPQIHST ILFIGQVPYD WDEATMNSVV CGCGKVVDVR LGFEYPGRNK GYCFVEMQNT AEAARAYSLL GQVRIMHPSN RGQVKNLRIE SSKEGFSKST VRSESKPVMA LDRQHLPPYV QIPPEMAANG PPVAPNNVIQ TTNMNQVSQS QSQSMNMNQN QNQNYAPASV NSAPGETSMP HKYTQASKIL PQPAQLPFGV PDKINDTLSK IPPAQLVELI ASLKNMLSGP DASRAYEVFS LSPYLATAAA QALLLMGFID EEVISDSMKS ASGTPAPQQP PLPPPPPPQQ QYNPQQSYQN TGYNNSYPSH SSTPQPPAHW RGLPQKAISK LMAMPQDQAD LIAQVLTLPP DQIGSLPPDK QAMVTSLRSQ YL
|
| |