Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_33568 |
Symbol | |
ID | 4840608 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 980471 |
End bp | 981727 |
Gene Length | 1257 bp |
Protein Length | 387 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640391923 |
Product | predicted protein |
Protein accession | XP_001386201 |
Protein GI | 150866559 |
COG category | [R] General function prediction only |
COG ID | [COG0724] RNA-binding proteins (RRM domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0922684 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAATA CTGACAAGAG CGCAAAGAGT GAGTCCAAGA AGGAGAAGCG TGAGCGTAAG CTCGAGAAAA AAACCAAAAA AAGATCTATT GAGGAAGAAG TTGACAAGGT AGCTGCAGAA GAGAAGAATG AAGAGTCTCA AGAGCCATCT ACCGAAGCTG AAAACACACA AGCCTCTGCT CCAAAGCATG CCGATTTCGA AGAGTTAGAA ATCGACTTAA GCGCAGGAGT TCCTCTTTCG AAGAAGCAGC TGCGTTTGTT AAAGAAAGGG AAGTTGGACT TGGAAAGACT TGCTAAGAAG CATCCAGTTC CCAAACCAGA GCTTACTGAA GAGGAAAAGC TTGCCCAGGA GGAAGAAGAT AAGAAGAAGT CCAAGAAGTC AGAGTTTGCA GTATGGATCG GAAACTTATC GTTCGATACT ACTAAAGAGG ACTTGGTTCG TTTTATTGTC GGAAAGACAG CTCACAATGG AGAAGACGAC TCTCAGTTAA TCAAGATCGA AGAAGCAGAT ATCACCAGAG TGAACTTACC CAAGAAGGAA AACAAAATCA AGGGATTTGC ATATATTGAT TTGCCCAGTG CCGTACATGT GACCAGTGTA GTTGCTTTGA GCGAATCTCC TTTGAACGGA AGAAAGTTGT TGATCAAGAA TGCCAACTCG TTTGAAGGTA GACCAGCTGC TGCTGTTGCT CCCTTGTCGA AAAATCCACC TTCTCGTATT CTTTTCGTAG GAAATTTGTC GTTTGACACT AGTGAAGATA ACTTGGAAGA ACACTTCCGT CACTGTGGTG AAATTGTTCG TATAAGAATG GCTACATTTG AAGATACCGG TAAGTGTAAG GGCTTCGCAT TCATTGACTT TAAAGACGAA ACCGGTCCTA CCGCTGCGTT GAAGTCGAAG TTGGCCAAGA AGTTGATCAA CAGACCGCTC AGATTAGAAT ATGGTGAAGA CAGATCTAAA AGAAACCCTA ATCATATCAG AAAGGCAGAA GTTCAAGAAG GAGAAGTTGA CGATTTTGCT CCTGTCAGAG AAAGACCTGC TGCAAGAGAA AGACCTGCTA GAGAAGCTCC AAATTTCGAG AATAGTAACT ACGAGAAACC ACAAAGAGCA TCATCAACTC CTAAGAAGAG AGTATTCAGA GACGATAATC ATAACCACAG CAATAAGAGA GTCAAGTCGT CGGTAGCTTT GGCCACAGCA CAGAGAGCCA GTGCCGCCAT TGTTCCATCT TCAGGTAAGA AGATCACATT TGACTAG
|
Protein sequence | MGNTDKSAKS ESKKEKRERK LEKKTKKRSI EEEVDKPSTE AENTQASAPK HADFEELEID LSAGVPLSKK QSRLLKKGKL DLERLAKKHP VPKPELTEEE KLAQEEEDKK KSKKSEFAVW IGNLSFDTTK EDLVRFIVGK TAHNGEDDSQ LIKIEEADIT RVNLPKKENK IKGFAYIDLP SAVHVTSVVA LSESPLNGRK LLIKNANSFE GRPAAAVAPL SKNPPSRILF VGNLSFDTSE DNLEEHFRHC GEIVRIRMAT FEDTGKCKGF AFIDFKDETG PTAALKSKLA KKLINRPLRL EYGEDRSKRN PNHIRKAEVQ EGEVDDFAPN SNYEKPQRAS STPKKRVFRD DNHNHSNKRV KSSVALATAQ RASAAIVPSS GKKITFD
|
| |