Gene Information |
Locus tag | PICST_33476 |
Symbol | |
ID | 4840790 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 735541 |
End bp | 737679 |
Gene Length | 2139 bp |
Protein Length | 530 aa |
Translation table | 12 |
GC content | 37% |
IMG OID | 640392105 |
Product | predicted protein |
Protein accession | XP_001386153 |
Protein GI | 150866518 |
COG category | [A] RNA processing and modification |
COG ID | [COG5188] Splicing factor 3a, subunit 3 |
TIGRFAM ID | |
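The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the reported span covers only coding exons, introns, and the stop codon; the record does not state either convention, and the function and variable names are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
GENE_START, GENE_END = 735541, 737679
PROTEIN_AA = 530

gene_bp = span_length(GENE_START, GENE_END)
# Coding nucleotides: one codon per residue plus one stop codon (assumption).
coding_bp = (PROTEIN_AA + 1) * 3
# Whatever is left over would be non-coding (intronic) under the assumptions above.
noncoding_bp = gene_bp - coding_bp

print(gene_bp, coding_bp, noncoding_bp)  # 2139 1593 546
```

Under these assumptions the span matches the reported Gene Length of 2139 bp, and the 546 bp not accounted for by the protein is consistent with "Is gene spliced | Yes".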
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.50882 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.174506 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAATGT TGCAACGTAA TGTTGGTGTG TCTTGTTCTT TATGAATCTC ACAGGCGATA TCAGCAACAG GATGGTGTAA ATATATACGA GATACCGTTG ATCTGAGAAT GTTAGGCTTG ATGTCAATGA CTTGTGAAAG GCTGATAATG AGATGAGGAA TGAAAATCCA ATAAATCTTC CTATGATTTT GTATCAATTG CTGTACAAAG GGGCCGTAAT TGACAAAAAG ACTCCATATA TAGATTTCCT ATACTGAATC AACTATTTGT ATAGTTCAAT TATACTTGTG ATTTCAAAAT TATAGTACTT GAAACTTGGG GATCGAGTGC TTATTTACAG TTATAGAATA TAGCGGTGCG ATTGAACAAG TAGACAGAAT TAGCGAGAGA TAAAAAAAAT AGGGCGGTGC GGCTATTAAA TCAGATGACT TCAATCACGC AGCTTTCTGC ATCTATATCG AGATGCCCCA GACATGTCTT ACCCATAATC TCCAACTTTA TAGCAGACCA TCCGGTTAAA TACTTTAAGA ATAGTAATAC ACTATAAGAA TGTGGCTGTT TCTTGAACTG CAGCGGTCGA TTCTTGAAGA GCTCGATGTA ATCGAACTTG AATCCTCGCG TAGGTTCAGA AAAGATCCTC TATTGTATCC ACAGAATGAG AATAAAGAAA AGCTACAGAT TGTCAGAACA AAGAGACCAC AGAAAGAAGT CAAGTTGCAG CAGCATGAAT TGGCTGTTTT CCAACAGAAA TACAAGAAAC ATTGCAATTC ACTTAGAAAC CATACGGCTA ACGATAGCGA CATAATACAA TCAATCCTTG GTACCCTAGA TGATTCCAAA GCTACGTTTT CCAATTTTGA TTCTGCTCTA GCCCAAATTC AAGAGAAGCA TAACAAGACC AATAATGGTG AAATTGAAGT GGCGGAGAGC ATACGCAATA TGTATACTAT GTTTTCCAGT ATTCTCTTTT CTGGGGAAGA ATCTGTATTG CTAGATGATG ATATAAAGAG AGTAAGAAAA GAAGGCAAAG AAAAGACAAA AGTCAAGAGA AAGCACATCA TCAGCATCAC TGCATCCCAT CTTGATCCAG ATGGAATATA CTCTACTGAA GAAGTATATG GAAAATATTT GGATTTGACA AAATTCCACG AGATCTACAG AAATCAGACT TCCAGCAACG TCTCGTATTT GGAGTACTTG AAAGTGTTCG ATATCTTTCC ATACGCAGAA AGCTTCCGCA GTTCAAGCAT ATATCTACAG TACTTGAGAG ATCTAAGTGA ATATTTGGTA GACTTCGTAC TGAGAACAGA GCCATTGCAG AACTTCAATG AAGTGTTTGA ATCTATTAAG AAATCATATT CTCCTAAAGA AGAACCAGCA ACTAGAGATG GAGTAGAAAA TGAACTGGGT GAAGTGTATT GTAGTGTTTG TCAAAAGGTC TTTGCAAAGA TATCAGTTTA TCAAGGCCAT TTAAATGGTA AGAAACACAA GAAGAATGCT AAAGAATTGC AAACTGCAAC ACCAAAAGAG TCAATCATTT CTGAGAGTGA CTTGCAAGAG CATATTAATA CAGAATTGGG TAAGTTCCTC TCCAACTATA AAGAGGCAAC TATACAGAAT ACAGAGAGAA AATCAGCCAT GACTGAAAGA GAAAGATTGA TAGAGAATAC CACGATCGTC GGAGACGAAT CTGACTATAC GACTGTATAT GATTCAAGTT CAGATAGTGG AAATGATTCC AGCGATGAAG AAGAAAACGA GAACTTAAAA CACCTACCTT TAGGAGCCGA TGGAAAGCCA 
ATTCCATTTT GGCTCTATAA ACTTCAGGGG TTGCACAAGA CTTACAACTG TGAAATATGT GGCAATGTTA CATACAAAGG CAGAGTCACT TTTGAAAAAC ACTTTAGTGC ACCTAAGCAT CAATATGGCT TGAAATGTCT TGGAATAACT GAGCAATTTG TGTCCTACTT TAAGGATATA ATACTGATTA ACGAAGCACA AGATCTCTGG AAAAGATTGA AAAGAGATAA AAGAATCAAG GAAGGAGACA TCGAGAATGC TGTGGAAGTC GAAGACGCCG AAGGTAATGT CATGTCAGAG AAGGATTACC TTGATTTGAA GAAACAAGGT CTATTATAG
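The GC content field (37%) can in principle be recomputed from the gene sequence once the spaces between the 10-base groups are removed. The following is a minimal sketch of that calculation; `gc_content` is a hypothetical helper, and it is demonstrated only on the first two 10-base groups of the sequence rather than on the full gene.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide string, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base groups of the gene sequence in this record.
first_groups = "ATGGAAATGT TGCAACGTAA"
print(round(gc_content(first_groups), 2))  # 0.35
```

Passing the complete gene sequence to the same function gives the overall fraction for comparison with the reported value.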
Protein sequence | MEMLQRNVAR SILEELDVIE LESSRRFRKD PLLYPQNENK EKLQIVRTKR PQKEVKLQQH ELAVFQQKYK KHCNSLRNHT ANDSDIIQSI LGTLDDSKAT FSNFDSALAQ IQEKHNKTNN GEIEVAESIR NMYTMFSSIL FSGEESVLLD DDIKRVRKEG KEKTKVKRKH IISITASHLD PDGIYSTEEV YGKYLDLTKF HEIYRNQTSS NVSYLEYLKV FDIFPYAESF RSSSIYLQYL RDLSEYLVDF VSRTEPLQNF NEVFESIKKS YSPKEEPATR DGVENESGEV YCSVCQKVFA KISVYQGHLN GKKHKKNAKE LQTATPKESI ISESDLQEHI NTELGKFLSN YKEATIQNTE RKSAMTERER LIENTTIVGD ESDYTTVYDS SSDSGNDSSD EEENENLKHL PLGADGKPIP FWLYKLQGLH KTYNCEICGN VTYKGRVTFE KHFSAPKHQY GLKCLGITEQ FVSYFKDIIS INEAQDLWKR LKRDKRIKEG DIENAVEVED AEGNVMSEKD YLDLKKQGLL
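The record lists translation table 12 (the NCBI alternative yeast nuclear code), which differs from the standard code in reading CTG as serine rather than leucine. The sketch below builds that table with the standard library only and translates two short stretches copied from the gene sequence above: the first 24 bases and the last 21 bases. The function names are hypothetical, and the snippet does not attempt to locate intron boundaries.

```python
from itertools import product

# Amino acids in NCBI codon order (bases T, C, A, G; first base varies slowest).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table_12() -> dict:
    """Codon table 12: the standard code with CTG reassigned to serine."""
    codons = ["".join(c) for c in product("TCAG", repeat=3)]
    table = dict(zip(codons, STANDARD_AAS))
    table["CTG"] = "S"
    return table


def translate(dna: str, table: dict) -> str:
    """Translate in frame 0; whitespace is ignored and a trailing partial codon is dropped."""
    bases = "".join(dna.split()).upper()
    usable = len(bases) - len(bases) % 3
    return "".join(table[bases[i:i + 3]] for i in range(0, usable, 3))


TABLE_12 = build_table_12()

print(translate("ATGGAAATGT TGCAACGTAA TGTT", TABLE_12))  # MEMLQRNV
print(translate("AAGAAACAAGGTCTATTATAG", TABLE_12))       # KKQGLL*
```

Both outputs agree with the start (MEMLQRNV) and end (KKQGLL) of the protein sequence shown above, with `*` marking the TAG stop codon.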