Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31379 |
Symbol | |
ID | 4839032 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 647470 |
End bp | 649943 |
Gene Length | 2474 bp |
Protein Length | 722 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640390347 |
Product | predicted protein |
Protein accession | XP_001384089 |
Protein GI | 150865039 |
COG category | [A] RNA processing and modification |
COG ID | [COG5107] Pre-mRNA 3'-end processing (cleavage and polyadenylation) factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.828555 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCATAA ACGACAACAA CAGAAAACGT TTGGCGTTGG ATGTGGTTGG CCAACTAGAG GAAGATTTAG AGGCAGATCC CTTGAACTAT GCCAAATGGA ACAGACTCAT TAAACATGTG GTGGCCAAAG ACAAAGAAGA ACAAGTCAGG GCCATCTACA CCAAGTACCT CGGCATCTTC AAGTCTGACG TATGTATACG CTAAGCTGCT TATTGCTGTT AGAAAGGCAT GCTCATTTTA GTTCTTGTCA TTTGTATATA ATTAGATACT ATTGTTTGAT ACAATAGTAT AGTACTGCTG CTGTTGATAC AATAGCAGTC GAAATTCTTC TGTATATATC GGCACTGTCT CTCTGGAAGA ATTCAAATGG AATAATTAAT AGACACAGAA TCTTATAGAT GAGTTCTATA TCAGGTCTAG AGCAAAATTA AGCATTGCCT TTCTGGCAGA GTAAATTTCT AATATCGATT TAACAATTAC TAACATATCT ATAGGGAGAG CAGTGGTGCA AATATATCAA CTACGAGTTG AACCGAGGCG AGTTCCAAAA GGTAGAATCT CTATTTCACC AGTGTTTCTT GATTACAGAC AACGTAGAAC TTTGTCGTTT ATACGTATCG TACGTACGTC GTGTCAACGA TGTTATTACT GGTGGTGAAA AAGCCAGAGG CACGGTCATT CAGGCGTTTG AATTCGCCAT CAACAAAGTT GGAATAGATA TCAATAGCAC GGCTTTGTGG AATGACTACT TAGAATTCCT CAAGTCGTGG ACTCCTGCTG CTAGTTGGGA ACAGCAACAG AAAGTTGATT TGATAAGAAA AGTCTACAAA AAGTTTTTGA TTGTTCCCAC GGAAAACTTG GAGAACTCGT GGTCTCAATA TACTAAATGG GAAAATGAAG TCAACCCAGC TACAGCAGCC AAGTTCATCT CTGAAAAGTC TGCCGAATTT ATGCTTGCCA GATCGTGGAA TACCGAATGG CAGAACATCA CCGAAAGGAA GTTGATGAGA GACATTTATC CGTTCTCAGC TACTGGTGAA AAGGAAAAGA TTATTAGGAA CCAAGTCGGT TATTGGCTCA ATTGGGTCGA GTTAGAAAAG AAAAATATAT TGGAGTTGAA GGAAGATTTG CTTGAGAAGA GAATAGCATT TACATATAGA CAAGCTACGT TTGCATTGCC GTTTGTTCCG GAATTGTGGT TTAAAGCTAG TAAATTCTTG CTTCTTAGTA ATGAGGAAGC AAACATTAAT AGATGTGTAG ATCTCTTAAG TGAGGGTTTG CTGTTAAACC CCAGAAGTCT TCTCCTTTCA TTTCAGCTAG CCGAATTGCA TGAAAAGGAC GCTGGATTCG AAAAATCAAA GGATATCTAC AATAATCTTG CGAAGTGGTT AACTATTGAC TACACCAAAA CTACTGAGCA GTTAGAGTCG CTTAGATCAC GTTTTGAAAT CCCCAGCAAT GGCGATGACA ACGATGAAAA TGATCCAGAG TCCTTCAACA ACGACGACGA TATGCAAATA GATACCAAGA AAGTGTACCA ACTCACTTCT GAAGACAAAA AGCATCTTGC GACTTTGAGT AAGAAACAAA CTGAACTTGC AAAATCTGTA ACATTAGTCT ACGTGAAGTG GATGACAGCC TCCAAACGAG CAGAAGGAAT TAAGGAGGCA CGTAGTGTTT TCAAGCTGGC CAAGAAGTTT GCCAGCATAG GCAGTGAGTT GTTCGTGGAA AATGCTCTTT TAGAGCATTA TGCCGACAAC AAGAAAGTAG CCTTGAAGAT CTTTGATTTG GGTATGAAAG CTTACGCTAC AGACGGAGAC TTCTTATTTT CCTATTTGGA GTATTTAATC ATGATCAATG ATGTGGATAA CATCAGAATC TTGATCCAAA CGTCAGACAC CAACCTCACG AAAGACATCG TTTCTTTAAC TGAAGCAGTA CAACTAGGTC TGTTGAATGA ATACTTGAAG GAGTTGAAGG AAGATGAGAT TGAAGTGAAG AGAGGCTACT TGAGAAAGTT ATTTAAGCGA TACATTTCCT ATGCCTCAAA ATATGTTTCT TTGGATGTTG CTCAAAGTTT TGTAAATAAG TACGAGCAAA CTTTTCCCGA TGACGACCCC ATCGAGTTGT TTAGCGATAG ATACACACAA GGTGACGACA ACTTGATAGA AAAACTTGAT TTGGGAATTG ATAGTTCTAC ATCTTTGCCG CCCAGCAAGA AGAGAAAGGT CAATGCTAAG TCTGATATCG AAGACAACAA CAGAGACTTG GACGACCGTG ACGGTATATT CAGCGCTCCA CAACCACCAG TGTTGGAGCC TGAACAGCCC AGCTCTTTTG TGGGGCCTAC TATCACCACT CTTTTAGCAG CCTTACCCAA CGCCTCTTAC TTTGGACAGC CTTCAGAGAG TGTCTTCAAT AGTGAGAAAT TGGTCAAGTT GTTTTCCAAT TTGCCCAACA TTCCTGTTGA TTGA
|
Protein sequence | MFINDNNRKR LALDVVGQLE EDLEADPLNY AKWNRLIKHV VAKDKEEQVR AIYTKYLGIF KSDGEQWCKY INYELNRGEF QKVESLFHQC FLITDNVELC RLYVSYVRRV NDVITGGEKA RGTVIQAFEF AINKVGIDIN STALWNDYLE FLKSWTPAAS WEQQQKVDLI RKVYKKFLIV PTENLENSWS QYTKWENEVN PATAAKFISE KSAEFMLARS WNTEWQNITE RKLMRDIYPF SATGEKEKII RNQVGYWLNW VELEKKNILE LKEDLLEKRI AFTYRQATFA LPFVPELWFK ASKFLLLSNE EANINRCVDL LSEGLSLNPR SLLLSFQLAE LHEKDAGFEK SKDIYNNLAK WLTIDYTKTT EQLESLRSRF EIPSNGDDND ENDPESFNND DDMQIDTKKV YQLTSEDKKH LATLSKKQTE LAKSVTLVYV KWMTASKRAE GIKEARSVFK SAKKFASIGS ELFVENALLE HYADNKKVAL KIFDLGMKAY ATDGDFLFSY LEYLIMINDV DNIRILIQTS DTNLTKDIVS LTEAVQLGSL NEYLKELKED EIEVKRGYLR KLFKRYISYA SKYVSLDVAQ SFVNKYEQTF PDDDPIELFS DRYTQGDDNL IEKLDLGIDS STSLPPSKKR KVNAKSDIED NNRDLDDRDG IFSAPQPPVL EPEQPSSFVG PTITTLLAAL PNASYFGQPS ESVFNSEKLV KLFSNLPNIP VD
|
| |