Gene Information |
Locus tag | PICST_51666 |
Symbol | PFS2 |
ID | 4851082 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 887884 |
End bp | 889548 |
Gene Length | 1665 bp |
Protein Length | 541 aa |
Translation table | |
GC content | 47% |
IMG OID | 640392790 |
Product | polyadenylation factor I subunit 2 |
Protein accession | XP_001387402 |
Protein GI | 126274068 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGGCA ATCGGTCCAA CGGGAACGGA TACAACTCCA ACTCGTATGG AAATAACAAC GGCCGTCCTT CGTACAATAA CAGATACAAC AATAACAGCA ACGACCCGAA GGTTCAGGAA AAGCTAGCAG CGTACAACCA GAACATCGAA CAACAATTAG CATCTCAAGA AAAAAAGACA GCACATCGTA GAATCGTAGA CCATGGCAAC AATATGGGAA GGTGGTATAT CCACAAGAAC TTGGGGCTTT CTCAGCGACA ACAGGCTATA GGTAGCATCA GACCTGAGTC CTCGTATTTG ATAGACCTTC TTCCAACTCT CGCATATTCG TCGTCTTCCA ACTTAGGGGC GGCCAACAAC AAGAACAACA TGGCTGTGAT GGATATCCAG ACCAAATTCG TGCATTTGTC GTCCAACAAG GTTAAACATT CTATCAATGC TGTGAAATGG ACCCCTGAAG GTAGACGTCT ATTGGTAGCA TCACATAGTG GAGAGTTTAC TATCTGGAAC GGGATGACCT TCAACTTCGA GACAATTATG CAAGCGCATG ATTCGCAAAT TCTCGCATTG CAGTACTCGC ACAATGACGA GTGGCTCTTG TCTGGTGATC TGAACGGTGT CATCAAGTAC TGGCAGCCCA ACTTCAACAA CGTCAATATC CTCAATGGCC ATACTCAGGG AATCAGAGAC ATTGCGTTTT CACCTAACGA CTCTAAATTC TTGACCTGTG GTGATGACTC CACCTTGAAA ATCTGGAACT TCAACAACGG TAAAGAGGAA CGTTCCTTAG CTGGACATCA CTGGGAAGTC AAGTCTGCTG ACTGGCATCC CAACTTGGGG TTGATCGTCA GTGGATCCAA GGATAACTTG GTCAAGTTGT GGGATCCTCG TAGTTCTACC TGTGTCACAA CATTGCATGG ATTCAAACAT ACAGTAAACA AGTGTAGATT CCAGCCTACA GGTACCAAAA GGTTGTTGGC ATCTGTTTCT CGTGACAGAT CATGTCGTGT TTTTGACTTG CGTACAATGA AAGATATCTT AGTCTTGAGA GATAGCGAGA CCGATTTGTC ATGTGTCTCG TGGCATCCTA CACATGCATC GATGCTTACT ACAGCTGCCT ACAACGGATC CATGAGTCAT TACCTTCTCG ACTCCTATAT TCCTGATAGC AACACCAGTG AGCTTTCCAA AAAGTCTACC TCTTACGGCT CGTCTTCCGT AGGCTCTATA GAAGCAGTGC ATAGAATCCC ATATGCACAC GAAAGAGCCA TCCATGCCTT AGAGTACCAC CCCTTGGGCC ATTTGTTGTG TTCTGCTGGT TCAGACAAGA CCGCTAGATT CTGGTCCCGC GCAAGACCCA ACGATCCAAT GTCGTACAAG GACCCATTGT ACACAGACGA CAAGCATGGA GCATGGTACT ATTCGGTGAA CAACAACATC AATGCTGTTA TCGAAGATCC GAGTGGCTCC AGTGGCACGG CTACAGACTC ACTACCAGTG CCGTATGGAG AGGACAGAGA CCGTTCGCAT ACCCCAGGAC TCAATTTGCC AGGATTGGGT TCGAGCTACG ACTACAACGG AAATGGCAAT AGCGGGAATG GCGATATTGC AGCCACTCCG GCCCCTAGCA ACTGGGGTTC CATTCCTGGC TTACGGGGTC ATTAA
|
Protein sequence | MYGNRSNGNG YNSNSYGNNN GRPSYNNRYN NNSNDPKVQE KLAAYNQNIE QQLASQEKKT AHRRIVDHGN NMGRWYIHKN LGLSQRQQAI GSIRPESSYL IDLLPTLAYS SSSNLGAANN KNNMAVMDIQ TKFVHLSSNK VKHSINAVKW TPEGRRLLVA SHSGEFTIWN GMTFNFETIM QAHDSQILAL QYSHNDEWLL SGDLNGVIKY WQPNFNNVNI LNGHTQGIRD IAFSPNDSKF LTCGDDSTLK IWNFNNGKEE RSLAGHHWEV KSADWHPNLG LIVSGSKDNL VKLWDPRSST CVTTLHGFKH TVNKCRFQPT GTKRLLASVS RDRSCRVFDL RTMKDILVLR DSETDLSCVS WHPTHASMLT TAAYNGSMSH YLLDSYIPDS NTSELSKKST SYGSSSVGSI EAVHRIPYAH ERAIHALEYH PLGHLLCSAG SDKTARFWSR ARPNDPMSYK DPLYTDDKHG AWYYSVNNNI NAVIEDPSGS KDRDRSHTPG LNLPGLGSSY DYNGNGNSGN GDIAATPAPS NWGSIPGLRG H
|
| |
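
The coordinate and length fields in this record are related by inclusive, 1-based arithmetic: End bp - Start bp + 1 = 889548 - 887884 + 1 = 1665 bp, which matches the Gene Length field. The sketch below shows one way to check that relation and to compute a GC percentage from a sequence displayed in space-separated 10-base blocks. The function names (`clean_sequence`, `gene_length`, `gc_percent`) are hypothetical helpers written for this illustration and are not part of any database API. How the record's own GC content value was derived is not stated here, so the plain G+C fraction used below is an assumption.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits a displayed sequence into blocks."""
    return "".join(grouped.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases (assumed definition: (G + C) / length)."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Coordinates copied from the Gene Information fields of this record.
length_from_coords = gene_length(887884, 889548)

# First three 10-base blocks of the gene sequence, as displayed above.
sample = "ATGTACGGCA ATCGGTCCAA CGGGAACGGA"
sample_length = len(clean_sequence(sample))
sample_gc = gc_percent(sample)

print(length_from_coords, sample_length, round(sample_gc, 1))
```

Passing the full Gene sequence field to `clean_sequence` and comparing its length with `gene_length(887884, 889548)` gives a quick consistency check on a copied record.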