Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_46351 |
Symbol | |
ID | 4839595 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 1377475 |
End bp | 1380360 |
Gene Length | 2886 bp |
Protein Length | 937 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640390910 |
Product | predicted protein |
Protein accession | XP_001385271 |
Protein GI | 150865880 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0158707 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCAAATGGTT TTGTCAAGTT CATAAGATCC ATTTTCGGTT ACAGAAAGAC TTCGTTAACC TTATTTGTGA TTTTGACCTA TGTTGCTGTT TTGCTTCTTG CCTACTTGGA CCATTCGCTA TACTACTCTG TAGACTTACC CACATCTCAC AAGGAACAAG AATTGCTCCA CCAGGCTTGG GTAGACCTCC AGCACATTGC CAAATACGAG CACGCTTATG GTTCCAGTGG CAATGACTAC GTCCACGACT ATCTTGAGTC GCGAATCGTT AGTGCTGTGG CACACAAATC GTATGTAGAG TATGATAACG ATTTGAACTA CACCAACAAC ATCATGTTTG GCAGCCGTTC CGAATTAAGT GGCAACCTGT TCAACTCCGT GTCATACTAT GAGAGCAACA ACTTGGTTGT TCGCATCAAC GGAACCGACG AGACCCTACC AGCGTTGCTT TTGAGTGCTC ACTTTGACTC TGTTCCCTCT TCGTTCGGAG TGACAGATGA CGGAATGGGA ATCGCTAGTC TATTAGGAGT TTTGTACTAT TATACCGGTA AATCCACAGC TCGTCCTCGA AGAACCATCG TCTTGAACTT CAACAATGAC GAAGAATTCG GCCTCTATGG AGCCACATCC TTCTTGTCGC ATCCATGGGC TACAGGTGTG CATTACTTCC TCAACTTAGA AGGCACTGGA GCTGGTGGAA AAGCCATTTT GTTTAGGGGA ACAGACTACG GTATAACCAA GTACTTTAAG GGAGTCAGGT ACCCTTATGG AACGTCTATT TTCCAGCAGG GATTCAACAA CCACTTAATT CACAGTGAAA CAGACTACAA GATTTACAAG GAAAAGGGAG GCTTGAGAGG TTTGGATGTA GCTTTCTACA AACCAAGAGA CCTTTACCAC ACTGCTGGTG ATAACATCAA GAACATCGAT ATCAAGTCGT TGTGGCACAT GCTTTCCAAC GCTTTGGATT TCACTGCCAT AGTCACAAAG GGCAAGATTG ACTTGGATGC CGATTCCTTG GACTCTGAAT CGAGCAAATC CAATACAGAC ACTGCTGTGT ACACATCATT CTTGAACTTC TTCTTTGCTT TCCCTACTTC TCAAGTGGTT GTAGCAAGTA TTTTGCTTTT GGTACTCATT CCTGGGATCA GTATCCCGTT CTTGATTATC ATATTTGGGT ATAAAAAGAA CTGGGAATTG TCCTTCGTCA ATGTCACCAA GTTTCCCATC TCGCTAGCTA TTTCTGCTGC ACTCTTGAAT CTCTTTACCA ACGGTTTTAT CGTTCCATTC AATCAATTCT TGCCCAACTC GTCTCCCTTT GCCTTGGTCG CCATTTTGTT CGCAACTTTC TTGTTGTTGA ACTACTTGAT CTTGAATGGT ATCAATTTGA TTTTTGTTTC ATACAAAATC GTCAACCATG ACGAAAAGTT GATTTCCATC ATTGAAACTT CCTTCTTGTA TTGGGTTGTT TTGATCTACT CGACTGCTAA ATTGGCTAAC AATGTTATCG GTGACGACCA CTCAGGTGAA TTCCCTATTA TTTTCCTCTG TGCTTTGCAA GCTGTAGCCT CCATCTTTGG CTTGATTGGC TGGAGTTTCA AACCCGTTCC AAAAGAACAT TATGTCGTAG TGCCCCAAGA AGAAGCTGAG CCATTGTTGG GTAGTAGTGA CAACTTCAAT TATGGCTCTC CTGATGTCGA AGATGACAGA CTTGTTTCTG ACGGTTCGTC TTTGTCGCTT AATTTCACTG GCGAAAATAG TGCTGAAAGA AAGTTGAAGG ATTTCATCAA AACCTTTAGT TATGATTGGT CAATTCAATT CTTGACTATT GTTCCAATTT CAACTTACTT GATTTACAAC TCAGGTTTCT TGGTTGTTGA TGGAATCAAT AAGTCTATCC AAGAGTCGCT TATTTCCCAG AATTTGATTT ACAAGCTCTT GCAAACTTTT GCTATTTCGT TGTCCATACC CCTTTTGCCA TTCATCTTCA AGGTTAACAG ATTGTTTGTC TTAGCTCTTT TTTTAATTTC GACCATTGGT GTCTTGTTCG TCGCCACGGC TGATTCATTC AATGTTGCCA ATCCATTGAA ATTGAGATTC ATCCAGTACA TTGATCTCGA CAAGTCCGCA CAAGATAGTT TTGTTTCTGT TATCGGACGT GAAGCTTCAC CATTGCAGTT CGTCTTGAGC GATATTCCTT CTGTGAAAGA TTCAAAGGGA GCAGTTGCCT GTGTTCCTAC TAGAGATGGA CTTCAAGATT GCTCATATAA GAGTCTGCTT GATCCAAAAC TTGTGCCAGG AGCTAAAAGC TTTGATGACT ATTTGAAAGT TGATATTCTC AAGAATTCTT CATCAAATGT CGATTATCCG TTTGGACTCT TAACTGGTGA GATCAGGATT AGAGTTCCAA AGAACAGAGA GTGTGTTTTG GATTTCAAAC CAAGTGAGTC AACAAAGATA GTGTCTCCAT TTAAAGATTC TCCAGTAAAA ACTGTCATAG TATACAAGGG TAAGAAGTCG GCCACTACTA AAGAAGTTGA AGCTGAAAGT ATACCAGAAG GTTTCTCCAA AGACAAAGAT GGTAATTACG TCTACAAGGA TCTTGTAGGC ATAGATCAAC TTCAGTTGAA CAAGCTTGAC TGGGATAAAA GTTACCATGT AGGTTTCCAA TGGGTGCCGA ACTTTTCAGA TGTTGATATC AACATGAAAA AGTCCGCTAC AAACAAATTG AATGTCAGCG TCAAATGCTT CTGGGCTGAA TTGGGCAAGG GTGAAGAATC AACTATCCCT GCCTATGAAG AGTTGTTGCA TTATAGTCCA AACTATGTTA GCTGGGCCAA TAGTGCCAAG GGGTTGGTGT CGGTCTCCAA AACCGTCGAA TTATAA
|
Protein sequence | PNGFVKFIRS IFGYRKTSLT LFVILTYVAV LLLAYLDHSL YYSVDLPTSH KEQELLHQAW VDLQHIAKYE HAYGSSGNDY VHDYLESRIV SAVAHKSYVE YDNDLNYTNN IMFGSRSELS GNSFNSVSYY ESNNLVVRIN GTDETLPALL LSAHFDSVPS SFGVTDDGMG IASLLGVLYY YTGKSTARPR RTIVLNFNND EEFGLYGATS FLSHPWATGV HYFLNLEGTG AGGKAILFRG TDYGITKYFK GVRYPYGTSI FQQGFNNHLI HSETDYKIYK EKGGLRGLDV AFYKPRDLYH TAGDNIKNID IKSLWHMLSN ALDFTAIVTK GKIDLDADSL DSESSKSNTD TAVYTSFLNF FFAFPTSQVV VASILLLVLI PGISIPFLII IFGYKKNWEL SFVNVTKFPI SLAISAALLN LFTNGFIVPF NQFLPNSSPF ALVAILFATF LLLNYLILNG INLIFVSYKI VNHDEKLISI IETSFLYWVV LIYSTAKLAN NVIGDDHSGE FPIIFLCALQ AVASIFGLIG WSFKPVPKEH YVVVPQEEAE PLLGSSDNFN YGSPDVEDDR LVSDGSYDWS IQFLTIVPIS TYLIYNSGFL VVDGINKSIQ ESLISQNLIY KLLQTFAISL SIPLLPFIFK VNRLFVLALF LISTIGVLFV ATADSFNVAN PLKLRFIQYI DLDKSAQDSF VSVIGREASP LQFVLSDIPS VKDSKGAVAC VPTRDGLQDC SYKSSLDPKL VPGAKSFDDY LKVDILKNSS SNVDYPFGLL TGEIRIRVPK NRECVLDFKP SESTKIVSPF KDSPVKTVIV YKGKKSATTK EVEAESIPEG FSKDKDGNYV YKDLVGIDQL QLNKLDWDKS YHVGFQWVPN FSDVDINMKK SATNKLNVSV KCFWAELGKG EESTIPAYEE LLHYSPNYVS WANSAKGLVS VSKTVEL
|
| |