Gene Information |
Locus tag | PICST_67553 |
Symbol | |
ID | 4838892 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 730725 |
End bp | 733952 |
Gene Length | 3228 bp |
Protein Length | 850 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640390207 |
Product | hypothetical protein |
Protein accession | XP_001384437 |
Protein GI | 150865287 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
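The coordinates and lengths in the record above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding region carries exactly one stop codon. The names `span_length` and `cds_length_nt` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record above.
START_BP = 730725
END_BP = 733952
PROTEIN_LENGTH_AA = 850


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def cds_length_nt(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (assumed)."""
    return (protein_aa + 1) * 3


gene_len = span_length(START_BP, END_BP)
cds_len = cds_length_nt(PROTEIN_LENGTH_AA)

# Difference between the annotated span and the bases needed for the protein.
print(gene_len, cds_len, gene_len - cds_len)
```

Under these assumptions the span matches the listed gene length of 3228 bp, while the 850 aa protein needs only 2553 nt. The remaining bases would be sequence flanking the coding region, which is consistent with the listed gene sequence starting upstream of the first ATG.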
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00659474 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TGCTATTACA GTTGCTGTGT TGACTACGTC TATTGAAACT TCTTCTTCTT TACTACAAAT TTCCAATACT ACTACTTGAA CTGATTCACT TTTTTATTCG CAACCACTTC TTACCTGATT CTCCAGTGTC TAGCAATGCC TAACAAGTTA GAATTAACTG CCAGCGAAAT CAAGGCTATT CAAACTTCGT GGGAACGGTT ACAATCCAGT AACAGAAAGC ACAAGGACCA CTTCACTTCG AGATTGTACT CGAATTTGGT AAGATCCCAC CCACAATTCA AGACTATCTT CGATGATTCC AGCGTCTTGA AGGAACACTC TACTTTATTC GGCGAAATCT TGAGTTTCAT CGTCTTGTAC AAACATGACC CTGAAATCTT GAATGACTTC ATGTACCAGT TTATTCATGA AAACCAGAGA TTTGCCAACG TTACTGTTCT TTACTTGGAG CCTATGGGAG AATCTTTGAT TGATACTTTC AAGCAATGGT TAGGTGACTC TGTTTTCACA AATGAATTTG AATCTGTCTG GATCAAGACT TACGTCTTTG TTGCAAACTC TTTGTTGCAA TACTCCGATA GTGACGACGA AGCCAGCGAA GTGGAAAGCA CCTTCAGCCC ACAGCAGGCT GGAAGTGAAG CTGACGTTTC TGACAACGAA GACATTCAGC CTTTGAATAT CAAAAGAAAC GATCGCTCGT TGTACTCGAC TCCTGAGCCT GTTTCTGAGC CTGTTTCTGA GCCTGTTGCT ACTCCCGTTG TTGCACCTGT TGTTGCACCT GTTGTTGCAC CTGTTACTCC AAAGACTGCC ACTGAGGCTC GTGCCTCAGT TTTGGACAAG TCTAACTCTA TCCAATTCAC ATTGGGTGGC AACGACAAGT ACAGAGGCTT CAGAAGATCG GTTCATGATA ACATTCCCAA GAATGAACCA ATTTCTGTCA AGATTCCACA GTCCAGCAAC TTTCTGAAAA CGCATTCCCA TATTCCTACT TCTAACATTT CCTCTTCGTT GAAGTCTGTC TTGGAATCGT CTCCTACTTC CTCTAACTCA TCTATAACTT CAGCTCCTAC ATTCGATCCA AGAAGGCAAA GAAGATCTCG TTCTGTTGTG AGTTCTTCTG CATCTTCAAT CAACGAAGAA TTTACTTCTG CACCTGAATC ATTTGAACAA CCTATCATAA CTCCTCGTTC TGCCAGAAGA AATTCTAATG CTTCTGAAAA GCCACTTCCA GCTGCACCTG TACAACAAAA GAAGACTTTG CCATATCAAC CTTCTTTACT CAGAAAATTG GAACAGAGAC AAGCTTCCCC ATCTTCAGAA GAATACTCTG ATGAAGAAGT TGAAGTTAAG TCGAGGTTTG ACCCAAGAAA ATCTAGGGGC CACAAGAGAA CAAACTCGAT ACACATTCCA TCCCCAGAAT CTTCTGAAGT GGACGAAACT GACGAGACTG AACTTTTCAA TCCATACAGA AGCACTTTTG CCGAGGAATT TGACGAAGAA AAACAAATTG ACATCGACCC ACAAATGCCA TCTGCGTCAT CTTCTATTCC CGAGAGATCT TTGTCTAGAG GAGGATTGAC TGGTGGTACT TTTGACTACA ATAGTTTTGG ACTTAAGGGT TTAGCTCCTA TTGTCGAAAA TGAAATTGAT GATGGTGCTT CTTCTAAGTA CAACAGTGAT GAAGATAATG TTTCTTCGAG CTATTGTGCT GACGAAAGTA ACAAATCATT ATCTGACCAA GGCTCTTCTG ACCAAGGGTC TTCTTCCAGA GCCTCGTCTT TGTCGTTACA CAATAGTGAT 
TACAAATCTT CTATCTCGTC TGGCACTGAA TCTGCCAGCA ATTCGCCATT TATGAACGGC AAGTTCCAGC CGGGCCATCA AATGAGACAT TCTTCTGGAT CGAGTGAAAT TAGTTACATG AAGTCATTAT CTGATGATAA TACCTTCGCT TACAAGACTT ACAGTTCATC TACACCCTCC TTGTTGTTAG GAAAGGATCC TACGCTGTTC AGTAAGAGAG CTTCGATTGG CTTCATGAGA AGTTCCTTTG TGTTGAAGAA GGAAATGGAA GAGTTTGGTT TGGCTAGACA AGACTCGTTG ACCAAGACTC TCTCCGGTCT GGGATCTACT CCAAACTTGG CTCACCTTCC TCAAAACTAC AATAGAAGGG CTGCTACTAC CATTTCAATT CCAGAAAGCG ATGGTGACGA TGAATTTGAC ATGTTCAGAT CCTTTGGTCA AGCTCCTCCT ATTCAGACGA AGCCTATAAT TCACAACAAG TATGAGTCCA TGAACTTTAC TGCCATTGCT GAAAAGGCAA AGGCAAAGGC TCCAGTTCAA CAAGCTCCAC CTGCTGTTAA GGAAAAGAAG TCGTTCAAGA AGAGAATTTC GTCGATGTTC TCATCCAAGT CTTCTGATTC TACACCATCA GTTCCTCCTA TTTCTGTTGC TGCCCCTAGT GTTTACACTA GTACCAGCTC CAAGAAGACT TCTAAGCGTT CTGGATCCAC CTATGACTTG GCTTCGATCA ACACCAACTC CACCAAGGGT ACCACAGTTT CTGGCTTCTC CTTCTTCAAG CAAAAGAAGA ACGATAGCAA GTATAACGTG GTACACAGAA CTACTCCAAG AAAGGGTAAC AAGTACCAGG TTAAGGCTGC TGCTTACGAC TTGGATTTGT TCAAGTAAAC AGCCCCAACC CTAGTGCCTG TACTATAGAC CGACTTTCCT ATGAAAGACA AAAACACTAG AAAACACGAT GATACCCAGA TAGCCTGATT GGTGCTGTCT CTTATTTAAA GTTGTGAAAT AACTACGAAG CTGGCCGTTC CAAAATTTTC CTTTATTTTC TTTTCTTCTC TCGTTGCCGT TTTGAATACT ACTTGTTCTA CTGGCTGGAA CGTTTGGTGC TGTTCTACAG AGGTCTCGAA ACATGAGTCG GAGTCAAACA TCGTCTGAAA GACGTCAATC GACTACCCCC CATGTAATGC ATTGGGTCAA ACAAAAGGAT TGTTGCTAGA CATCGATAAA CTTAGACTTG TTTGATGATT CTGAATGCAT GATCACGAGT ACCTGGTTCA CATAGCTCAG GCGGATCCTA CACCTGGCTG CAAATAGCAT CTTTTGATTT GTATGATCTG CACTTTGTTC CTTTAGCTGG AATTTTTTTC ATATCAGCTT CGTCTATTTC CTTTGTGAAT GTATAGTAGA TTAATGAACA TTTGAATT
|
Protein sequence | MPNKLELTAS EIKAIQTSWE RLQSSNRKHK DHFTSRLYSN LVRSHPQFKT IFDDSSVLKE HSTLFGEILS FIVLYKHDPE ILNDFMYQFI HENQRFANVT VLYLEPMGES LIDTFKQWLG DSVFTNEFES VWIKTYVFVA NSLLQYSDSD DEASEVESTF SPQQAGSEAD VSDNEDIQPL NIKRNDRSLY STPEPVSEPV SEPVATPVVA PVVAPVVAPV TPKTATEARA SVLDKSNSIQ FTLGGNDKYR GFRRSVHDNI PKNEPISVKI PQSSNFSKTH SHIPTSNISS SLKSVLESSP TSSNSSITSA PTFDPRRQRR SRSVVSSSAS SINEEFTSAP ESFEQPIITP RSARRNSNAS EKPLPAAPVQ QKKTLPYQPS LLRKLEQRQA SPSSEEYSDE EVEVKSRFDP RKSRGHKRTN SIHIPSPESS EVDETDETEL FNPYRSTFAE EFDEEKQIDI DPQMPSASSS IPERSLSRGG LTGGTFDYNS FGLKGLAPIV ENEIDDGASS KYNSDEDNVS SSYCADESNK SLSDQGSSDQ GSSSRASSLS LHNSDYKSSI SSGTESASNS PFMNGKFQPG HQMRHSSGSS EISYMKSLSD DNTFAYKTYS SSTPSLLLGK DPTSFSKRAS IGFMRSSFVL KKEMEEFGLA RQDSLTKTLS GSGSTPNLAH LPQNYNRRAA TTISIPESDG DDEFDMFRSF GQAPPIQTKP IIHNKYESMN FTAIAEKAKA KAPVQQAPPA VKEKKSFKKR ISSMFSSKSS DSTPSVPPIS VAAPSVYTST SSKKTSKRSG STYDLASINT NSTKGTTVSG FSFFKQKKND SKYNVVHRTT PRKGNKYQVK AAAYDLDLFK
|
| |
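The record lists translation table 12, in which CTG is read as serine rather than leucine. The following is a minimal sketch of how that difference shows up in this gene, using two short in-frame fragments copied from the gene sequence above. The helper names (`build_codon_table`, `translate`) are hypothetical, and the codon table is built by hand rather than taken from a library.

```python
BASES = "TCAG"
# Amino acids of the standard code (table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_table(aas: str) -> dict:
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, aas))


TABLE_1 = build_codon_table(STANDARD_AAS)
# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE_12 = dict(TABLE_1, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate in frame 0, ignoring spaces and stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 coding bases, starting at the ATG in the gene sequence above.
cds_start = "ATGCCTAACAAGTTAGAATTAACTGCCAGC"
# In-frame fragment from the gene sequence that contains a CTG codon.
ctg_region = "AACTTTCTGAAAACG"

print(translate(cds_start, TABLE_12))   # start of the listed protein
print(translate(ctg_region, TABLE_12))  # CTG read as Ser
print(translate(ctg_region, TABLE_1))   # CTG read as Leu under the standard code
```

The first fragment reproduces the opening residues of the listed protein (MPNKLELTAS). The second fragment yields NFSKT under table 12, matching the listed protein sequence, whereas the standard code would give NFLKT.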