Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_65774 |
Symbol | |
ID | 4839113 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 836045 |
End bp | 839135 |
Gene Length | 3091 bp |
Protein Length | 961 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640390428 |
Product | predicted protein |
Protein accession | XP_001384829 |
Protein GI | 150865562 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.169462 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.537048 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTTCTGAGCT AGCAAAGCTT CCAGAAGACA CAGACACGTC ACTGTCGTAC CCCACCAACC ATCTCCTAAA AGCTCATAGT CGCAAGCATT GCCATGGTCG CTGGAGATAA CTTACAACAG CTAAAGCTGG CATTGGAGAC TATGTACTCC AATGCTAACC AAAACGATAA AATCAACGCT ACCCATTTTC TCGAGACATT CCAAAAATCC CAAGATGCTT GGGAAATCGT CCATACCATT CTCAACGACG CGCATTTAGA TATCCATATT CAGCTCTTTG CCGCCCAGAC GTTGCGTTCA AAAGTGACAT ACGATTTGTC TCAATTGCCA GAGCAGAACT TTGCAACCTT GAAAAATTCC ATTATTCAAT TACTAACGGT GTTTACCGCC AACAACCAAC GTCTTGTGCG TACGCAGCTC TGTGTTGCGT TAGCACAGCT TGCATTGCAA TATTTGACGT GGCAGGATGC TGTGTCGGAA ATTGTTACTA AGTTATCGCT GACGGCAACG TACTTGCCCT GTTTGTTGGA CTTCTTGAAG ATCTTGCCCG AAGAGTTGTC TGACGTCAAA AAGACATCTT TGTCCGACGA TGAGTTCAAC ACGAGAACTA GAGAATTGAT AGAGAACAAC GTAGAACAGG TGTTACTCTT GTTGAAGAAC TTGACCGACA CTAACTCCAG TAACTCGTCA CAAGACTCAA TGGTTCTCGA CTGTTTGAAC TCGTGGATCA AGGAATGTCC CATAGAAAGC ATTCTCCGTA TTGATTCTTT AACTTCACTT ATCTTCCGTA GTTTAGCCAG CGAAGAAACC TTTGATAAGT CTATTGAATG TCTCTGTACG ATTATTAGAG AAACAAGAGA CATCGACAAC CATGAGCTCA TCGAAGCCTT ATACAAACAG ATCATCGAGT TGAACTCGTT TATGCATGCT AACCCTGATA GACTTGAAGA TCCCGAAACA TTTGACGGTT TGTCACGTTT GTATGTGGAA GCCGGCGAGT CGTGGCATGT TCTTATCGCC AAGAACCCGA AGCACTTCAA GCCGTTGGTG TTAATCCTTT TGGAAATCTG TAAATACCAA GACGACTTGG ACATCGTCAA GTACACATTC TATTTCTGGC ACTTGTTGAA GCAGTTGCTC ACCATTTCCA AGTTCCAGGA ATCAAAGGAA GAGTTGGCAG ATATCTTTGC CAATTTAATC ACCATCATAA TAAAGCATTT AACCTACCCC ATAACTGGAA ATGACCACGA CCTTTTCAAT GGTGATAGAG AACAGGAAGA CAAGTTCAAA GAGTTCCGTT ACGAAATGGG GGACGTTCTC AAAGACTGCT GTGCAGTAGT AGGACCCTCG AAGGCTTTGA GCATTCCCTT CCACCAGATC CAGACCATTT TATCCTCAAA CATGCCTTCG ACCAACTGGC AGCACTTAGA GGCACCTTTG TTTTCCATGA GAGCCATGGC TAAGGAAGTT TCTACCAAAG AGAAAGTCAT GTTGCCTACT ATCATGTCAT TTCTTGTGCA GTTGCCGGAA CATCCAAAGG TCAGGTATGC AGCTACATTA GTATTGGGAC GGTATACCGA ATGGACAGCC AAGAATCCGG GATTTTTGGA ACCACAATTG AACTACATTA TCAAGGGCTT TGAGATTGTC AGCTCCAACA GCGCAGACGA ACAGGGAAAA CACGACATTA TCATTGCTGC TTCTCGAGCC TTGATGTATT TTTGTCAGGA TTGTTCCGAA TTGTTGGTCA GTTATTTGGA ACAGTTGTAC ATGTTGTATG GGCAAGTTCG TGACCAACTT GACTTGGAAT CAACGTACGA ACTAGTTGAT GGTTTGGCCC ATGTAATTTT GAAGTTACCA ACGGAAAACT TGTACACCAC TACAGAAATG TTCATTTCGC CAACTTTGCA GACTTTAAAT CAATTGCTCG TAGCTGGTGA AAATGAAGCG AACTCCAAGT CTGTTGCTGA TCAAATTGAG GTTTTGACAA AGTTCATATA TGTCTTGAAG GCTAACAATT TCAGTAAGCC TGATAGCCCT ATTGCACGTT TATTCATAGA AAAGATATGG CCAGCTATTT CTCAATTGTT GGCTGCATAT GGTAAGTCGG TCATTGCAAG TGAGAGAATT TTGAAGTTAG TCAAGTCAGG AATTCAATCC CAGAGCACAT ATTTGAACAG TCTTTTGCCC GAAATGGCTA CCTTGTTGAT TCAGGGCTTC CAGCAGTCAC ACTATGGGTG CTATCTTTGG GTATCTGGGG TTTTGATCAG AGAATACGGT GATGAGTATA CCTCGGAAGA TATCAAGGAT GCTGTCTACA GATTTGGTAT GGAACAATGC TCGTATTTCT TTAACCTATT GTTCAATACC AATGAAGAGG GAGTTCGTGC CATGTCGGAC GTTGTAGAGG ATTACTTCCG TATGATGAAC GACTTGCTTA TGTTTTACCC GTTTAAGGTG ATAGCCAACC AGGACTTATT AAAGTCTACT CTCAAGGCAT CGTTATTGAC TTTGAATCTG ATCAACGAGT TCAACCCAAT CATTTCATGT ATACACTTCC TTGTAGACTT GGTATCATGG GGATTGCCTA GCCCTCCAAT TTCGTTCTTT GATGAGAGCG ACTTGACTAT TCCCAGACAC GGCATGCAAC AGTTTCTCGT TAGCGAGAAT AACGGAGGAG AGTTGTTGAG AGTGGTGTTG AATGGCTTGA TTTTTAAGTT CAACAACGAT ATTCAGCAGG ACACCAACGA CTTGATTCTC AAGATCTTGG TAGCTGTTCC AGATAAAAAT ATTTCTATAG GCTGGTTGCA TGAAGTGGTG AAGGCTTTAC CCAACGTCAA CCAGAAAGAG ATCAGTAAGC TTATGGATAC AGTTTCAGTG GCATTGCCAA ACAAGGACAA TAGAAGAGTG AGGTCTGCGC TTCGTGACTT TGTCAACTGG TACAGCCGTA AGAACGTGAC ACCCAGAAGT GAATTCTAGG TGGAAATCTA CAGACTAAAA GGGAATCATA CGTCAAAAGT TATAGTTAAT GGATACTAGA TAGAGAAGAT GTAGAGGAGA GAGAGGTTGT GAATAGAATG AGCCACCACG G
|
Protein sequence | MVAGDNLQQL KSALETMYSN ANQNDKINAT HFLETFQKSQ DAWEIVHTIL NDAHLDIHIQ LFAAQTLRSK VTYDLSQLPE QNFATLKNSI IQLLTVFTAN NQRLVRTQLC VALAQLALQY LTWQDAVSEI VTKLSSTATY LPCLLDFLKI LPEELSDVKK TSLSDDEFNT RTRELIENNV EQVLLLLKNL TDTNSSNSSQ DSMVLDCLNS WIKECPIESI LRIDSLTSLI FRSLASEETF DKSIECLCTI IRETRDIDNH ELIEALYKQI IELNSFMHAN PDRLEDPETF DGLSRLYVEA GESWHVLIAK NPKHFKPLVL ILLEICKYQD DLDIVKYTFY FWHLLKQLLT ISKFQESKEE LADIFANLIT IIIKHLTYPI TGNDHDLFNG DREQEDKFKE FRYEMGDVLK DCCAVVGPSK ALSIPFHQIQ TILSSNMPST NWQHLEAPLF SMRAMAKEVS TKEKVMLPTI MSFLVQLPEH PKVRYAATLV LGRYTEWTAK NPGFLEPQLN YIIKGFEIVS SNSADEQGKH DIIIAASRAL MYFCQDCSEL LVSYLEQLYM LYGQVRDQLD LESTYELVDG LAHVILKLPT ENLYTTTEMF ISPTLQTLNQ LLVAGENEAN SKSVADQIEV LTKFIYVLKA NNFSKPDSPI ARLFIEKIWP AISQLLAAYG KSVIASERIL KLVKSGIQSQ STYLNSLLPE MATLLIQGFQ QSHYGCYLWV SGVLIREYGD EYTSEDIKDA VYRFGMEQCS YFFNLLFNTN EEGVRAMSDV VEDYFRMMND LLMFYPFKVI ANQDLLKSTL KASLLTLNSI NEFNPIISCI HFLVDLVSWG LPSPPISFFD ESDLTIPRHG MQQFLVSENN GGELLRVVLN GLIFKFNNDI QQDTNDLILK ILVAVPDKNI SIGWLHEVVK ALPNVNQKEI SKLMDTVSVA LPNKDNRRVR SALRDFVNWY SRKNVTPRSE F
|
| |