Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_24134 |
Symbol | |
ID | 4851834 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2956771 |
End bp | 2960040 |
Gene Length | 3270 bp |
Protein Length | 1090 aa |
Translation table | |
GC content | 38% |
IMG OID | 640393542 |
Product | predicted protein |
Protein accession | XP_001386899 |
Protein GI | 126275742 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000713013 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | AGACCGATTT CAACTTTGGC TACAGATCCA CGAAAGGTGC TACGAAAAAA TACACTTCAA GGTGACTATT ACCAAGTGGG AACGCTTTTG GAGGATCCGT CTTTGAAAGA TGAGTACCTT AATGCTCTCG AGGTAGCTGA AACTTTTCTG GATGTAGAAA TCGACTTGAT ATCTGCCAAT TACAGCAATC TTTTGGAAGT GATGGCTGGA TTCCGCGAGA AAGAAGCCCT AGATCTACTT GTTGATATAT GGACAATAGT TTTCAACAAG AAGGGAAGAA AAATCAAGTC AGCCCGTAAG TTGCAACAGT CATTGCTTCA GAAAAGACAG CAGCTCGTTG ACATATTGAT GGATGCTAGA CGCTATGACA TATACGAAAC CCTTATACAA CCGCTACTAG AGGATTCTTC ATTCAGTGAG AAACGTACCC AGTCAGACGA CTTCATGATA ATTTGTCAGT TGAAGTACCA AGATGGGAAG CTACTGCTCA ATAAGCATGG TATATTGCGT TATTTGAGAG ATGAATATAT ACCCATGCAC AAGAAAAGGC AGTTTCTTGA GCGACTTACA CGTTCCGGAA TAATTTATGC TTCCAATGAC GAGGAGCGAT ATTTATTCAT CAAATACTTC ATGGAGTTTT CCAAAGTGAT GGGAGATATC GATATGAACT TCGTAGACTC GCAATTTAAG GTATACGAAA AAGTATTGCG TCTCATGGCA ATTCCTCCAG AAGGATTCGA ACAGCATATC ACGGAGCTTT ATCACCAGTT TGTAAAGCTT GACTTCGAAC CAGAAGCATT TAATAACTTG TTGACTTCAC TCATGAGTGC AACAGCATCG GCTCCAAATT ATGTCTTGAA GTATTGGCAG TATAAAGTAA AGCACAGTCG TCAGATGGGT TTAGCACCAT CCAGAGTATT GAACTACAAG GATTTGAAGT ACGCAATGCA GTCACTCTTG TCTATGAATT ACCACAAAGA AGTAGAAGAA CTCTACTGGA ACTTTCCAAC TCTTCACGAT GACGATCAAA TAGAAGTACT CTTGGAACTA TGTGCTGTCT CCAAGGATTG GGAGGGACTT CAAAAGCGAT TTGAAGAAAT GTACGGTCGT GGTAATCTTC CTCTTACAGT CCATTATGCT ATAGTCATGA ATGCATTGGC CACTGTTGGG GCCAAGCAAG AAGTCGATAA TCTTTTCCAG CAACTCATAG AGAGAAAGCT TGAGGCATCA ACCTCTGTCT TCATTTCATT GATTAACAGT AGACTCTACT ACAACGACAT CGATGGAGCA AAAAAATGTA TTGATTTATA CCTCAGTATG TCGTTTGGAG AAAAGGAAAA GGATGCATTG CCCAAGTTGT ATCTGTTGAT TTTCGATGTC TACATCAGAT CGAGTTCTTT ACAGGAAGTC ATGGACTTTT TCGAGTCCAC TTTGAAACGA CAAGCAGAAC TTGGAGTTCA ACTAATCAAC AGCAAAGTCA TTGTACAAGT CATAGATATG GCATCTTCCA ACTATGGATT GAGAGAAATA GAACGTCTTA AGAGTATTGC TGAATATTTG AACTTGGGCA ACAATGAAGT TTATTTGGGG TTGATCAGAA CATATACGAG AATGGACCAA TTTGAAAGAG CTGACGAAAT CAGTTACGAA GCTCATAGAC ATTCTGAAAT TCCTTTCAAT GATTCCAAAG TATATGCGGC CCAGCTAAGA AACTTCAGAT TTTGGCAGAG GAGCTCCACA CATAGTGAGA AAAGAGAATA CTATAGATCT AGAATGACAT TCATTAGTTT CTTAGCCGAC GATGTTCGTT TGGAGAAGTC GCTAGCTCTC AATGCTGGTT TTTTCACTGA AGTAATCAAA TATTTACTTT CAGAGGACAA TGTTAAAGAA GCATATGCCA TGTTGAGGAA GGTTGAAAAA CTTGAGTTGC TTAGGGAAAC TCATTATGTA CCCTTTTTAA AATATTTCTC CAGGACTAGA AATAGTAAGT CGAGAAATGA GGTTATGAAA CTCTACCAAG AGATGATAGA AAAGAATATT ACTGTCTCTT CTCTTGCTTA TGTATTTTTA TTGAGAGTAC TGTTGGAGAC TGACCATGGA CATGGTTTAG GGTTTAAGAA TTCCTTGAGT TTATTGGAAA CAGTTTTCAA ACTTCATGGA ATAGCTTTGA CAAGAGGAGA GAAGTTGGAA CAACCACAAG TCCTGGCTTT AATGCTCTAT CAGAACTATG TCAATCTATG TCGTATGCTT TCTGACTACA TAATTACTAC AAAGGGAGAT TCTGATATGT TGATGCACTT TATGGAACAA GTCAGAAGTA AACTCGGAGA AGTTATTGGA ATAGATTTCA CAATCACATC GTCTATTGAA ATGGCTAAGG TTTATGAAAG CGAGGGAAAC AAGGTGAAAT CGCAAGGATT TATAGACTTT GGAATGGTGG ATTTGCGTCA CTTTATTTCC AGATTTCATG CCAACTATCC ATTCAAAAAA TCAGAAGAAG TTGTTGTTCC GCGAGCATTA CTGAAACCTT TGCGTACTCT TATTACCATG AAAATGAGAA CAGATCTGAG AGATGTCTAT ATTGATATAC TCAACGAATC TCATGACTTA GTTACTCTTT CAGGTATCCA GTACAATCAG TTGATGAAAG GTATCTTGCC AACTTCAAAT ATCAACATTT TGAATAAGAT CCTTTTTGCT TGTGAACATC ACCTAGTTTC AGGTAACTGG GTTGAAATCC AAATAATGAA CAAACTTCAA TATTTATACA AGTTGATCAT TTACCATCTA ATAAGACTTC ATGGAGAGAA TAAAGTTATG TCTCGGTATT CGATATTGAA CAACTATTAC AACGTCAAAA GCCTTTCCCA GTTAGATTCT GAGTTTGCAG AGCTTGAAGA TCCTTATGAA ACTTTGAAAT ATGAAGTCGA GACTCATATT CGAAGTATTA CCACGAGGAA TTACTCGGTC GAAGATGTCA TCAGACATAT ACCAAAGATA TTTGCTCCTG AAAGACACAT TAGAACAAGA AACAAGATAT CCCACACCAA CTCTTCACAA CTCTGGTGGA ATATCAGTAA ACACTGCAAA GACGACATAC TCAAAGCATA CGACTTAATG CAAGAGTATC CCAATCTCAT GGAGTACTTG ATGTACAATA ACAGCGCAAG ATACCGTATG ATTTGTTTTC GAGACGAAAT CAACAAAATA AGGCCACCAA GATCAAGGGA GGGCTACAAC GATAGACGGT TCCGTACAAT AAGTGCACTT
|
Protein sequence | RPISTLATDP RKVLRKNTLQ GDYYQVGTLL EDPSLKDEYL NALEVAETFL DVEIDLISAN YSNLLEVMAG FREKEALDLL VDIWTIVFNK KGRKIKSARK LQQSLLQKRQ QLVDILMDAR RYDIYETLIQ PLLEDSSFSE KRTQSDDFMI ICQLKYQDGK LLLNKHGILR YLRDEYIPMH KKRQFLERLT RSGIIYASND EERYLFIKYF MEFSKVMGDI DMNFVDSQFK VYEKVLRLMA IPPEGFEQHI TELYHQFVKL DFEPEAFNNL LTSLMSATAS APNYVLKYWQ YKVKHSRQMG LAPSRVLNYK DLKYAMQSLL SMNYHKEVEE LYWNFPTLHD DDQIEVLLEL CAVSKDWEGL QKRFEEMYGR GNLPLTVHYA IVMNALATVG AKQEVDNLFQ QLIERKLEAS TSVFISLINS RLYYNDIDGA KKCIDLYLSM SFGEKEKDAL PKLYLLIFDV YIRSSSLQEV MDFFESTLKR QAELGVQLIN SKVIVQVIDM ASSNYGLREI ERLKSIAEYL NLGNNEVYLG LIRTYTRMDQ FERADEISYE AHRHSEIPFN DSKVYAAQLR NFRFWQRSST HSEKREYYRS RMTFISFLAD DVRLEKSLAL NAGFFTEVIK YLLSEDNVKE AYAMLRKVEK LELLRETHYV PFLKYFSRTR NSKSRNEVMK LYQEMIEKNI TVSSLAYVFL LRVLLETDHG HGLGFKNSLS LLETVFKLHG IALTRGEKLE QPQVLALMLY QNYVNLCRML SDYIITTKGD SDMLMHFMEQ VRSKLGEVIG IDFTITSSIE MAKVYESEGN KVKSQGFIDF GMVDLRHFIS RFHANYPFKK SEEVVVPRAL LKPLRTLITM KMRTDLRDVY IDILNESHDL VTLSGIQYNQ LMKGILPTSN INILNKILFA CEHHLVSGNW VEIQIMNKLQ YLYKLIIYHL IRLHGENKVM SRYSILNNYY NVKSLSQLDS EFAELEDPYE TLKYEVETHI RSITTRNYSV EDVIRHIPKI FAPERHIRTR NKISHTNSSQ LWWNISKHCK DDILKAYDLM QEYPNLMEYL MYNNSARYRM ICFRDEINKI RPPRSREGYN DRRFRTISAL
|
| |