Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_53245 |
Symbol | |
ID | 4851842 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2980794 |
End bp | 2985587 |
Gene Length | 4794 bp |
Protein Length | 1584 aa |
Translation table | |
GC content | 38% |
IMG OID | 640393550 |
Product | predicted protein |
Protein accession | XP_001387136 |
Protein GI | 126275762 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0312861 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACG TCCAACAGCT CACTGTAGGT CTTTCAAACC TTGCATCGGA GTCCAAGAGG AGATACACGG ACGTAAGGCA TGGCTGCGAT ACAGCACTAG CTGATTGTAA ATTGTACCAG CCTAATAAAT CGATCCATGA TATCACCAAT GAACAGCACA GGCAGCATAT AATAGCACCC TTTGTAGCGT CGTGCCGGAC AGGTAACGCC AAGATTGCCA CCATAGCCAT TCCCACTATA CACAAATTGA TCATGGCAGG TGTAGTGCCA CTGGGTAGTT TAGGAAGTTT GGTAGATTCG TTGATGGAAG CTTCGCATCT AGCAGTGGAT ATTCAGTTGA GAATTCTCCA GAGTCTTCCC TCATTGATGC AAAATTACAC GAAAGAATTT ACAGGCGAAT TGCTTGTCAA GCTACTTGCG TTGTGTTCGA GTTTGACCAC CAACAACAAG TCAACTGTGG TTATCAATAC TGCATCAGCT ACTTTGCAAC AGCTCTTCAG TAATGTGTTT GATAAATACA AAGAGCACAC AGAGGAGACT GAAAGAAACT TGGAAATCAC CATTGACAAT AATGAGATTA TTAAAGTAGA CGCTATTGCC CTTGAAGGGT TCAGCGTCTT CCAGGATATC ACTCATGCCA TAGAAAACGA AAAGCAAACT TATCTTAATG ACACAATCCA TATGAAAGTC ACTTCGGCTT TAGAAATCAT AGAGAATATT TTGACGAACC ACAAGGAAAC ATTCCATTCG CATCAAGAAT TGGCGTATTT GCTCAGAGTG AAAGTCATAC CTTCTTTGCT CAGAATTCTA AACTCACCAC ACAAGAACTT TCCTCTTATC ACAAGAACTA TGAGAGTTAT CCATGTACTA TTGGCTACAC AATTAAAAAA CTTGGAGATT GAGAGTGAAA TTGTTTTGTC ATTTTTAAAC CATATTCTTC TCAACAACGA AGGTGGCGCA AAAGTAGTCA ACTGGGAAAA GATTCTCGTC TTGGAAATGT TCAAATCGCT TTTCACAGAT TTCAGAGTAT TAAAATCCAT CTATGAGACT TACGACAATA GCTCTCGTAA GAAGAACGTT ATCCACGAGC TTGTATCCAT ATTAAGCACA TTTTTACTGT ACAATTCGTA TCTTTTCAAC GATATAGTGA AACCACTTCC AAAGTTTCCC ACTCGTTCTG ATTTAAGTTC AGCTCCTACT CTGCATGATG GCGCTTATCC TATATACCTT TCGAAGGTGC TGTCAACCAT GAAAGTTGCT ATTATTGACC ACTTGGATAA ATCTGAACCC CCATTGGCTA TACCCCAAAC TTATCCAGTC TTTCTCATAT ACAACATTCT CTTAGCATAT GCTGATGGTA TTGCCAAATT TGTGCAATCG TTATCAGATA ATTCCGATAC CAACAATCTT GAGGCTGATG TAGAATTCAC AAACGCCTTC ATCGAATCTT GTTTCACAGA AATCTCTGCT TTGTTTGAGA AGTTTATCTA CACTCTGATG GATGATGATG CTTTCCATTT GCTTGTCAGA TCTTTACAGA GATACACACA TACTACTGGC CTTTTGGGCC TTGGTAAATT GAGAGATAAA CTCTTGATCA TTATTTCTAC GTCTGTAACG AGAAATACTT TAAATGAGGA AACAAACAAT GGAAGCTCTT CTGGATTCAG TGAGCAAGGA AAGCAGTTGT TTGCATTCGG GGAATCGTTA GTAGAATCAT TCGGTGCCAC ATTGCAACCC CATATTGGTG ATGGTAATAG CAACAATGCC CAACCAGTTC AACTTAGATC TAGATACTTC AACTCAAGAC ATGTGACATG TTTGAGAGCA TTGGCTAATC TTGCAGTTTC TTTAGGTTCT ACCTTGCAAG ACTCATGGAA GATTATCTGG AAGACTTTTC AATGGTGTGA TTATTTCCTT TATGGTCCTG ATGAATATAG CGGATATTAC AACCATAAAC TGTACAAGAA CTTTACTGAT TCGATGTTAC CACAGTTGTC TACTTCGGAT ATTAGCAATT ACGATTTTTC AAGGAGGAAG ATGTTCGACA GTCTTAGCGA ATATCCTGCT GAGTCATTCC AACGGTTGTT GGAGGCATTG ATTCAGCTTT CAGAGTTGCT GTTTGAAGTG GAAGAGGATT CTCAAGAAGT AGAACTCAAT GAAAAGTCAA ACGAAAGTAA TGAATCTATT GACGATGATT TAGAAGTGTG TCCATACAAC AAAACATTTT ACTTGTTGAA GGTTTTGAGC TTCGCAGAAA ATGACTCAAA TCAGTATCTC ATCAAGTTTG ACAAACCGTG GGACTCTTTT TGCAAATATT TTATCAAGTT GGGAACAAGA AGAGATTTGA ATTACAATCT TCGTATTTTC ATTATCACTA CCTTTAATGA TATTGTGAAA TCTGTAGCAG ACCAAGGGTT CAAGTCAGCT GACGAAATTA CTATAAGACA AACTTCGGAA AAATCTCTAG GGGCATTGAA TAATTATCTT GTTGCGATGT TCGAATTAGG AATCCCACAA GAGCTTTTGG TTTTGAATTG TGAGACTGAG CTTCATCTTT TGACTTTAAC AACGTTGCAT GAATTGATAG ATAAATACGA CACTTATTAT CAGAGTTCGT GGCATACGGT ATTCACTATT ATAAATACCC CGTTCAAGAC TGTTGGCTCA CTCAGCGAGG ACAATAATTT GAAAGAAAAG AACCGTTTAT TGATTGAAAA GTCATTTGAT ACTTTGAAAT TGATTTTGGA TGAGTTTATG TCTTCCTTGC CTTACGACCA ATTGAAGCTT CTAATTGATA CTCTTCATAA CTTTTGTTCT CAGCATTACG ATTTGAACAT TTCCTTCAGT TCAGTTAGTT ACTTTTGGAT GATCAGTGAC TCCTTGAAGT CGCGAATTTC AATTGTTACC GAAATGAACC GTGATGAATT AAACAAAGAA CAAGAGAAGT TGACGAACAT CAGTGAGCTC ACAGCTTACA TTGATACTCA TGATTCGAAA GAGTCATATC TATTCTATAT ATTAGTTGAT GATTACTTAT TGTCAACTTT GGTTAAACTT TCCTTCGATG ATAGAGCTCA AGTTAGAGAT GGATCCATCC AGACCTTTTT CCAAATTATA GATGTTCATG GTGCCCTTTT GACAGCGTCG ATGTCGTGGG ACTTGTTTTA CAGAATTGTT CTTCCCGATT TGTTGAGTGT TAAAGTAATA AATAGTGCCA ACATTGGAGA TTGGGTCGAG AGCTTGAATT TGATCCTTTC AGGGGTGATT GCGCTCTACG GAAAATTTAT GATGGACTTT AACGAAATTC CCAAGGTTCA TGAAAAATGG GAAAGGTTAA TTCAATATTT CAATGACTTG CTAGATTTGA AAAGTATTGA ATTGAACTTG AAAGTGTTCG GTTCCTTTCA AGATCTTTTG ATTTCGTTTA GAAATGTTGA TATTTCAAAG ATTCATGAAT TCAATAGAAT TCGTACTTTG TTGTTCAAAT TTTGGGTGGG AATTCCGATT GAGTATGACT TTGTCAATGT TTCATACCAG GAGTCCGTAA CTTCACTTAT GGATTGTTTT CCAGCACTTT ACAATTTAAT AGCCGGCCAG CTCACGTTAG AAGAAGTCAA TACCATTTTG ACTGTTTTGA ACAAATGCGC TAGGAATCCT GTTTTACCGA CTTCATATTT GGATAATGTC AGGCCTTCCA AGTTGCAAAG TTCTGTGATT AAGAATTTGA CAATTATTTC AAGTTCTGAT CCAAAGATTC AGTCTCAAGT CATTCAACAG CTCAGTAATA TCTTGGTGTA TCCATTTGGT ATTAGGTCTA CTATCGAACT GAAATTAAGT AGCAATAAGT TGATCACAAA CAAGTTTAAG ATTCCGACGT TCATAGCCAT AAGTCATATT TCAATCAAGT TATTGAAGAC TAAATTGAAG GAATTGGCTG ACTCCAAAGT CTTGGTCGAA GACAAGGGTA TAATCAAGGT GTTGAAGTCA TTGCTAGAGA TTATTAGTTC AAAGTCTGTT GGTATTGAAA CTGAAGAACA GGCACTCTGG ATTGAAGCTA GCGAGATTCT TAAGGATCTA GTAGAGCAAC TTATCAAAGA CAAGAATGTT GGAGGAGACA AGGAGTTGTG GAGATTGATC ATCAACGCAG TGAAATTATG TTACGAGTAC AAGGACCAAG GGGCTATTTT TGAAGATTTC AACATCAAGC AATACCAAGA ATTGAGCCGC ATGATATTGC CAACTTTGCT CGAAGACAGC CAGCATCAGG AGTTGATCAG TGATTGGATC GAAAGCATTT ACCGTAACTC GTATCTTTAT GAGTTTGATG AGTTGGAAAC TTCTATCATG GCGGAAAATG AATCTACGGA AGATTTGGTC ACACACTTTC TGAACTTTGA CTTTGATGGC TCATTTGGGT CCACAAAGCC GCTTGTCAAG CATCGTAATA AGATTACGAG ATTCAATTGT TTGACGGAAT TGATTCGATT CTGTCAGGAT CCAGTCAATG AACAGTTGAA TGTCACCAGT CAATACTACT TTGCTTGCCG TGCTTCATTA TGTCTTCGTA GATTTATCAG CGACGCAAAA CTATTGAACA GATGTCCTAT AGCTAAAGTT CAAGAGGAGG AGCTTATACT TGTGTTGAAT GGGTTAGCGG ATATCAAGAG CGTCACGCAA GAGAATAGAG ATAACTTGCG AAAGTTGTAT CCGTTAGTGG TAAAGTTGGT TCCCTTCACA TCGAGAATCA GCGGACTAGA CGTTTTGGTG GAAAAAGTTT TACATAGATT TTAG
|
Protein sequence | MSNVQQLTVG LSNLASESKR RYTDVRHGCD TALADCKLYQ PNKSIHDITN EQHRQHIIAP FVASCRTGNA KIATIAIPTI HKLIMAGVVP LGSLGSLVDS LMEASHLAVD IQLRILQSLP SLMQNYTKEF TGELLVKLLA LCSSLTTNNK STVVINTASA TLQQLFSNVF DKYKEHTEET ERNLEITIDN NEIIKVDAIA LEGFSVFQDI THAIENEKQT YLNDTIHMKV TSALEIIENI LTNHKETFHS HQELAYLLRV KVIPSLLRIL NSPHKNFPLI TRTMRVIHVL LATQLKNLEI ESEIVLSFLN HILLNNEGGA KVVNWEKILV LEMFKSLFTD FRVLKSIYET YDNSSRKKNV IHELVSILST FLLYNSYLFN DIVKPLPKFP TRSDLSSAPT LHDGAYPIYL SKVLSTMKVA IIDHLDKSEP PLAIPQTYPV FLIYNILLAY ADGIAKFVQS LSDNSDTNNL EADVEFTNAF IESCFTEISA LFEKFIYTLM DDDAFHLLVR SLQRYTHTTG LLGLGKLRDK LLIIISTSVT RNTLNEETNN GSSSGFSEQG KQLFAFGESL VESFGATLQP HIGDGNSNNA QPVQLRSRYF NSRHVTCLRA LANLAVSLGS TLQDSWKIIW KTFQWCDYFL YGPDEYSGYY NHKLYKNFTD SMLPQLSTSD ISNYDFSRRK MFDSLSEYPA ESFQRLLEAL IQLSELLFES NESNESIDDD LEVCPYNKTF YLLKVLSFAE NDSNQYLIKF DKPWDSFCKY FIKLGTRRDL NYNLRIFIIT TFNDIVKSVA DQGFKSADEI TIRQTSEKSL GALNNYLVAM FELGIPQELL VLNCETELHL LTLTTLHELI DKYDTYYQSS WHTVFTIINT PFKTVGSLSE DNNLKEKNRL LIEKSFDTLK LILDEFMSSL PYDQLKLLID TLHNFCSQHY DLNISFSSVS YFWMISDSLK SRISIVTEMN RDELNKEQEK LTNISELTAY IDTHDSKESY LFYILVDDYL LSTLVKLSFD DRAQVRDGSI QTFFQIIDVH GALLTASMSW DLFYRIVLPD LLSVKVINSA NIGDWVESLN LILSGVIALY GKFMMDFNEI PKVHEKWERL IQYFNDLLDL KSIELNLKVF GSFQDLLISF RNVDISKIHE FNRIRTLLFK FWVGIPIEYD FVNVSYQESV TSLMDCFPAL YNLIAGQLTL EEVNTILTVL NKCARNPVLP TSYLDNVRPS KLQSSVIKNL TIISSSDPKI QSQVIQQLSN ILVYPFGIRS TIELKLSSNK LITNKFKIPT FIAISHISIK LLKTKLKELA DSKVLVEDKG IIKVLKSLLE IISSKSVGIE TEEQALWIEA SEILKDLVEQ LIKDKNVGGD KELWRLIINA VKLCYEYKDQ GAIFEDFNIK QYQELSRMIL PTLLEDSQHQ ELISDWIESI YRNSYLYEFD ELETSIMAEN ESTEDLVTHF LNFDFDGSFG STKPLVKHRN KITRFNCLTE LIRFCQDPVN EQLNVTSQYY FACRASLCLR RFISDAKLLN RCPIAKVQEE ELILVLNGLA DIKSVTQENR DNLRKLYPLV VKLVPFTSRI SGLDVLVEKV LHRF
|
| |