Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32792 |
Symbol | |
ID | 4840028 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 648024 |
End bp | 650330 |
Gene Length | 2307 bp |
Protein Length | 768 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640391343 |
Product | predicted protein |
Protein accession | XP_001385474 |
Protein GI | 150866015 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.980986 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0911162 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGGCT TGTTAGACGC AGAAGCCAGT GAAGATGACG AAGAAGATGA GGAAAATAGT GAAGATGATG AAGAGGAAGA CTCTGACAAG GAGTTGAACG AGTTGTTGGG AGAGGAAGAA GACCCAAGCG ACTACGACTC AGAAAATTTC TCCGATGAAC CTCAGGAATC TGATACTAGA TCTATAACCG ATGCCATCTC TGGTGTAAAG ATCAGATCTC TTTCAGACAT TTCTTCCCAA GACAAACAAG AATTCCACAC CAAGTATTCT GACGGTTCGG AAAGAATCAT CAAGCCGGAA ATAGAGCCTG TGTATGACAG TGACGACAGT GATGCCGAAA ACTTCAATAC CATTGGGGAC ATTCCCTTGT CTGCCTACGA TGAAATGCCA CATTTGGGTT ACGACATCAA CGGGAAGAGA ATAATGAGAC CAGCCAAGGG TTCTGCTCTC GATCAATTGT TGGAGTCTAT TGACTTGCCT CAGGGCTGGA CCGGACTTTT GGACCAAAAC ACTGGTTCAT CTTTGAACTT GACAGATGAA GAGTTGGAAT TGATCAGAAA GATCCAACAA CAGGAAAACA CAGATGCAAA TATCGATCCA TACGAAGCTA CCATCGAGTG GTTCACATCT AAGGTGGAAG TAATGCCCTT AACAGCAGTT CCTGAGCCAA AGAGAAGATT TGTTCCTTCC AAACACGAAG CCAAGAGAGT TATGAAGATT GTCAGGGCCA TCAGGGAAGG AAGAATTATT CCTCCAAGTA AGGTGAAGGA GCAGATTGAA GAAGAAAGAC TCAACTATGA CTTGTGGAAT GACGATGACA TCGCTGTTGA AGACCATATC ATGAACTTGA GAGCTCCTAA ATTGCCTCCT CCAACCAACG AAGAATCCTA CAACCCCCCT GAGGAATACC TTTTGACCGA AGAAGAGAAA AAGCAATGGG AACTGTTGGA TCCTGCTGAC AGAGAAAGAA ATTTCCTTCC TCAGAAGTTT GGAGCTTTGA GAAAAGTTCC TGGCTACCAA GAAAGCGTGC GTGAAAGATT CGAAAGATGC TTGGACTTGT ATTTGGCTCC TAGAGTTCGT CACAATAAGT TGAACATTGA TCCCGAAAGT TTGATTCCTG AATTACCCTC TCCAAAGGAT TTAAGACCAT TCCCTATCCG TTGTTCTACC GTCTACCAGG GCCATACTGA CAAGATCAGA ACTATTTCTA TTGATCCTCA AGGCTTGTGG TTGGCCACTG GTTCAGATGA TGGTAGTGTC AGAATTTGGG AAATCTTGAC AGGAAGACAA GTGTTCAATG TTCAGTTGAT CAACAAAGAA ATAAACGACG AAGACCATAT CGAGAGTTTG GAATGGAACC CAGACTCCCA AACCGGGATT TTGGCTGTCT GTGCTGGTGA GAACATCTAC TTGGTTGTTC CACCAATTTT CGGCTTTGAT ATCGAAAACA TGGGTAGATT GAGAATCGAA TCCGGTTGGG GTTATGACAC TTTTGGTAAC AAGACCAAGG AGGAAAAGTT CAAGAATGAC GAGGGCAATG AAGATGAAGA TGACGAAGAT GATAGTGCCA CTTCCACTGC TGTCAAGAAG GACGTAGCCA GATGGTTTCC TCCAAATCAG GAACAGACCA AGCTCGGTAT ATCTGCCATT ATCCAGTGTC GTAAGACTGT CAAGAAGGTG TCGTGGCATA GAAAAGGAGA CTACTTCGTC ACCGTGTCTC CAGATAGCAA GAACACAGCC GTATTGATTC ATCAATTATC CAAGCATTTA TCCCAATCTC CATTCAAGAA GTCCAAGGGT ATCATCATGG ACGCCAAATT CCATCCATTC AAACCACAAT TGTTTGTAGC CTCGCAACGT CAAGTGAGAA TCTACGACTT GGCCCAACAA GTATTGGTCA AGAAGTTGAT GCCAGGTGTG AGATTGTTGT CTACCATCGA TATACACCCT AGAGGTGACA ACTTGTTAGC ATCTTCTTAC GACAAGAGAG TATTGTGGCA CGACTTGGAT TTGAGTGCCA CTCCTTACAA AACTTTAAGA TACCACGAGA AGGCAGTCAG ATCAATCAAG TTCCACAAGG GTAACTTGCC GTTGTTTGCA TCTGCCTCTG ACGATGGTAA CATTCATATT TTCCACGGTA CCGTGTACGA CGACTTGATG ACTAACCCAT TGTTAGTGCC TTTGAAGAAG TTGACTGGTC ACAAGATTGT GAACAGTATT GGTATCTTGG ATTTGATTTG GCATCCAAAG GAAGCCTGGT TATTCAGTGC CGGTGCTGAT GGAACCGCTC GTCTCTGGAC AACCTGA
|
Protein sequence | MDGLLDAEAS EDDEEDEENS EDDEEEDSDK ELNELLGEEE DPSDYDSENF SDEPQESDTR SITDAISGVK IRSLSDISSQ DKQEFHTKYS DGSERIIKPE IEPVYDSDDS DAENFNTIGD IPLSAYDEMP HLGYDINGKR IMRPAKGSAL DQLLESIDLP QGWTGLLDQN TGSSLNLTDE ELELIRKIQQ QENTDANIDP YEATIEWFTS KVEVMPLTAV PEPKRRFVPS KHEAKRVMKI VRAIREGRII PPSKVKEQIE EERLNYDLWN DDDIAVEDHI MNLRAPKLPP PTNEESYNPP EEYLLTEEEK KQWESLDPAD RERNFLPQKF GALRKVPGYQ ESVRERFERC LDLYLAPRVR HNKLNIDPES LIPELPSPKD LRPFPIRCST VYQGHTDKIR TISIDPQGLW LATGSDDGSV RIWEILTGRQ VFNVQLINKE INDEDHIESL EWNPDSQTGI LAVCAGENIY LVVPPIFGFD IENMGRLRIE SGWGYDTFGN KTKEEKFKND EGNEDEDDED DSATSTAVKK DVARWFPPNQ EQTKLGISAI IQCRKTVKKV SWHRKGDYFV TVSPDSKNTA VLIHQLSKHL SQSPFKKSKG IIMDAKFHPF KPQLFVASQR QVRIYDLAQQ VLVKKLMPGV RLLSTIDIHP RGDNLLASSY DKRVLWHDLD LSATPYKTLR YHEKAVRSIK FHKGNLPLFA SASDDGNIHI FHGTVYDDLM TNPLLVPLKK LTGHKIVNSI GILDLIWHPK EAWLFSAGAD GTARLWTT
|
| |