Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32169 |
Symbol | |
ID | 4839116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 850294 |
End bp | 853455 |
Gene Length | 3162 bp |
Protein Length | 1053 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640390431 |
Product | predicted protein |
Protein accession | XP_001384830 |
Protein GI | 150865563 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.390072 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACCGT CAGGAGATTC CAATAGTAAT GGAGTTGTCT CTGGTGGTTT AGCTTTATCG CTGGGTGGAC AGAATTCGCT ACAATTTCTG ATTTCGCACA TCCTCGGCAC ATCGGCTCGA GCACCTCATC ATCTCTCAGT GCAAGACAAC TATGTAGCCT ATGCTGCCAG TGGAGGTGTT GTCGTCCGTC AGTTGGACTT GGAAAATAAT AACGCCGTTA TCTCGGAGCG GTTCTTTTGC GCCAACTCCA GTTCAGGCAA TGAAAACACT GCAAACAGCA TCCTGCCATC TGGTCCAGAT GCATATCTCA ACATGGCACT TGAAATGGAA AGTAGTCACA ATTTGCATCA AAATCTCCAT CAGAGTTCCA AGGATGTTCA GCCTGTAAGA GATAGATATG GCTATTCTAT CGCAACTGAA CCTATCGTTG TTGGAGGTAG CAACAATATC GGAAATATTG CAGAATTAAC CCAAAGTGTT CACGACATCG ACTTATCTTC GCCATCCAAG TTGAAAGACA GAGTCAGGTC CATAAACTGT ATGTGCATAT CACCTAACAA ACGTTTGCTA GCCATCGGTG AAACGGGTTA TCAGCCACGG ATACTTCTCT TCTCATTGGC ACCAGACCTG AGCTCAAATC CTGTTGCACT TATATATGAA CACTCTTTTG GTATTAAATC GCTCTGTTTT CTGGCAGATC TGCGCTATCT TTGCTCTCTT GGTCTTGTGA ATGATGGCTG TATAAATGTC TGGAGGATTT CTAACTCTGA TGTGCAGTTA GCAGCTAACA ATAGGTGCTC TTCTGTGGTG AACAGATTGT TCTGGCATGA GGACTACATC ATCACCTTGG GATTACGTTT CATAAAAGTT TGGAGGTTTT CGAGTAAAGA AGACAACGAC AGAATCCTGG ACAAACCTCT AGCACTAAAA GGTAAAAGTG TCCTTTTGGG ATCGCTAATA AGCTCGAATT TCACAGACAT ATCTGCCTTG AACAACGACG AATTGCTCAT AATAACAAAC AACAACCAAT TGCTCTTGCT CAAGTTAAAT AGTGAACTCA AACTCATTTC ATTAGAAACG CCCCCGTTTG ATTTCGATAC CTTATTGGTA GATTATGAGC TTGAAAAGAT ATGGTTTGGA TCCAATTCCA AGCTAGAATC GTATTCTATA AACGATTTGA AACCTAGCTC TGTTTCAACT CCTCTGACGC CTTCTTCCAG AGTAAATTCT GTATTTGGAG CACAAACTAA CGAAAACGTG CGGACAGTTC CTATTCTTAG ACTCTTCAAT CTCAGTTCCA ACTACATCAT ATACCTTTCA CATCGTGAAG AAATTGTATT GTACAATAAG TTCAAGTGTG ACATCGAAAG TAGAGTAGCC AGCTCCTTGA TGAGCGAGCT AGCAGGTTTC AAGAACTGCC ATCTGGGAGA CCTATTGGTG TATTCGCATT CTGGTATGAT TAAGAGGGTT ACAAAGGACT ATGAATTAGA GACCATTTTG AAGTTTAACT TGCCGTCAAA CGAACTCATA TCAAATTCGC TTATGGCCGT AGACTCCAAC AATGATTCAC TTTTGTTGGG AGACAAGTAT GGAACATTGT ATGTTGTCAA GATAACAGAA GAAAAAGCAT CTGAAATTGT ATATCAAATC AAAGCACATT CATCTTCAAT TAACGACATA GTATACTACG AATTTGGAGA CTTTCAGTTG ATAACAAGCA TAGCAAGAGA TCGTATGATA CAATTCTTCT ACAAAAAACC AGGTACCAAC TGGGACATCT TGCAAACTAT ACCTATCCAC AATGGCAATT TATTGAAAAT TCAGTATCAC AACAGCAGGA TATATGTCTG CTCATCTGAT AGAACTATCT CTATTCACAA ACTTGAAGTT GTTGAGAGTG AATTGAGGGT TTTCCAAGAG AAGATATTAT CTATGAAATG TAGTCCGATC ACTTTGAAAA TTGTAGACGA CGATTTGATC GTGTCTACAA ACGACAAAAC TTTGTCAATA TATCTGGTAT CTCAGGGATT TGAGCTATCA CGGACTTTAA AGCTTGTTAA CGGTAAAAAT AACGAGAGCT TACTTGTGGA GAACATCATT GTATTTAAGA ATTTGCTCAT AACTTCCTCG ACAGACAAGT CTCTCAGAGT ATTCAATTAT CATACTGGCA GACCAATGAG TGTAGCCTGG GGTCACCTGG ATGTGATATT AAGCTTGGAA TTAAGTTCCA ATGAAGACTT GATTTCCATT GGGAAGGACG GTTGCTTGTT CACTTGGAAG ATTAATGAAT CGACAGCAAC AAAGAATAAC ACATATAAGG AAGATACAAC ATATAAGGAG GAAAGTAATG TGATTCCTAT GTATGCCAAT GTAACCAGAA AGATTCTTCC TATTTCTCCC ATAAAGATCA ATGCACCCAA GATCGAAACA TGCACAAAAG AGGCGCCATC TCCACGTAAT TCTATATCTC CCAGACTTAC AAACGCAACT TTGAAGAGAA TCGAAGCCCG TAGAGCTAGT TCTCAGAGTC CCACTAGAGA TTCGGGTAGA TCAAAATCGG TTTCTACTGC GAAGCCATCC TTGTCTATCA AACCTTTAGA AAAGGCACAC ACTATAAGCA CACTTTCCTC TACCGCGGGA CCAGGACTTA CTTCTCCAAG AAGACCATTG TCTCCCATTA GACGCAGTCC ATCCAGAAAT CTGTTAGATC ATTCACCAGT AAGGAGTCTG TTGGATCATT CACCCATGAA GCTTTCAAAG CCACATATCT TATTTAGTCA TGATGAAAAA AGAACCGCAG ACCAACCTTT CGTGGATACA GCTCTAGCCC AATTGCAGTT TATTGATTCT AAACTTCAAA GAGAAGTTAT AAGCAACAAC GATAAGGCCA AATTGTTAAC CAAACTCGAC TCTATTTTTA GACAATTAGG TGGAGATAAG GAGCTGACTA AAAGTAACGT TCGAGACAAA AGGACTGAAA TTAGCCAGAA TAGAGAAGCA GATGAAAGAG AATTGTTGGA GTCATACAGC GATAAGCTTC TTCAACTTAT GGAATCTAAA CTTGAGTCGA AGTATTCCAA GCAAGTTCCT CCACTTTTCA TTGGAGAAAA CCACTCTTCG ATTTCAACAA CTTCCCAAGA CTCGCTGGAG GACATAGATT AA
|
Protein sequence | MPPSGDSNSN GVVSGGLALS SGGQNSLQFS ISHILGTSAR APHHLSVQDN YVAYAASGGV VVRQLDLENN NAVISERFFC ANSSSGNENT ANSISPSGPD AYLNMALEME SSHNLHQNLH QSSKDVQPVR DRYGYSIATE PIVVGGSNNI GNIAELTQSV HDIDLSSPSK LKDRVRSINC MCISPNKRLL AIGETGYQPR ILLFSLAPDS SSNPVALIYE HSFGIKSLCF SADSRYLCSL GLVNDGCINV WRISNSDVQL AANNRCSSVV NRLFWHEDYI ITLGLRFIKV WRFSSKEDND RISDKPLALK GKSVLLGSLI SSNFTDISAL NNDELLIITN NNQLLLLKLN SELKLISLET PPFDFDTLLV DYELEKIWFG SNSKLESYSI NDLKPSSVST PSTPSSRVNS VFGAQTNENV RTVPILRLFN LSSNYIIYLS HREEIVLYNK FKCDIESRVA SSLMSELAGF KNCHSGDLLV YSHSGMIKRV TKDYELETIL KFNLPSNELI SNSLMAVDSN NDSLLLGDKY GTLYVVKITE EKASEIVYQI KAHSSSINDI VYYEFGDFQL ITSIARDRMI QFFYKKPGTN WDILQTIPIH NGNLLKIQYH NSRIYVCSSD RTISIHKLEV VESELRVFQE KILSMKCSPI TLKIVDDDLI VSTNDKTLSI YSVSQGFELS RTLKLVNGKN NESLLVENII VFKNLLITSS TDKSLRVFNY HTGRPMSVAW GHSDVILSLE LSSNEDLISI GKDGCLFTWK INESTATKNN TYKEDTTYKE ESNVIPMYAN VTRKILPISP IKINAPKIET CTKEAPSPRN SISPRLTNAT LKRIEARRAS SQSPTRDSGR SKSVSTAKPS LSIKPLEKAH TISTLSSTAG PGLTSPRRPL SPIRRSPSRN SLDHSPVRSS LDHSPMKLSK PHILFSHDEK RTADQPFVDT ALAQLQFIDS KLQREVISNN DKAKLLTKLD SIFRQLGGDK ESTKSNVRDK RTEISQNREA DERELLESYS DKLLQLMESK LESKYSKQVP PLFIGENHSS ISTTSQDSSE DID
|
| |