Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31755 |
Symbol | |
ID | 4838484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 1610273 |
End bp | 1613533 |
Gene Length | 3261 bp |
Protein Length | 1086 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640389799 |
Product | predicted protein |
Protein accession | XP_001384263 |
Protein GI | 150865161 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.115021 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCAAAG TCAAGACGGA AAACGAAAAA GAAATCAAGG ACAAATCAGC GTCCAACCTC AAGTTGATCT CAACCAACAC GGGGACGAGA GTGTCACAGG CTTGTGACCG GTGTCGTATC AAGAAGATCA AATGTGACGG TCTCTATCCG TGTCACAACT GCAACAAGAT TGGGTTTGAG TGTAGGACAA GCGACAAACT CACCAGAAGA GCGTTTCCCA AAGGGTATAC AGAAAACTTG GAGAAGAAAC TTAAAGAGTT GGAGCAAGAG ATGATCGATT TGAAGGCCAA GTATGGCATA GTAGATGACG CTGGTAACTC TGAAGTTGGA GCTGGAGGGG CTGTTAACAG TCTCAGTGTG AACGACACCT CTGTAGCGAA TACACCTAAT AACACATTTA CTCCTTCAAC TTCTAACGTT CATAATCCTA ACTCTTCTGA TGACAAGGTG CTATTGACCA CCACCAACCA GACGGTGCAA ATCAACAACC CTATTGACCA GATCTTCAAC CTAGACAACA AGGGTATAAT TATCGGAAAC GATAATCTCA ACTTCGAGTC GCAGTTCAAC CATTTGCTCA TCAACTTAAA TCTCCCATTT CTCAAGATCA CCAATTCACA TAACTACTTG CTAAACGATC CAAACAGCTA CTTGTACCAT CCTTCGTACC ATAACTACAA CCAGTTCCAT AACAAAGATT TGGATGTGAT CTACAACCCA TTAACGGGAA ACAGCAACGT CAACGAGTTC AGCTTGGTCA ACAACCAGTT GCCTACAGAT ATCTACGACT TGTTCATCAA GCTAATCAAT AACTTCAAAA AGTTGTTCAA CAACAAAAAG GAGTTGGACA ATCAGATCAT TCAGTTCTTT CTCAACTACA ACATCTTCTT ACCCATCTTT GACTACAAAC AGTTCATGGA ATCGTACGAT GCTTTTCACA CGATGTATCC CTTCATCTTC ACCTATGATG ATTCGACTAT CAACGGGTTC AATCTCTCCA ACAGCAATGA CTACCACATT GTGAACCAGT ACCTCATGAT CATCATCCAG ATATATGCCA TGATCTACAT GAACAACCCT ACCATCAATT TGAACTTGTT GTTGAACCAT TCAGACCCAA ATTATACCTT CCACAAAAAG TCTCCTAAAG ATAACTCGCC CAATATTATC AAGTCCTTGT ACGATTTCCT CCCCTATTTC AATGTATTCC ATGTATCTGT TAACCAACTA CAAACCTATT TATTATTCTT GTATTACTCC CTCTTGACCA ACAACAAGGA GAAGTCCTTG ATTTTGTCGT CCCTAATTAA TTCATTTATT GGGATCCTTG GTATCAACCT TAATTCCAAA AACTTGTTCT TCGACGACTT GTCGCTTAAC ACACTTCAGA GAAGGAACAG AGTGAAAACA TTCTGGGTGT TCAAGGTCTT GTTAAAGTGC TTCAATTTAA AGTTTGGCTT CAAGCCTAGC TTGAATACCA CTGTTATTAA TCCTGTTACT ATCGACAAGT ATTTTCAATT GACACCAGAA AAATTGAGTA CCTTGCTCGA CAACAGTGAT GACTTGTTCA ACACTTTGTT GAAGCCAAGT ATCGAGTTCT TGAACTTGAT GAATATTATC ATTCCCTCGT CGTTTGCGCC TAACTATTAC GAATATCTTA AAAATAAGAA GAAGAAAAAA GATCCAAACC ACCAGCATCA TACCAAGCAT CTTGATTGGA TCCTTAAAGA AGATGATGGT GAAGGCAATG ATGGTAACTT GAACTATAAC TATGCTCAAT TCTTAACTAT TGACAAGAAT TTGTCCAACT GGAGAAATTC GTTGAAAACC AAGACCATCA ACTTATTGCC CTTGGCAGAT GAAATGGGCT TACCTAACAT TTCCAACATT TCTTCCAATG ATCTCTTTCA TGAATTAAGA ACCGGGAAGT TATCATCAGG TATCACCCAT GAAGGTTTAC TCAATTACTA TTCTACTGGG ATCCCGGATA TCTATACTGC TTCTCAGTTG ATAAAAATTC AGTTGAACTT CCACTACATA TTGATTAGAT CCATGAATTA TCTTAACTTC ATTGTTGACA AGGAATTAAC TTATCCATAT TATATCGAGA TTTCTCATAT ATCTCGGGAG GTATTGCATT ACTTCTTGTT TATATTCGAT CACATAAACA AGTCCAATGA GAAAACGATT GATCCGACAC CTTCTTCGTC CAATATTGTC ATGGAAAGTT TAGGTCTCGA TGTAGATGAA GATGGCTTTG TCATTAATGA TTTTTCTGTG AAAAGAAGAA AGACTAATGG TTCCGATAGA CACAATATCG CTAAGAGAAT TACCAAAGAA ATTCCACCTT CGCCATTCAA CTATATGTTG AATGGTTTAT CTATGACCAT TATCAACTTT AAGAAATCTA TGATCTTACA ATTGTTGTTT CTTTTAATTT GTCAATTGAA GTTCTTCAAA CGTAATGATT TTGATTTGAT CAGCGACTCC ATTCCTTTGT TAAACGAAAG TGTTGAATTG TTCATCAAGA TTTTCATCAA CTACAAGGTC GGAGTCAATA GAAAGGACCC CAAGAAGGTG AAGGAAGATA AACTCTTCGA AAAACTTATG AATGATCAGT TGCGTGATGA AATCTTGAAG GACCAAGCCG ATAGCGATTT TGACGATGAT GACGAAGCAA GCAATGCATA TCAAAGCATT GACTGGGATG ATGAAGAGAT GGACGAAGAT TTGAAATACT TGAAGATTCT CAAATTCGTA AAATACAAAA GTAACGATAT CTTGAACAAA CTTATTGGGA AGAACTCTTC AAGTGTTCCG GAACCAAGTC AAGTCACTCA CCATCATCAT GTCGAGCAGC CATCTCAGCT TCATGTCCAT CCTCACATCG ACAGCAATCG TTCATCTTTG TCAAAGCCAA CAGGGCCTCC AATATTGCCT CCGCCTGTTA TCAACAACTT CTATGCCAAT TCTCCTAGTG CTGGTTCGCC AGGTTTGGGC TCATTCAACA AGATCCCATC TATGTCTAAG TTCGATTTTC TCCTTTCAGA CAATGACTAC AGCAACCAAG CGAAATCGCA ATCGCATACT CCTCAAGATT ATGAGCTAGC CAAGAAAGAG AAACTAGTCA TAAATGACTT AATGAACTTG CGGCACGGAT CTCAGGGTAG CAATAAGTAT CCTGGACCAT CTCTGAATCC CAATTGGATT CATTCGTCGT CCAATTCGAA CACAAACGGA TACAATGGTT CAGGGCACTA A
|
Protein sequence | MVKVKTENEK EIKDKSASNL KLISTNTGTR VSQACDRCRI KKIKCDGLYP CHNCNKIGFE CRTSDKLTRR AFPKGYTENL EKKLKELEQE MIDLKAKYGI VDDAGNSEVG AGGAVNSLSV NDTSVANTPN NTFTPSTSNV HNPNSSDDKV LLTTTNQTVQ INNPIDQIFN LDNKGIIIGN DNLNFESQFN HLLINLNLPF LKITNSHNYL LNDPNSYLYH PSYHNYNQFH NKDLDVIYNP LTGNSNVNEF SLVNNQLPTD IYDLFIKLIN NFKKLFNNKK ELDNQIIQFF LNYNIFLPIF DYKQFMESYD AFHTMYPFIF TYDDSTINGF NLSNSNDYHI VNQYLMIIIQ IYAMIYMNNP TINLNLLLNH SDPNYTFHKK SPKDNSPNII KSLYDFLPYF NVFHVSVNQL QTYLLFLYYS LLTNNKEKSL ILSSLINSFI GILGINLNSK NLFFDDLSLN TLQRRNRVKT FWVFKVLLKC FNLKFGFKPS LNTTVINPVT IDKYFQLTPE KLSTLLDNSD DLFNTLLKPS IEFLNLMNII IPSSFAPNYY EYLKNKKKKK DPNHQHHTKH LDWILKEDDG EGNDGNLNYN YAQFLTIDKN LSNWRNSLKT KTINLLPLAD EMGLPNISNI SSNDLFHELR TGKLSSGITH EGLLNYYSTG IPDIYTASQL IKIQLNFHYI LIRSMNYLNF IVDKELTYPY YIEISHISRE VLHYFLFIFD HINKSNEKTI DPTPSSSNIV MESLGLDVDE DGFVINDFSV KRRKTNGSDR HNIAKRITKE IPPSPFNYML NGLSMTIINF KKSMILQLLF LLICQLKFFK RNDFDLISDS IPLLNESVEL FIKIFINYKV GVNRKDPKKV KEDKLFEKLM NDQLRDEILK DQADSDFDDD DEASNAYQSI DWDDEEMDED LKYLKILKFV KYKSNDILNK LIGKNSSSVP EPSQVTHHHH VEQPSQLHVH PHIDSNRSSL SKPTGPPILP PPVINNFYAN SPSAGSPGLG SFNKIPSMSK FDFLLSDNDY SNQAKSQSHT PQDYELAKKE KLVINDLMNL RHGSQGSNKY PGPSSNPNWI HSSSNSNTNG YNGSGH
|
| |