Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_14509 |
Symbol | |
ID | 4838180 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 469963 |
End bp | 473379 |
Gene Length | 3417 bp |
Protein Length | 1084 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640389495 |
Product | predicted protein |
Protein accession | XP_001383724 |
Protein GI | 150864756 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.644497 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AAGCTGCTCC AAGCCGCTGA GAAGGCGGAG AACAACTGCC TAGTGGAGAT TCTCGGAGCT GTTAAGCCGG CCCGCTCCTC CAAGTCTACC ATCACCTCCA TAGTACGAGG ATTGTACACC AAGAACGCAA AGTTGTTAAC CATCTTGGAC GCAGACATAA AGGAAGTGAT AGACAATCCC GAATTCGAAC TCGACACCTA CAAAGGGTTT GTCAATAGTT TCATTCTCTG GCTTGATGCA GATGCCATGG TGTTGTTTGA CAAGTACGCA GCCTTCATAA CCAGCAAGGA TAGCTACCAC AACTTCACGA ATTTGGACAA AGTAGAATCA TTTGCGGGAC CCCTAACTAA CTGTGTACAT TATCTTGATT TCATTGCTGC CACTTCGAAT CTTTTGAGAA ATCCGTTTGT TATCGATAAG TTGTCCACAA TCAAGGAGTT CATGGAAAGA CTTCTTAATA ACTACCGAGA GCTTCTCGAG CTGAGCCGTT TGAATGATAT CGGTTTTGCC AACATCAATG CCTTCGTGAC ACCCACAACA CCTACAGTAA CCCACAAGCC TCCTTCCAGA GGCCCTCCCA AAGTCTCGGG TATTTTCAAG TTGGAACAAA TTGTCGAAAG AACAAAAGAT GAGCCTCTTT TGCTTCACGA CAAGGATCAC AAGGAAGTGC CCATAGAGTT GGTTTTATTG GATATTGGTG GCAGTAGAGG TTCTCATGGT AAGAAGTACA ACTCATTGGC TATTCTCCAG GTCCATTCTG GAACAAAGGA GTTCAGTAGA TCGTTAATGT TTCCTCCCTT TAGAGTCAAC GAGGTATCTG TAGATTTCAA GAGGGATACG TTGTTCTTGA AAGCTATCAA TTTCTCTCAT CCCGAGAATA CAGACTCAGT TATCTCCATA ACTGGGTTGG ATTCCCAGTT TTTGTATGAC TGGCACAAGA AGCTCACAGC CATTTTCCCA GAGGAAGCAA AACATTCGCC AGTAGGGGAG GACTCCTTCT TGATTACTCC TGTAAATAGT CCCACCAAGA TGGCAGGATT AGGCATCAAT GTTCTTTCTG ACTCTGAACA CAAACAGGAC ATTAAGGACA GAGAAGAACA GCAAAAATAC CACACTCCAC AGAAACCCAA GCTCTACATC CAAGAGACTT CGGAGCAGTC GAGTTCACCA TTGTCACAAT TGGAAAAACA TTTCCGCAAC TCGATCCAGA AACTGAATGG AACTAATGAG ATACCAATCT TGCCTCCTCC ACCTCCAGCA TTCCAGTTGA GAAAACCATC TTCGACTTCG ATTCAATCTG AAGACTCTTC CGACTCAGCA AAGACTCAAT ATGATCGCAG TTTGAGCATC ATAGAAAAAA CTGTTTCTTC AACTACATTG GCTCGTTTGG TATCGACAAA AGCTGACGGA GACACATGCC CAACCAAGGT TATAAACAAG AGAACAATGG AATCTTCCTT GGACTGTGAT GCGGGAGGAC GTCCTATTTC TTCTCAGGGC GAAATCGTAG ATGTAGAAGG CACACCAGTA ATGATTAAGG CTCCTACATT GCCCAATATT GTAACAGAAA CTGGTAGATT ATCTTTGTCT GACAAACACC ATTCTGGACA GCAAACTAAA TTTTCATCTG TTCCCGACCT TGCCACTGGC CGTAGTTCAA ATTTGTATCA ATTGTCGACT GGCTCTGAAA TTGATATTTC TAATTTCGGC AAGGACTACC GTCCTTCGTT TGCTGAATCT GAAGTCTCCA TAAATTCTAA AAAGGGATCT TCTCCTTCTG TTGGGTCTGG CACCCCGGGA ACTCAGAAGA AGCCAACAAG AAAAAAGTCG TTGTTCAACC TTTTCAAAAA GGGATCTAAA GCAAACTTAA ACGATAATGT GGCTGGTTTC CAAATGGTTA ACAAGTCCGT AGATTCAGTT TCTACAAATA CCAACTTGGC AAACAAATCA ATGGATTCAA CTTCGAGTGT TTTCAATACA GCAAATAGAT CGTTGGTCAC GTCTTCAAAG CATTTCAACG CAGCTGATCA ATCTTTTAAT TCCTTTTCCT CAAGCATTTT GAACAAGAAA GACTCTGATG ATTCGGTTAC TTCGAGTATT TCTACTAACG ACTTGATGAA GAAGGACTCG GATGCAACTC TTAATTCTTC AACAAAGAAT AGTAAGGTGG ATCAGGTTGA TGCTGGAAAC CCTACAAATA AGAGTGAAGA ATCAACTCCC TCTACTGAAT CCAGTAGATC AGCAATTCCA ACTGCATTTG CATTGCCATC TTCAACTTCA ACATACTTCT TCAAACAGTA CAAGAACGGT TCAAGTGCAA GCCTCGGTCA ACACAACGAC TCGCAGACTA ACTTGGAACT TCTAGAAAGC ACAGAGGAAG AATTGTTTGT GTCTGATGAA ATCAAGTCCA TGATCAACGA CGATCACTCA ATAGAATTTT TCATTACAGA AGCCACGCCC AAGGCGATGA AGGTGTCAAA ATGGAAACCT AAGTATGGTA AGTGGGAAAT GATCACAGTT AACGAGAATT TGTTTGCAAG AATTGTTACC AACTATCATC TTAACAAGAG TTGGATAATA TTCTTCAAAC AGGATACTGA TGCAAACAAT GCTGAAATTG ATAAGCCAAT CTTATTGTTG GACATCGTTG GAGGGCAAAC TTCAGTCAAG CTTAATGCTT TGGACTTGCA AATCTCTTCC ATTAATTCCG TTACTTCTGA AAAGATGCAA ATTATGGTCA GATGCAAGAC TTCGGCATTG GGAAACAGCA TTCTTACCAA CGTCAATAAT GTTATGGGGG TTTTGTCAGC AAATGCAAAG AATAATAATT ACGGATCCTT ACAAAACTCG GAGTTAGCAC CATCTTCTTT TACAATAACC TCTTCAATAA TGGATGGAAA GTGCCAACCA AGTGCATCCA CTACTTATAG CAGTTTCAGT TCGAGTGAAT CAAGGTCAAC CACTAAATCG GCAAAACCAA TTTCGAGAAT GAAGAAAGGT GGCTCAGATT ATTCAATCAA TTCCCAAGAG GCGGCCAATC TCAACATCCT CACTAACCCT GACAACACCA AGTTGATCTT GTTGAATCAG ATGCCCGTCA GGTTGCAAAA GCAGCTAGAA TCTTATAACA GCATCAGCAC GCCTTCTTCA TGGAAAATTC TTTCCATGTA CAGTTTAACT GTATATCTGA TAACCGAATC CTTCACCAAC AAGTCATATT TCAACTTGGT GTTGAAAAAA ACTGATATAG AGAGCAAAGC TGAAACAGAG TTCAACTGGT TGATTCGTGA TGCAGAGATG TATAAACGTA TCGAACGAAT TGGAAAAGCA GGTTTACTTG TCAAAGTCAC AAACGATGAT ATTTTCATGA TAGAATGCAA GGGCCGAAAG GAATTACACC AGTTAATAGG CATCTTC
|
Protein sequence | KSLQAAEKAE NNCLVEILGA VKPARSSKST ITSIVRGLYT KNAKLLTILD ADIKEVIDNP EFELDTYKGF VNSFILWLDA DAMVLFDKYA AFITSKDSYH NFTNLDKVES FAGPLTNCVH YLDFIAATSN LLRNPFVIDK LSTIKEFMER LLNNYRELLE SSRLNDIGFA NINAFVTPTT PTVTHKPPSR GPPKVSGIFK LEQIVERTKD EPLLLHDKDH KEVPIELVLL DIGGSRGSHG KKYNSLAILQ VHSGTKEFSR SLMFPPFRVN EVSVDFKRDT LFLKAINFSH PENTDSVISI TGLDSQFLYD WHKKLTAIFP EEAKHSPVGE DSFLITPVNS PTKMAGLGIN VLSDSEHKQD IKDREEQQKY HTPQKPKLYI QETSEQSSSP LSQLEKHFRN SIQKSNGTNE IPILPPPPPA FQLRKPSSTS IQSEDSSDSA KTQYDRSLSI IEKTVSSTTL ARLVSTKADG DTCPTKVINK RTMESSLDCD AGGRPISSQG EIVDVEGTPV MIKAPTLPNI VTETGRLSLS DKHHSGQQTK FSSVPDLATG RSSNLYQLST GSEIDISNFG KDYRPSFAES EVSINSKKGS SPSVGSGTPG TQKKPTRKKS LFNLFKKGSK ANLNDNSVDS VSTNTNLANK SMDSTSTDQS FNSFSSSILN KKDSDDSVTS SISTNDLMKK DSDATLNSST KNTIPTAFAL PSSTSTYFFK QYKNGSSASL GQHNDSQTNL ELLESTEEEL FVSDEIKSMI NDDHSIEFFI TEATPKAMKV SKWKPKYGKW EMITVNENLF ARIVTNYHLN KSWIIFFKQD TDANNAEIDK PILLLDIVGG QTSVKLNALD LQISSINSVT SEKMQIMVRC KTSALGNSIL TNVNNVMGVL SANAKNNNYG SLQNSELAPS SFTITSSIMD GKCQPSASTT YSSFSSSESR STTKSAKPIS RMKKGGSDYS INSQEAANLN ILTNPDNTKL ILLNQMPVRL QKQLESYNSI STPSSWKILS MYSLTVYSIT ESFTNKSYFN LVLKKTDIES KAETEFNWLI RDAEMYKRIE RIGKAGLLVK VTNDDIFMIE CKGRKELHQL IGIF
|
| |