Gene Information |
Locus tag | PICST_78610 |
Symbol | |
ID | 4840111 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 163621 |
End bp | 168146 |
Gene Length | 4526 bp |
Protein Length | 1298 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640391426 |
Product | predicted protein |
Protein accession | XP_001385364 |
Protein GI | 150865944 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
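The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it reproduces the listed Gene Length) and that the coding sequence ends with a single stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 163621
end_bp = 168146
protein_length_aa = 1298

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding sequence is the protein codons plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that are not accounted for by coding sequence
# (for example introns or untranslated flanks; the record does not say which).
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, non_coding_bp)
```

Under these assumptions the span is longer than the coding sequence alone, which is consistent with the record marking the gene as spliced.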
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATTTCATTGC TGAACATTTT CAACATATAT AACATTTACT TTATCACATA AACTCATTCC AGCTTCCATA TAGTCCCTTC ATTTCTGTCA GCAAAGTCAT TTTTCATCTT CATCTAAAAT GCTGCTGCAC CACAAGGTCT ATGCAAATGA GCTTTTCCAG ATTATGGCAT CGAGGGACGT CGACCAGACA AGAAAGCTCG AATTAGTATT AAAGCTAAAG ACGAATATAA AAAAGGATGC GGTTGATATT GCTCAAGTTC CAACTTATTT TGAAGCGTTA TCTATAGGTG TAGACATTCC CGATTTGGGT ATTCTGGTAG CCAGCTTTAG CACCCTTGCT CATTTGATCA AGAGAGTAAG TATGCAAGAC AAAACAGGAA TGGTGCTTAA AAACCAAAGC TTTCTTGTTT TACCCATCAT AATCAACCGA TTGGCCAATT CAAACACGTC TACGTTGAGT TCAGCAAGAA AAGCATTGGA GGCATATTGG TTCTCATCAC CTCGCGAAGT TGAAAGTGCC ATTTTAGACA TAGCCTTGAA GCACAGAAAC ACCGACATAA CTCTTGAAGC TGTCCATTGG CTCCACCACA TCATAAGCAA TGTCAACCAG CATTTCAATC TCACAAGGTT TGTTCCACAG TTAGCTAAAC TATTGGCCAC GTACCCATCG TCGCATTCTA CACAACTACA GAATTCTATT AAGTCATTGC TCTGTGACTA CTATAATTTT AAACAGAACA GATTATACAA ATTCGACTTG GCTCGCGAAC TTGAACTTAA GAATGTTCCT GATACCATCA AAGAATCCAT AATGCAAAAC ATCGCAGGGA TTAACCCAGA ATCTACTTCT GTAACCAAGC CCGATTCCAA TTTCGTAATA GCTGGTGATG GCAACGTCCG CGTAACGGCC ACAATAGAAT CGACTGAAAA AAGGTATGCT CCAGTCAGAC CCAGATATAC TGCTTCTACA GTGAGCTCTC GTTCTGAATT AGTTCCGACA GCGAAAGCTT CTTCTCCATT ACCAGTACCA GTACCAGCTC AAACGAAAAT GACGCCGCCT CCAGTTCAAG ACCATCAAAG CATACCGTCT GAAGCAGAAA CTTTACTTCC AGAAATTCGC GACATCATTT CTAAGTACAA CTATGAATTA GATACTTCAA TTAGATCTGT CAGCTTTGAG GGATCAGAAC AAATGCTTTC CACATTCAAC GACCTTCTTC CTCCATTCAA CAGCAAGGAA ACAGAATTTA ACTGGGGCCA GCGTGAGAAG AATATCGTTC AAATGAGATC TATTTTGCGA GGCAATGCTC CATCCCTTTA TAGGAGGGAT CTTATAGTCG GCTTAAAAGA TTCCGCCGAA GCAATCTGTA AGGCAGTTTC TTCTTTACGA ACTACATTGT CTTCTCATGG TTGTCAATTA GTAAAAGAAT GCGCCATATT CTTAAAGACT GATATTGACT CTCTTGTTGA TCTTTTCATG CCTTCCTTGG TTAGGCTCTG TGCTGCAACT AAGCATATAG CTTCTACAAA TGCCAATATG TCCATTGTTG CTATATGTGC GAATGCTTCC TATAGCCCAA GACTATTGCA AAGAATTGTC AATGCTACTA ACGAGAAAAA CGTTCAACCA AGGTCGTACA GTGGTATTTG GTTGCAAATC ACTGTGTCCC GTTTCTTTAA TTCGCATTCC TTTCTTTCTT CTCATGGATC TACTCCAAAT ACAGGGGTCG ATTTGGCGAT GAAAGCATTA GCAAAATTGC TCAAGGATCC AAATCCAACA GTAAGACAAA TAGCAAAAGA 
TTCATACTGG TGTCTTTGGA GAAAGTTGCC AACACAATCA GAATTGTTAT TGAGCAAATT GGAACCCAAG ATTATCAAAA TTGTGGAAAG ATCAAGACCC AAGGATGTTT CACACGAGGA AGTTTCGGCC CCTACTCTAA ATATGCAAAG ATCCCGCCCC TCTTTAAAAG AAACTATAAT TGAAAGGAAC AAGGACCTCA GACTAAAGCA AAGAGAATTG AGTTCTTCTC GAATGTCAAC CAGAGAAGCG TCTGATTCCC AGAAATTAGA TCAGAGTCAC CGTGAAGAGT TACATATTAA AAGAGTTGCC ACTAGGACCA ATTCTACAAA TGGATTTAGA CAAAATTCAA TTGAAAGAAA CTTATCGAAT TCTCACAAGC CAGATTTAAG ACCACCCAGC AAGGAAGGAG TATCTCCAAA GGATATAGAA CAGGAGAGTG TGCAGCAGGA GACAACACGC ACAAAACAGA CAGACACAGC ACCAGCTTCT AAAGAAGTAG CCTTCGATGT ACAAGCAGAT CCTATCCTTA AATTCTTATC TTCATATCAA ACAGATTTAA TAAAGGAAGG AATCAATTTA CTTAAATATG CCATCATGGG CGAAGAAGAA CTATCTCCTG AGATAACCTC ATTATTGAAG AGTATCTCTC TCAGACAACC GAAGCTTTTA GAACCTCTTC TTTTATCCAA TGACAACTTG TTCAAAAGAA GTTTTCAGTT CTTTTCTGCT GAAGACTTCC TTCGTATTTG CTCAATAGTT ATTAATCCAA TCGAAGGAAG AACTATAGAG CTTTTAATCT CTATGATGAC AGTCGATGAG CTCTACGAGT CGATAATCAA ATTGTTGTCG TACTCAATCA ACACAGCTAA TATATTAGGA GATGATGAAC TTACAATGCA AGTAATCAAA TACAAATCTA GCATCATTCA ATTGATTGTG GTTTTCCTTC AGAGCAGCCT TGAAAAGATA CCGATTCGAG ACAGTTATTT TCTGAAAGTC ACTTCAAACT TTTTGGAGTT GGTAAGCATT TTGAAATCAA CTGATATTTA CCCTGAGTTC TCGAAATTAC TAGCTAAGCT TTATTCAATC AATATAGCGC TATTCGTATC CGAATTGGAT CTTGTTGACA TCAATACAAA AGAAGAAGTT GAATATATTG TTGGAATCGA CCATACACTA AGTATGAAGA ATATTCCCAA TACAATGCCT AATTCTTTAT TTGAGTTGAC CGAAGTTCAA AAGACCCATA GCTATGACGG CTTCTCGCCA GTCAAAACAA ATCAAGATTT TACTATGATA TGGCCAGAAA GGAAAGACAA CGATGATTAC AATTTCATTC CCAAAAGTGC AAATGATTCT CAGAAGAATA TTGCAGCTCA ATTTCAAGGA TTCAACCATC ATGAAAGTAA ATCTGAAGAT CCTGTGAAAG AATATAGTGA TAGTATGGAT ATTGACAATC CCGCTGCCAA CGACAGAAAT TCGACACCAG CTCTTGCCGG AGAGCTCTCC GAATCCAATT CTTTGCATGC TGCCATTGAA GGAGTAGCTG ATTTATCCAT GGATCAACAC GATAGTCCCA AAAACAGGGC CACTGACGAA AATGTGTTTC TTGATTCGTC CAGTAGTATC AAATCTGGAT TGTTCAACAG AACTAATGGA GATCACTCAA TTGAATTGGC TAAAGATTTA GCTCAGGTTC AAATATGCGA GCAGCCGCCC TCTCTCATTG ACCAGAATGT CCAACTCCAG ACTTTTCTTG ATAAGGTTGA TCCGTTGAAG AGCATCTCCA ACAGAAATAA ACCCATTGCA 
ATTTATGAAG ATTCGAAGGG TTCTCCTCAG AAACTAAAGG AACACAGATA TTCAGAATTC AATTGGTTTA ACTTTCAGGT TGCAAGATTG CAACTCGAAT CAGACGATGA CGAGGTCGAA GACAATGAAA CTACTGAAGA CGATTACAGT GAGTTATTGA TGGATGTATG CGAGAGTTTG AAGTCGTTAG AATTAGATTC TAAGACCTTA ACAACTGCAC TTGATTTACT TCAGAATATG CATAGGTTTG GATTCGAGTT TGTTAAGTAC TATGAAAATG AAGGAGCCAA ATTAATGGAG GAATCTCTCT GGCAATTTTT CCATGATTGC AACAAAGTTT CCATTACCAA CAAAGTCAGA GGATTAGTAT TATTGAAGCA ATTATTAATC ATCCATTCGA AACTAGACTT GGCCAACTTA TGGACAACAT TGATCAATGT TGCATGCCTT CCAGATTCTG AAAACGAGAT CGCGTTTGCT CTACGTGATA TTTCCTCCGA AATTCTTTCT GGTCTTTACC GGAGTGATGA ATTGCTTGAT GTTTTATTCA GTTCATTTAC AAGCGACCAC TTATCGAGAG CACTGACATT CATTTTGGGG ACACTTTCAA AAGTCTTGGA TTCCACATCA ATCCCAGAAT TGTTAAGTGA TTCAATTCTT CAACAGATTG ATCGAGTTCT TAGGAACTTT GTTAATAGCA GAAATGCCGA GTGTCGAAGA TATGCTATTC TTTGTTACGG GAAGCTTGTC AAATCCTCCA GAGTTAGTTT TGCGGGAAAA TCTCCCGACA ACACAATTCT AGACGGAATC ATTCAGAGAT TTTCCCCAAG CCAGAGAAGA TTGGTGGAAT ACTATAGTCA AGACCCAAAG AGCTAGAATT TTACACTGAA ATAATAAGCA GAGAATTTAT TAACAT
|
Protein sequence | MSSHHKVYAN ELFQIMASRD VDQTRKLELV LKLKTNIKKD AVDIAQVPTY FEALSIGVDI PDLGISVASF STLAHLIKRV SMQDKTGMVL KNQSFLVLPI IINRLANSNT STLSSARKAL EAYWFSSPRE VESAILDIAL KHRNTDITLE AVHWLHHIIS NVNQHFNLTR FVPQLAKLLA TYPSSHSTQL QNSIKSLLCD YYNFKQNRLY KFDLARELEL KNVPDTIKES IMQNIAGINP ESTSVTKPDS NFVIAVQDHQ SIPSEAETLL PEIRDIISKY NYELDTSIRS VSFEGSEQML STFNDLLPPF NSKETEFNWG QREKNIVQMR SILRGNAPSL YRRDLIVGLK DSAEAICKAV SSLRTTLSSH GCQLVKECAI FLKTDIDSLV DLFMPSLVRL CAATKHIAST NANMSIVAIC ANASYSPRLL QRIVNATNEK NVQPRSYSGI WLQITVSRFF NSHSFLSSHG STPNTGVDLA MKALAKLLKD PNPTVRQIAK DSYWCLWRKL PTQSELLLSK LEPKIIKIVE RSRPKDVSHE EVSAPTLNMQ RSRPSLKETI IERNKDLRLK QRELNQSHRE ELHIKRVATR TNSTNGFRQN SIERNLSNSH KPDLRPPSKE GVSPKDIEQE SVQQETTRTK QTDTAPASKE VAFDVQADPI LKFLSSYQTD LIKEGINLLK YAIMGEEELS PEITSLLKSI SLRQPKLLEP LLLSNDNLFK RSFQFFSAED FLRICSIVIN PIEGRTIELL ISMMTVDELY ESIIKLLSYS INTANILGDD ELTMQVIKYK SSIIQLIVVF LQSSLEKIPI RDSYFSKVTS NFLELVSILK STDIYPEFSK LLAKLYSINI ALFVSELDLV DINTKEEVEY IVGIDHTLSM KNIPNTMPNS LFELTEVQKT HSYDGFSPVK TNQDFTMIWP ERKDNDDYNF IPKSANDSQK NIAAQFQGFN HHESKSEDPV KEYSDNLAQV QICEQPPSLI DQNVQLQTFL DKVDPLKSIS NRNKPIAIYE DSKGSPQKLK EHRYSEFNWF NFQVARLQLE SDDDEVEDNE TTEDDYSELL MDVCESLKSL ELDSKTLTTA LDLLQNMHRF GFEFVKYYEN EGAKLMEESL WQFFHDCNKV SITNKVRGLV LLKQLLIIHS KLDLANLWTT LINVACLPDS ENEIAFALRD ISSEILSGLY RSDELLDVLF SSFTSDHLSR ASTFILGTLS KVLDSTSIPE LLSDSILQQI DRVLRNFVNS RNAECRRYAI LCYGKLVKSS RVSFAGKSPD NTILDGIIQR FSPSQRRLVE YYSQDPKS
|
| |
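The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on the first eleven codons following the ATG in the gene sequence above, which correspond to the start of the listed protein (MSSHHKVYAN E...). The helper names codon_table and translate are hypothetical, and the table is built by hand from the standard code instead of using a bioinformatics library.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(reassign=None):
    """Return a codon -> amino acid dict, optionally with reassigned codons."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    table.update(reassign or {})
    return table


def translate(dna, table):
    """Translate a DNA string codon by codon, ignoring spaces and any partial last codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Fragment copied from the gene sequence above, starting at the first ATG
# that matches the listed protein.
fragment = "ATGCTGCTGCACCACAAGGTCTATGCAAATGAG"

standard = translate(fragment, codon_table())
# Table 12 differs from the standard code here only in CTG -> S.
table_12 = translate(fragment, codon_table({"CTG": "S"}))

print(standard)
print(table_12)
```

With the standard code the fragment would begin MLLHH, whereas with the CTG reassignment it begins MSSHH, matching the protein sequence in the record.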