Gene Information |
Locus tag | PICST_82645 |
Symbol | |
ID | 4837902 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 540867 |
End bp | 545801 |
Gene Length | 4935 bp |
Protein Length | 1489 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640389217 |
Product | predicted protein |
Protein accession | XP_001383738 |
Protein GI | 150864767 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
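The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based and inclusive, and that the coding sequence includes one stop codon. The function names `span_length` and `coding_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information table.
START_BP = 540867
END_BP = 545801
GENE_LENGTH_BP = 4935
PROTEIN_LENGTH_AA = 1489


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting a stop codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


gene_span = span_length(START_BP, END_BP)    # 545801 - 540867 + 1
cds_nt = coding_length(PROTEIN_LENGTH_AA)    # (1489 + 1) * 3
non_coding_nt = gene_span - cds_nt           # bases not explained by the CDS

print(f"span: {gene_span} bp, reported gene length: {GENE_LENGTH_BP} bp")
print(f"coding (incl. assumed stop codon): {cds_nt} nt")
print(f"remaining non-coding within span: {non_coding_nt} nt")
```

Under these assumptions the span matches the reported Gene Length of 4935 bp. The bases left over after subtracting the coding length are consistent with "Is gene spliced | Yes": they would be intronic or untranslated flanking sequence, though the record itself does not give that breakdown.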
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.688651 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.590002 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AGTTGCATCC GTCTTCCATA CCTTGTTTCA GAACGTCCTG GTCGTTTTAG AACGACTCAA TTAGCTTGAT TTTTAATTTT ACTCTCCATC AAAGGGTATT TTTCAGTTAT CCATTTTTGA TAAATTTTTC TTTACTTGAA TATCCAATTT TCACAAACAG ACATATACAT ACATTGACAG TCCATCCCCG TTTATAACAT CTCTGAATTG AATTCCAGAA CTTCAAACAC CCACAGAAAT GCAGCATGCG CGAATCTTCA ACTCCGGGGG CCAGTCAAGG CGAAACCATT CTGGGTCCAG TAATGTCTTG AGCTCACAGT CGTTATCCAA GTCCAACTCC ATGAACTCAA TGTCTGCCAG CCAGATGAGT ATTCAAATAG GAGCTTCCCG TGTAACGTCT TCTTCTTCTG GTAACCACTT TATGAATAGA AATATGAGAA GTACTATAAC TGTAGGAAAG AAGCATGCAC CAATTGAACA ACAAATTCAT CAGACTCAGA CCCCCAACTT GGTACGGTCA ATATCGAAAA ACAGCGAATC CCTAGGAGCA ACTACTATAA GAGAAGATAC TCCGCAGACG TCTACAGTCC AACCTAATCT TAAGGACCCT ATCCCTCTAA CAGTATGGTT TCATGATCTT AGAACTTCTG ATGAAGATGT AATAATCGAC TCCAATGCCA TACCAGGAGG TGTGCGAAAC GGACAAGTGT ATATGTTACA GCTGTTGGAA ACCGAAGATC TGAAAAAGTT GCTCTTCGTC ATCAACGATC GAAACATAAG AGATAATAAT GCCGCTGCGC AAGATGATTT ACAATCGGTC GAATCCCCCC AACAAGAACC AGCGAGTCAA GCTGTACCTA TAGCGAAGTC TAAGTTCCAG ATTTCGTTGA TATCTAATCC GTTACAAAAA TTGTTGGATA TTCCACCTCG TAGTTTAGTG CAGATCAAAC GGATTCAGAA CTTGGCTAAG GTTGAAGCTG ATTCCGTAGA AATCTTTATC AAAGACGTCA ACTTATCACG AGACTCTATG TGGAACTTTT CTTCCACTTT GGTGAATTCG TGTGTACACA TCGACAAAAG ACTATTGTTC CTAAACAACA GGACAGGCAT CGTAAAGTAT ATCTACAAGA ATGGACGGAA CGTCTTCTCG GGCTACATTG GTGAGAATAC GAAAGTCACT TTCAGATCTG AAAGTGCAAA GCTCACCGTT TTGGTACAGT TGTCACGGGA AATGTGGCAC TTTGAAGAAA ATGGCGAAAT TATGTTCCAT AAGCTTGTAA ACAACTTGTT TCCCAAGATG TTTAAAAGAT GGAGAGATAG AAATACCCAT CATGCAATCA CAATAGTCTT GTTCACCAGC ATGGATTTGA CTGATATACC GTGGACAACT TTGGGTCAAG GTGAGAGACC CAATAACCGC AGAGACTACT TCAGAGTTGT CGTAGACCAG GTTAACATCT TTCACTGGGA TAAGATTATG GCCAATTTGC GGTTGGAGTT TGCTAACTTC AAACGTGATA TCATGTTGAA TCAGCAAGAT ACAAACCACT ACACTATGGA TGGTGAACCT CTCCCTTCTG TGAAAGGAAA CATATTGGAA GCAGTCAACT TGGGAATGAC CTTGGCAAAC AACAGATTCA TTAACACTGA TCTTAAGCAT TCCTTGAACC ATTATATCGT TGTAACTCCA GGAACAGGTA TATATGATGT TGACTATGAG CTCATGCTTG AAACAAGCAA GAAGATGTCC ACAATTGATT ATGGATTGGA CATCATTTGT TTGAGTCAAC CTCCTCTTCA 
TGTTGTCCCA TTATTTCGTT TTTCCAAAGA TGGCAAAGTA AAGCACGTCG TTCCTAACTG GTGTGATATT TCTTATTACA AAGATCTGAA CCAGTCTGCA AATCTGTGGA TCCCTCGTTG CAAGATTTAT GAATTGCAAA TGATGGGAGT CATGGAAAAC AATATGAATC AGATCAGAAT AGATCGCTTT CAAGTTACCC AAAAGGCTCC TACTATGATA GAAGCCATGG ATAACTACGA TAACGACCTT TTCAAACCGG TGAATCATAG AAAATATATA AAGGAAAACG ATGAAAAGGC AATAGACAGT TATGAAATAA AGGAAAAGAA CAATTCCAAA TTCAAACCCA CAACCTTAAA AAATGCCAAT GCGACCCTTT CGTTGATTTT CAACAACAGA ACTCGATTGC AGCCTACCGA ATTGACTCCC TCGAATTCAT CGGTGTTGGG AACTGTTACA CATAGCAATG GAGAGGCTTT GTCGACGTTG TACAATTTGA ATAAGATTTC TGATGATCGC ACTATTTCTT CCTTAGCCCC GCTGATTTCT AGCACCAGAT CAATAAGTAG CAAAAGATCC ATGGATATCT TGAGAAAAGA AACACATTCG CCTCGTTTGA TTAAGAGTGA TTCCTTATTC AAAACTGAGA CAGGAGCTTC TACACCTGCA GAACCTAGGA CAACGAAATC TGTTGAACGG GAACGTAAGT CTCGTCACCT TATGAAACGT CCAGATAGAG ATTTTTCGAA ACGCAGATCT CAGAGTAAAG GTTCAGGGAA CGTAGATTAC GAATCTGAGA ATCCTACTGA TATCTTCTGG ACTGAGATTG AAAACCCTTC TCAGGAATAC CGTGCAGATA CACTTTTGTA TCCCAATGTG AGTAGGTGGA GTAATGCTTT CCCAGATAAA ATCAGACGTA GATTGGTGAA GTGGAGGTCA TTGCAGTCTC CAGCCGCATT GCCTATTACA ACCTCTGTAT TTCCGTCTGT AAAGATATTA GAGACTGAGT ACACGTTTCA AATTTACAAT GTGTTGCTAA ACTATGAAAA CTACTTGGAA CTTGAAACAA CTCACGAATT GATGCGTGAA ATGATTCAGT TGAGACTTCT TTTAGGTTTC CAAATTTGTT TTGGTGATCA AGTTAAAAAA GCTGAAAGTG AACGCAAGCC AGCTGGCAAT GTGGAAAGCT TGATAAAGTA TTTGCCAAGG CACAGCTCAC TCGGAGCACG AATTTATATG TCTCTTGGAG ATGAGATACA TCGTATCTAT CTTGACTACA ACGGGAATTT AAATGTCCAA TTGTATCATA AGACGGTGAC TAATGAAGAA AACAAGATCA CTCTTGGCCA AGCCAAACTT ACCAACTACT TTCCATTAAT TAGAACTCGT TATGCTGATG AATACTCCAC AGCGAAAATT GATGGTATCA ATCTGAAACC AAAGATGTAC AATTGGAACC AGTTCGACCA ATATTTAGCT GGGTATGAAG ATGCTATGCC AGATGCAAAT AAGGATTTCT ATAAAATGAA GTTCGTTGTA ATGCCAGCAC CTATTCCTAA GAACGCTTTC TATATTACCA ATGAGAACTT GACAGACGAA GAAATTCGTG TTGAAGGCTT AAGAAAGTTG ATCGCTATGA TTGAAAAAGG AAAGTATTTG AAGAGGAAGG TTACTTCTAA GAAAGAAGAA ATATTGCCTG AAATTGTATT CTACACTGGC AACTTGTACG ACTTCTTGAA TGAGGAGGCA CAGAACTTTG ACAATACTGG AAATCAGGCT GGACTTATGA TTCCAGAGAG CATGAGATTC 
AATAAGAGTA TTAAACTTTC TGAATTAGCA GAGGAATTGC AGACCCGGTC TACTGGTCTA CCATTAGTTG ATAGAACGTG GCATTTCAAG AGACATTTGC ATTGTTTCTT AGGAAGTGAG TTGGTCTCGT GGTTATTGGA ATGTTTTGAA GATATTCAGA CCAGGGACGA AGCCACAAGC TATGGTCAGT CGTTGATGAA CAAGGGCTTG TTCAAGCATG TTGAACTGAG ACATGGATTT TTGGATGGTT ACTACTTCTA TGAATTTGAA AGTGAGTACA TCGACAAGAC CTATATCGAG CTGAAGAAAG GCTGGTTTGG AATCAAGAAG ACTTCTTCAG AAAAGCCAGA CACCGATTCC AATACTCCAG TGTATACGAG AAACAATTCC GATGTCGAAT CACTCTCTTC ACCAGCTTTG CCCCTGCAAG ATCCGCTTGA TTTGAAAAGA ATAACTAGTA CCTTAATTAC AGACTCTGGA GCATCTTCAT TAGAAGGAAG TAGAAGAAGA AAGAAGTTCA TCTTAAGCAG AGCCGTTAAG TACAATGTTG ACAGTTTAGG TAAATCATTC AGACCCGAAA TTGTTACCGT TCATTATGAC AGAGTTCATA ATCCAGAGCA TTGTTATCAT ATCAGATTGC AATGGCTAAG TACCACGGGA AGGTTTATTG ACGAAGCCAT TACCAACTGG TCCAGATTGT GCGAGAGACA TGGGTTGAAA TTAGTGGAAA CTCCTTGGAA AGAGCTATGT AATATTCCTC TGATCAGTCC GTTCCACTCC TTTGTCGATA TCAAACTTTG CGTGAACCCT CTTGTTGATC CTGAATTCCT GGATGTGAAG ATTCTCAGAA AGAATAGGTT TTACTATCAC TTGTATTTCC TCAAGAAATT TGAATTCCTC TTGGATAACA GATCATCTCT TTTCTTTCTG AAAGATAGTA TTGAAATCTC ATACAGTTGG GGTAAACCTT CGTTTCAATA TGCCCAATAT ATTCATAAAA ATGGCACTTA TATTATTGAG CTTCGTGACA ATGGAGACTT TTTCCTTGCT CCCAATAATA TCCATATCAC CAGAGTGAAT ACTACACTCA CATCAATTCC CGACTTTGAC GGTAGCATCA CGACGTACAA TACAAATTCC CAACAGGTTA TGTTGAACTT CAGGTCTGCT TGTCAGAATG AAGATTATTT GAAAGAATTG TTCCGCGAAG CGAAAAGCAA CTGGCGAGAA GAGTTTCCCT TGGATATGAT GCCTACGGAT CTTCCTCAGT AATAGAAGTT GAAAAGTTGA AAAGTTGAAA GTTTGTCTTG TAAATATATA TGTAT
|
Protein sequence | MQHARIFNSG GQSRRNHSGS SNVLSSQSLS KSNSMNSMSA SQMSIQIGAS RVTSSSSGNH FMNRNMRSTI TVGKKHAPIE QQIHQTQTPN LTSTVQPNLK DPIPLTVWFH DLRTSDEDVI IDSNAIPGGV RNGQVYMLQS LETEDSKKLL FVINDRNIRD NNAAAQDDLQ SSKFQISLIS NPLQKLLDIP PRSLVQIKRI QNLAKVEADS VEIFIKDVNL SRDSMWNFSS TLVNSCVHID KRLLFLNNRT GIVKYIYKNG RNVFSGYIGE NTKVTFRSES AKLTVLVQLS REMWHFEENG EIMFHKLVNN LFPKMFKRWR DRNTHHAITI VLFTSMDLTD IPWTTLGQGE RPNNRRDYFR VVVDQVNIFH WDKIMANLRL EFANFKRDIM LNQQDTNHYT MDGEPLPSVK GNILEAVNLG MTLANNRFIN TDLKHSLNHY IVVTPGTGIY DVDYELMLET SKKMSTIDYG LDIICLSQPP LHVVPLFRFS KDGKVKHVVP NWCDISYYKD SNQSANSWIP RCKIYELQMM GVMENNMNQI RIDRFQVTQK APTMIEAMDN YDNDLFKPVN HRKYIKENDE KAIDSYEIKE KNNSKFKPTT LKNANATLSL IFNNRTRLQP TELTPSNSSV LGTVTHSNGE ALSTLYNLNK ISDDRTISSL APSISSTRSI SSKRSMDILR KETHSPRLIK SDSLFKTETG ASTPAEPRTT KSVERERKSR HLMKHYESEN PTDIFWTEIE NPSQEYRADT LLYPNVSRWS NAFPDKIRRR LVKWRSLQSP AALPITTSVF PSVKILETEY TFQIYNVLLN YENYLELETT HELMREMIQL RLLLGFQICF GDQVKKAESE RKPAGNVESL IKYLPRHSSL GARIYMSLGD EIHRIYLDYN GNLNVQLYHK TVTNEENKIT LGQAKLTNYF PLIRTRYADE YSTAKIDGIN SKPKMYNWNQ FDQYLAGYED AMPDANKDFY KMKFVVMPAP IPKNAFYITN ENLTDEEIRV EGLRKLIAMI EKGKYLKRKV TSKKEEILPE IVFYTGNLYD FLNEEAQNFD NTGNQAGLMI PESMRFNKSI KLSELAEELQ TRSTGLPLVD RTWHFKRHLH CFLGSELVSW LLECFEDIQT RDEATSYGQS LMNKGLFKHV ESRHGFLDGY YFYEFESEYI DKTYIESKKG WFGIKKTSSE KPDTDSNTPV YTRNNSDVES LSSPALPSQD PLDLKRITST LITDSGASSL EGSRRRKKFI LSRAVKYNVD SLGKSFRPEI VTVHYDRVHN PEHCYHIRLQ WLSTTGRFID EAITNWSRLC ERHGLKLVET PWKELCNIPS ISPFHSFVDI KLCVNPLVDP EFSDVKILRK NRFYYHLYFL KKFEFLLDNR SSLFFSKDSI EISYSWGKPS FQYAQYIHKN GTYIIELRDN GDFFLAPNNI HITRVNTTLT SIPDFDGSIT TYNTNSQQVM LNFRSACQNE DYLKELFREA KSNWREEFPL DMMPTDLPQ
|
| |