Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_30785 |
Symbol | |
ID | 4837833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 934607 |
End bp | 937606 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640389148 |
Product | predicted protein |
Protein accession | XP_001383808 |
Protein GI | 150864827 |
COG category | [R] General function prediction only |
COG ID | [COG1524] Uncharacterized proteins of the AP superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.778067 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTCAG TTGATATCGA CACTTCAGCG ACCAACCGGC GAGCAATCAA ACAACAGCAA TTCCAGCAAC AGCTGCGATC AAAGATCCGA CAGCAGAAGT TAAAAGTCAC CTTTATAGGC TATCTTGTGG TGCTAGTATT TCTAGCTCTA GTTCAGTTCA TAGGAGTAGG GTTTTTCACA AAGGGCTTCT TACTTCTGCG TAACGTCTTA CCCAATGTAT CCGAGTGTAC CACTAATGAC TTCAATACGT GCATGGCGCC AGCCCGGTTC GATAAGGCGA TCTTGTTGGT GATAGATGCG TTAAGGTTTG ATTTCGCTAT TCCTATAGCC GACTCAAATG AATACTATCA CAACAACTTC CCTATACTAC ATCAATTGGC TCAGGATGAT CATGGTGTAT TGCTCAAGTT CATTGCCGAC CCTCCCACAA CGACCTTGCA ACGCTTGAAG GGATTAACCA CAGGTTCGTT GCCAACTTTC ATCGACGCCG GTTCCAACTT TGACGGGGAT GCCATCGATG AAGATAACTG GTTGCTCCAG CTCCACAAGA ACAACAAAAG CATTGCATTT ATGGGTGATG ACACCTGGTA TGCCTTGTTC AACCACTACA TCAATCCTGC GTTGAACTTT CCTTACGACT CCTTAAATGT CTGGGACTTA CACACTGTTG ATAACGGAGT CATAGAGCAC TTGTACCCAT TGCTTCACAA GGATAATTCG AGCCAGTGGG ACCTTCTAGT GGGCCATTTT CTCGGAGTAG ATCATGTAGG GCACAGGTAT GGGCCCCGTC ACTTTCTGAT GAAGGAGAAG TTAAACCAGA TGAACGAGGT TATAGCCAAT GTAGTCAAAA GTTTAGACGA CAAGACATTA TTGGTAGTGA TAGGTGATCA TGGAATGGAT TCCACCGGTA ACCACGGAGG CGATTCGCCG GATGAGTTGG AGAGCACCTT GTTCATGTAT GCCAAAAACA ATAAGTTTTT TAAAAAGGAT TCAAGTCATT ACAACACTAC AGAGCAAGGT AAGCATTATA GAGCTGTCAA CCAGATTGAC TTGGTCTCCA CCATGTCGTT ATTGCTTGGG CTACCAATAC CGTTCAACAA CCTTGGTTTC CCCATCGATG AGGCATTTGA AAACCAAATG GAATTGTCCG TAGCGTCGTA TAAAACTCTA CAACAGATAC AGGGATTCAG AAACAGTACG CCAAATTTGT CGCCCGAAAT CAACAAACAA TACCACCAGA TCATCAGTAA TTATACTAAC AATTCTCATG ACTTGTACAC CTTGGTAGAT CTGGCCAAGA CATACCAGTC CCGTTCTTTA GAAGAATGCA AAGGATTATG GGCGACGTTT GATTTGAAGA TGATTGGCGT TGGAATAACA ATCCTTTTGC TAGCATTGAC TTTTATCCTA ACCTATGCTA GATCGATTCC TGCGGTCAGA GTTCTGACGA TGTCTTTCGA GTTCATTGGA TCAGTGATAG CCATGTCATT GCTTGGTTTA GTGCTAAGTC TTTCAGTGAG TTTGGTATTA AAGCCTGCTG ATTTCAACTT GAAAAAATGC TTAGCCCTCG GTGCTTCTTT GGGAATAATT GTCGGATTCT GGGCCCCCAT AATGGATAGA TTCAGCATCA ACTGGCTAGT GCACCAGCTC ATCGATTTCT TCGTGTACAA TTTCAACAGT TGGTCCTTTT TAGGGTTGGT CTATGTCGTG GCACATTGCT TGATTTTTGC ATCCAACTCA TACGTGGTAT GGGAAGATAA AATGGTTCTG TTTTTCTTGA TGACTTTTGG TGTTGCTTGT ATTTTTAACA TTGCTATCAA TTTCGAGCTT CCGCGTTCAC AAAAGATTTT AGGACTTCTG CATGCCATCA CATTTACCCT GTTAACAAGG TTGGTATCCA CTATTAATCT TTGTCGTGAA GAACAAAGAC CTTACTGCCA GGCTACCTTC ACTACGTCTT GGTGGTCCAT TGTGTTATTG CACTTGTGCT CTTACCTTCT TCCAACAATC ATCAAGTCAT TCTATAAGTT GTCTGATTCA TACCATTCAG CTGCTCCTTT GTGGGTTGGT ACTGGCCTCA AGTTTCTAAT GTTCATGAAT GCTGTTTATT GGACCTTAGA ATATGTTCTG AACAGCGAGT ATTTCTTATC GACGAGTTTT GTCTTGAGCT CGCCTTTGAT CAAATCCTTG AAGTTGGCAA TTGCAAGAAT CGTCTTGTTC ATTACCTTGG TGCTTGCAAA TTTTAGTTGG TCCAAGGGTC CTTTGTGTGT CAAGTTAGAG CTCTCGGATG CCGTACAAGA AGACTCAGCA GAATCTGAAG ATTCAGACGG GCCACTGAAG ACAGCCACAA TTTTAGGATA TGGCAACGTT TACGGGTCAT CATATTTTTT GTTGGTTCTC AACTTTACAG TAGCCATCAT GTTGGTATCC AAACCATTGG GAGCCATTTC CATCAATATG TTGATCGTAC AAATCTTGTC GTTGTTGGAG CTATACCACA TAATGGACAT ACGTAGAAAC TTGATTTCGC CATTGATCTT TGGATTGTTG GGTTACCAGC ACTTCTTCAG TACGGGGCAT CAAGCTACTC TTGCTGCTAT TCAATGGGAT GTGGGCTTCA TGACCACAGA AACCATAACC TTCCCATTCA CCCACTTGAA TATTGTGTTG AATACATTTG GTCCTTTCTT GATTATTTGC TTGTCGGTGC CTTTGATCAC GTTGTGGAGA TTGGCTCCTT CAAGCAAGCC TATTACCATC TTGTCGCAAA TCGTAACCAA TGTAACCACT CTTATTACAT ACCAGTTGTT CACTGGGGTG TCCAGCTTAA TATTTGCAGC TCATTTTAGA AGACACTTAA TGGTGTGGAA AATCTTTGCA CCCAGATTCA TGTTGAGCGG ATTGTTGATC ATAACCATAA ACATTTTCGT TATCGTCGTG ACGTTGTGGT TTGGAACAGG CAGGGTTGTA ACCCAAGTGA ACAGAATCTT TGGGAAGTAG
|
Protein sequence | MESVDIDTSA TNRRAIKQQQ FQQQSRSKIR QQKLKVTFIG YLVVLVFLAL VQFIGVGFFT KGFLLSRNVL PNVSECTTND FNTCMAPARF DKAILLVIDA LRFDFAIPIA DSNEYYHNNF PILHQLAQDD HGVLLKFIAD PPTTTLQRLK GLTTGSLPTF IDAGSNFDGD AIDEDNWLLQ LHKNNKSIAF MGDDTWYALF NHYINPALNF PYDSLNVWDL HTVDNGVIEH LYPLLHKDNS SQWDLLVGHF LGVDHVGHRY GPRHFSMKEK LNQMNEVIAN VVKSLDDKTL LVVIGDHGMD STGNHGGDSP DELESTLFMY AKNNKFFKKD SSHYNTTEQG KHYRAVNQID LVSTMSLLLG LPIPFNNLGF PIDEAFENQM ELSVASYKTL QQIQGFRNST PNLSPEINKQ YHQIISNYTN NSHDLYTLVD SAKTYQSRSL EECKGLWATF DLKMIGVGIT ILLLALTFIL TYARSIPAVR VSTMSFEFIG SVIAMSLLGL VLSLSVSLVL KPADFNLKKC LALGASLGII VGFWAPIMDR FSINWLVHQL IDFFVYNFNS WSFLGLVYVV AHCLIFASNS YVVWEDKMVS FFLMTFGVAC IFNIAINFEL PRSQKILGLS HAITFTSLTR LVSTINLCRE EQRPYCQATF TTSWWSIVLL HLCSYLLPTI IKSFYKLSDS YHSAAPLWVG TGLKFLMFMN AVYWTLEYVS NSEYFLSTSF VLSSPLIKSL KLAIARIVLF ITLVLANFSW SKGPLCVKLE LSDAVQEDSA ESEDSDGPSK TATILGYGNV YGSSYFLLVL NFTVAIMLVS KPLGAISINM LIVQILSLLE LYHIMDIRRN LISPLIFGLL GYQHFFSTGH QATLAAIQWD VGFMTTETIT FPFTHLNIVL NTFGPFLIIC LSVPLITLWR LAPSSKPITI LSQIVTNVTT LITYQLFTGV SSLIFAAHFR RHLMVWKIFA PRFMLSGLLI ITINIFVIVV TLWFGTGRVV TQVNRIFGK
|
| |