Gene Information |
Locus tag | PICST_72010 |
Symbol | |
ID | 4838494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 1014660 |
End bp | 1017475 |
Gene Length | 2816 bp |
Protein Length | 895 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640389809 |
Product | predicted protein |
Protein accession | XP_001384503 |
Protein GI | 126135958 |
COG category | [S] Function unknown |
COG ID | [COG5594] Uncharacterized integral membrane protein |
TIGRFAM ID | |
|
|
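The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (an assumption about the database convention, though it reproduces the listed gene length) and that the protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1014660
end_bp = 1017475
protein_aa = 895

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the 895 aa count excludes the stop codon, so the
# coding region spans (895 + 1) codons of 3 bp each.
cds_with_stop = (protein_aa + 1) * 3

# Whatever remains would be flanking, untranslated sequence
# included in the record (the gene is listed as not spliced).
flanking_bp = gene_length - cds_with_stop

print(gene_length, cds_with_stop, flanking_bp)
```

Under these assumptions the span matches the listed 2816 bp, and the gene record is longer than the coding region alone, which is consistent with the gene sequence starting a few bases upstream of the first ATG.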
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.355698 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TCACGTTTAC CATGTCTGAC AGTAGCACCA GTTCCGATAG CTCGGTATCC CAGTTTTTGG CGGCATTGAT CCCCACTGCG GTCATCGCAG CTGTGTTTAT ACTTCTCTTC ATTGCCATCC GTAAGAAGCA GAAGAGAGTC TATGAACCTC GTTCCATCAT CGAGACCGTC CCAAAAGATT TGCAAACCGA GTCCACTCCG ACGGGCTTAT TTTCCTGGGC TCCCCATGTT TTGAAGAAAT CCGAGTCCTA CCTCATCCAA CAAGCCGGTA TTGATGGCTA CTTCTTTATC CGTTTTCTTC TTGAGTTTGG TTTGATTTGT ATCTTGGGAT GTTTCATCAC TTGGCCAATC TTGTTCCCTG TCAATGCCAC CAACAGTAAT GGCCAAAAGG GTTTCAATGC CATCTCATAC TCCAACGTCA ACAACAAATG GAGATTTTTG GCCCACATTT TTGTTTCCTG GATTTTTTTT GGCTCCGTCC TCTTCTTGAT CTACCGTGAA ATTGTCTACT ACACTACATT TAGACACGCT GTACAAACTA CTCCATTGTA TGATTCTCTA CTTTCATCAA GAACTCTTCT CCTCACAGAA ATTCCTGAAA GTCTTTACGA AGAGGAGACA TTGCGTTCGT ATTTTCCACC AGCTAAGACT ATATGGTATG CTAGAGACTA TAAAAAGTTA GAAAAAGATG TAAAGGAACG TACCAAGTTA GCTGGTAAAT ACGAAGGGGC TGCCAACAAG GTCATCATTA AGGCTGTCAA GATGAGGAAC AAGGCAATCA AGAAGAAAAA GCCAACACCA GAGCCTGCCG ATGAAATTGA TAAATACTTG AAGGATGGAA AGAAGAGGCC AACTCATAGG TTGAAGTTCT TGATCGGTAA GAAGGTCGAT ACATTGAACT ATGGTGTCGA GAGACTTGGT GAATTGAACA CGTCGATTAA AGAGCAGCAA GAAAATTTCA AGGAAAACAA ACAGGTTCCA TCTGTCTTCA TCGAGTTTCC AACTCAATTA GACTTGCAAT TGGCCTACCA AGCCGTTCCA TTCAATAAGG ACTTGAAGGG TGTTAGAAGA TTTAGCGGAT TAGCTCCCAG TGACATTATT TGGGAAAACT TACCTTTAAC CAAGAAGTCC AGATGGGCTA AAAAAGTTGT TGCCAATACT GTCTTGACTT TAATGATTAT TTTCTGGGCT ATCCCCGTTG CAGTTGTCGG TGCTATCTCC AATATTAACT TTTTAACAGA CAAAGTCCAT TTCCTTAGAT TTATCGACAA CATGCCTGCA AAACTTATGG GAATTATCAC CGGTTTACTT CCAGTGGTTG CCTTGGCTAT TTTGATGTCT TTGGTTCCTC CATTTATTAA GAAGATGGGT AAAGTTGCTG GCTGTATTAC TATTCAAGAA GTCAATGGAT TCTGTCAAGC ATGGTTCTTC GCCTTCCAGG TTGTTCATTC ATTCCTTGTT GTCACCGTTA CTTCTGCCGC AGCTTCCTCT GTTACATCTA TTATAAGTAA GCCTGGTACC GCTTTGCAAT TGTTATCAAG CAATTTGCCT AAGGCTTCAA ATTTCTATCT AGCATTCTTC TGCTTGCAGG GCTTGACTAT TCCCTCTGGT TTGTTGTTGC AAATTGTTCC TTTGATTTTG TCACAGGTTT TCAGCAGACT TGCTAGTACC CCTAGAGCTA AATGGAACGT ATGGTATAAA ATAGGTTCCC CTGACTGGTC TACCACCTAT CCCGCTTATC AACTCTTGGC AGTGATTGGT TTATGTTATG CCATCATAGC ACCATTGGTG CTTGGATTTG 
CTGGTATCGC TTTTCTTGTT ATTTACTTGG CCTACATTTA CACCTTGGTT TATGTTTTGC AGCCAAACCC TGTAGACGCT CGTGGTAGAA ATTACCCAAG AGGTTTGTTG CAATTATTCG TTGGATTATA CTTGGCTGAG GTTTGTCTTA CAGCTATGTT TGTCTTTGGT AAGAATTGGG TCTCTGTTGC TTTGGAAGCT CTCACCATTC CAGTTACTGT CGCTGTTCAC CTCTACTTGA AGTGGAAGTA CTTACCATTG TGGGAAACAG TTCCAATTTC TGCTATTAGA TACGCGGCTG GCAACAAGTC ACTCGAATAT CCAATGCACG ACCAGGGTTA CAAAGAAATT AAGACAGAGG GTACTAACTA CTGGGAAGGT GGTAATGAAC TTGGTGCCTA TCAAGACCAG GATGTCACAA TCCAAGGAGG CGCATCTGAG AGGGCAAACG AATCTGGACC CGAGTCCAAG GTTGCTGGCT CAGATGAATC AGACAAGCAT AGTCCCTTCG ACAAGGCATC CGAGCTTGAA GGAGGTAAGA CTGCTGCTGT CGGTCAAGTT GTGGCTGTTC CAGGAAAGGG TGTTTCTTGG TTAACAAAGT TCTTCAAGCC AAAATCCGAA ACCTTTGATA TCGTACGTGC GAAGATGCCA GCATGCTACT TCAACTACAT TGAATACAAT GATGAGTTCG TGAAGACAGC TTATGCTGAT CCATCTGTTA CTGACGAAGA ACCTCACATT TGGATTGTCA AGGACAATTT GGGTCTCTCT GACATTGAGA TGAATAGGGC TATTGAAGGT GGAGTTGATG TTTCCAACAG TAATACCGAG TTCAACGAGA AGGGTTCCGC AACCTACACC GGACCACCAC CTAGTTACGA AGAGGCACTC AAAGTTTAAT TTTCTCTATT TAGTTTGTAT TATTTGTTTG CTACAAGTGT TCTCTCCTTT AGTTTCAACT TCAGTCATGC TTCTTAAATG ACATTAATTG TATAATAAAC GATATTTATT TGGAAA
|
Protein sequence | MSDSSTSSDS SVSQFLAALI PTAVIAAVFI LLFIAIRKKQ KRVYEPRSII ETVPKDLQTE STPTGLFSWA PHVLKKSESY LIQQAGIDGY FFIRFLLEFG LICILGCFIT WPILFPVNAT NSNGQKGFNA ISYSNVNNKW RFLAHIFVSW IFFGSVLFLI YREIVYYTTF RHAVQTTPLY DSLLSSRTLL LTEIPESLYE EETLRSYFPP AKTIWYARDY KKLEKDVKER TKLAGKYEGA ANKVIIKAVK MRNKAIKKKK PTPEPADEID KYLKDGKKRP THRLKFLIGK KVDTLNYGVE RLGELNTSIK EQQENFKENK QVPSVFIEFP TQLDLQLAYQ AVPFNKDLKG VRRFSGLAPS DIIWENLPLT KKSRWAKKVV ANTVLTLMII FWAIPVAVVG AISNINFLTD KVHFLRFIDN MPAKLMGIIT GLLPVVALAI LMSLVPPFIK KMGKVAGCIT IQEVNGFCQA WFFAFQVVHS FLVVTVTSAA ASSVTSIISK PGTALQLLSS NLPKASNFYL AFFCLQGLTI PSGLLLQIVP LILSQVFSRL ASTPRAKWNV WYKIGSPDWS TTYPAYQLLA VIGLCYAIIA PLVLGFAGIA FLVIYLAYIY TLVYVLQPNP VDARGRNYPR GLLQLFVGLY LAEVCLTAMF VFGKNWVSVA LEALTIPVTV AVHLYLKWKY LPLWETVPIS AIRYAAGNKS LEYPMHDQGY KEIKTEGTNY WEGGNELGAY QDQDVTIQGG ASERANESGP ESKVAGSDES DKHSPFDKAS ELEGGKTAAV GQVVAVPGKG VSWLTKFFKP KSETFDIVRA KMPACYFNYI EYNDEFVKTA YADPSVTDEE PHIWIVKDNL GLSDIEMNRA IEGGVDVSNS NTEFNEKGSA TYTGPPPSYE EALKV
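Translation table 12 is the NCBI "Alternative Yeast Nuclear" code, in which CTG is read as serine rather than leucine. The following is a minimal sketch of translating the start of the gene sequence under that table. The helper names (`codon_table`, `translate`) are hypothetical and not part of any database tool, and locating the start codon by searching for the first ATG is a simplification that happens to work for this record.

```python
from itertools import product

BASES = "TCAG"  # NCBI codon ordering: TTT, TTC, TTA, TTG, TCT, ...

# Amino acids for the 64 codons in NCBI order.
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Table 12 differs from the standard code at one codon: CTG -> S.
YEAST_ALT = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(amino_acids):
    """Map each codon (in NCBI order) to its one-letter amino acid."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, amino_acids))


def translate(dna, table):
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


TABLE_12 = codon_table(YEAST_ALT)

# First 50 bases of the gene sequence listed above.
gene_prefix = "TCACGTTTAC CATGTCTGAC AGTAGCACCA GTTCCGATAG CTCGGTATCC"
seq = gene_prefix.replace(" ", "")

# Simplification: take the first ATG as the translation start.
cds_start = seq.find("ATG")
peptide = translate(seq[cds_start:], TABLE_12)
print(peptide)
```

The translated prefix should agree with the opening residues of the protein sequence row. For the full sequence, an established library such as Biopython, with its translation table set to 12, would be a more robust choice than this hand-rolled table.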