Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_69919 |
Symbol | |
ID | 4836862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 2262832 |
End bp | 2265931 |
Gene Length | 3100 bp |
Protein Length | 861 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640388177 |
Product | predicted protein |
Protein accession | XP_001382662 |
Protein GI | 126132274 |
COG category | [S] Function unknown |
COG ID | [COG5594] Uncharacterized integral membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.634298 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0404263 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CAGCAGTCTA GACGTAGCTT TCTCTTTGAT CTGCTAACAG AACCCAGATT GTGCATTGGC AGTGTAACTA TACTCATTGC TTCTTTTGGC ACTATCCTTG AAGCTATCTT GGGCGCTATT CATTATTGTA GCTGCCTTTG CAACTCATTA CCAGTATTGC TTGATACAGC TGACCATTGC TATACCAAGC GATAAAATCT ACTGCTCGTT ACAAACTAGT GGCTTGTAGC AATACTTCAC AGTTGGCCTA TTGTACCCCC CCAGAAAACA GGATCTGATA TCGCTAACAT CCCAAAATCA TGGCATCCAC AGCTACTTCC ACCTCCGCGG TGCTTTCTAC GTTAGTGGCC AACCTCATTC TTTTTGGTAT CTTCATCCTA GGTTTCTTGA TCCTCAGATT GAAGTACAAG AGAATCTACT CCCCAAAATC GTCGTTCGAG CTCGTGCCCG AGGACCAACG TCCCGAGCCG TTGCCTAGGG ACCCATTCAG ATGGATTTTC ATTCTTTTAA CCAAGCCCAA CTCATTCATT ATCCAACAAG CAGGAATTGA TGGCTACTTC TTCTTGAGAT ACGTCTTCAG CTTTGCCTGT GTCTTCTTGG TAGGAATGCT TACCTGGACT GTCTTGTTGC CTATAAATGC CACCAACGGT AAAGGTGCCA CTGGATTAGA CCAGTTGGCC ATCTCTAACG TCAAGGACAG AAACAGATAC TATGCTCATG TGTTTATTGG CTGGGTTTTC TACGGTGGAG TCATCTTCGT CATCTACCGT GAATTGTTCT TGTACAACTC CTTGAGATCT GCCGTTCTTG CATCCCCTAA ATACTCCAAG AAGTTATCGT CGAGAACTGT TTTGTTCCAG ACTGTCCCAG ACTCGCTTTT AGACGAGAAG CAATTGTACA AGATGTTCAA CGGTGTCAAG AGAATCTTCG TAGCCAGAAC TGCTAGAGAC TTAGAATCTA AAGTGGCCAA GAGAGACGCC TTGGTCAAAC AGTTGGAGAA CGCTCAAAAC AAGTTATTGG CAACAGCTGT CAAGAACAAA ATGAAAGCTG AAAAGAAGGG CCAAAAATTA GAACCTGTGG ATGAAATCTC TGCCTACGTG CCTCAAAACA AGAGACCTCG TCACAAATCC GGCGGTTTTT TCTCCAAGAA GATCGATACT ATTAACTACT GCAAGGAAGA AATCCCTAAA ATCGACAAGG AAGTCAGAGC CATGCAAAAG AAGTTCAGAA CTAATAGACC CAAGAATTCT ATCTTTGTTG AATTCGAAGA CCAGTACCAT GCTCAGTTAG CCTACCAAGC TACTGTGCAT CACAACCCGT TGAGAATGAA GCCTGTTTTT ACTGGAGTTG AACCAGGTGA CGTTCAGTGG TCCAACTTAA GAATGTTCTG GTGGGAAAGA ATCACTAGAA GATTCCTTGC TTTTGCTGCT GTAGTTGCTT TGATTATCTT GTGGGCCGTC CCCGTAGCTT TTGTCGGTGT CATCTCTAAC ATCACTTACT TGACTAACAA GTTGCCCTGG TTGAGATGGA TCTTGAACAT GCCTCACTTC TTGTTAGGTA TTATCACCGG TTTATTGCCT GCCATCATGT TGGCACTCTT GATGATGATT TTGCCTATGT TCATCAGAGG CATGGCTAAG ATCGCTGGCG CCCCAACTTA CCAGGCTATC GAATTGTACA CTCAAAACGT CTACTTTGCC TTTTTGATGA TTAACGGTTT CTTGGTTACT GCTCTTGCTT CATCAGCTAC TTCTACCGTT ACACAGATCA TTGAGGAGCC AACTTCTGCC ATGAGCATTT TGGCTAACAA TTTGCCTAAG TCTTCTAACT TCTACATTTC GTATATTATC TTGCAAGGTT TATCCGTTGC GTCTGGGTCT CTTTTCCAGA TTGTAGGTTT AATCTTGTTC TACCTCTTGG GCAGACTTTT GGACAACACC GTCAGAAAGA AGTGGAACAG ATTCAGCGGC TTAGGATCCA CGGCTTGGGG TACTACTTTC CCAGTGTTTA CCAACATTAC TTGTATTGCA CTTATCTACT CCATTATCTC ACCTATGATC ATGTTATTTG CTTGCGTTGC CTTGTTCTTG ATCTACATCG CTTTCTGCCA CAATTTGACT TACGTGCTTA AGGAAGGCCC CGACACCAGA GGGTTGCACT ATCCAAGAGC TCTCTTCCAG ACTTTTACTG GTATTTACAT TGGTCAGGTT TGTTTGTTGG GTATCTTCGC TGTCGGTAAA GGTTGGGGAC CAATTGTGTT GCAGATTATC GGCATTTTTG CAACTGTGTT CATCCACATC AACCTTAACG AGTCGTTTGA TCACTTGCTC CAGGTTGTTC CTATCGACTG TATGAGAGCC TTGGACGGTG TTTCTCAGAC TGCTTCGTTT ACCGGCTCTA GTGAATACAA GAGAAAGGTA TTGGACAGAA AGACAGGTGC TGGCAAAACC GAAAAGGCTA TTGCTGAAGA CAAGGAAGAA CAAGAGCAGA TCAAGCGTGA TATCCTTCAA GAAGATGGCG AGTTCAACGA TGGTGAAAAC GAGAGAACTC TCATTCCGTT ATTGGCTGAC AGGGACTTCA AGACAACTGA GTCTCAAAAT GTCTTTGTTC GTTTTGTCAG ACCAGACGTG TTCTTGAACT ACAGGCATGC CAAGCAGCAA TTACCTGCTA CATACAACAT CGAACCTGAA ACTGAAGATG ATAAGCATGC CTACGATATG CCTGTCATCT CGGCTCCATT GCCTGGAATA TGGATTCCAG CTGATCCTAT GGGATTCTCC AAGCAACAGA TTGAAGAGTT CAAAGGTATT GTCAGCATCT CTGACGAAAA CTCGGGCTTC GACGAAAAAG GTGCCATTAC CTTCCTCGGA GAAGCTCCTA ACTAGGCTAT TTACTAGTGG TTCATGTTTG TTTTATAATG GAAAAGTGTT TGATTTGGTC GTTAATTTTC TAGACTAGAA GAAGACAATT GTATATATGT TTCTCAATAT AGAAAGAAGC TACAATTAAT GTAATTTAGG CCGTATTGTA AACTAGCCAA AATTGTCTTA CTTCACTACA TCTTCTTTAC ATCTCTTCCA ATGCTACAAA TATTTCTTGC
|
Protein sequence | MASTATSTSA VLSTLVANLI LFGIFILGFL ILRLKYKRIY SPKSSFELVP EDQRPEPLPR DPFRWIFILL TKPNSFIIQQ AGIDGYFFLR YVFSFACVFL VGMLTWTVLL PINATNGKGA TGLDQLAISN VKDRNRYYAH VFIGWVFYGG VIFVIYRELF LYNSLRSAVL ASPKYSKKLS SRTVLFQTVP DSLLDEKQLY KMFNGVKRIF VARTARDLES KVAKRDALVK QLENAQNKLL ATAVKNKMKA EKKGQKLEPV DEISAYVPQN KRPRHKSGGF FSKKIDTINY CKEEIPKIDK EVRAMQKKFR TNRPKNSIFV EFEDQYHAQL AYQATVHHNP LRMKPVFTGV EPGDVQWSNL RMFWWERITR RFLAFAAVVA LIILWAVPVA FVGVISNITY LTNKLPWLRW ILNMPHFLLG IITGLLPAIM LALLMMILPM FIRGMAKIAG APTYQAIELY TQNVYFAFLM INGFLVTALA SSATSTVTQI IEEPTSAMSI LANNLPKSSN FYISYIILQG LSVASGSLFQ IVGLILFYLL GRLLDNTVRK KWNRFSGLGS TAWGTTFPVF TNITCIALIY SIISPMIMLF ACVALFLIYI AFCHNLTYVL KEGPDTRGLH YPRALFQTFT GIYIGQVCLL GIFAVGKGWG PIVLQIIGIF ATVFIHINLN ESFDHLLQVV PIDCMRALDG VSQTASFTGS SEYKRKVLDR KTGAGKTEKA IAEDKEEQEQ IKRDILQEDG EFNDGENERT LIPLLADRDF KTTESQNVFV RFVRPDVFLN YRHAKQQLPA TYNIEPETED DKHAYDMPVI SAPLPGIWIP ADPMGFSKQQ IEEFKGIVSI SDENSGFDEK GAITFLGEAP N
|
| |