Gene Information |
Locus tag | PICST_88690 |
Symbol | |
ID | 4838125 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 492599 |
End bp | 495499 |
Gene Length | 2901 bp |
Protein Length | 629 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640389440 |
Product | predicted protein |
Protein accession | XP_001383729 |
Protein GI | 150864761 |
COG category | [R] General function prediction only |
COG ID | [COG5354] Uncharacterized protein, contains Trp-Asp (WD) repeat |
TIGRFAM ID | |
|
|
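The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length; the function names are illustrative and not part of the record. It also computes the minimum number of coding nucleotides needed for the listed protein length. The remainder of the gene span is presumably intron and/or untranslated sequence, which fits the "Is gene spliced | Yes" field, though the record does not break it down.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def min_coding_nt(protein_aa: int) -> int:
    """Nucleotides needed to encode a protein: one codon per residue plus a stop codon."""
    return (protein_aa + 1) * 3


if __name__ == "__main__":
    span = gene_length(492599, 495499)   # values from the Start bp / End bp fields
    coding = min_coding_nt(629)          # value from the Protein Length field
    print("gene span (bp):", span)
    print("minimum coding nucleotides:", coding)
    print("non-coding remainder (bp):", span - coding)
```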
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTCTTCCACA ACAAGTGTCG AAACGGGACT AACAAACCCA ACTGTGTGAT TGCTATCTTT AATAGCCGAT TTCGTCCTTG TAAGCCACTT CTCCAGCTGA TTTCCGTCGA TTATTCACTT AGTTCGTCTA CTACTGGAGT CCGTAGAAAA TTCACAACCA GATAGGATTC CCGATCACAA GTAAAAGCAT AAAGCAAACA ATACCATGTC CAAGTCCACC GAGTTCTTCT GTATGTATCT GACTTTGAAA GAGGGAATGG CAAGAAGCCT GACTAGTGAG AATTAAAGAG AATTGAGATC AGCGTAGTAC GGCTGATTTG GTTGATTTGA TTCTGAAATT TCTGGAAATT CTGGAGAAGA GAATTTTTGT CATGAAACTA TTCCAGATCT CTACCCGGAG ACAGACATTG TTAAGATTGA GGGTATATGA CTACGAGTAT AGAGGATAAT ACCGATAATG CTGATATCCA GATAATGGCA TTGCCACTGT TAGTGAATCT CATCTCTCAT CTGAAATCTG TAAATGCATC TGAAAATCAT TTGAGTATTC GCGTACAAGA ATTATGAACT TTTGAATTTC AATAAATTCA TTCTACAAAC ATCTACAAAA TCCTCAGAAT GTACAGAATC CACAGAATAT GCCAAATCTA CAGATCTACG AAATCAACAA TTTCATCAAT ATTACGAAAT TTTTGAAAAT AATTCTCATT TTCTCTCATT CTCATACTCT TCATACTTAA CATCTTATAC TAACCTGCCA GGTCGTCTAC CACGGTCGAT AGAATTGACA CATGACTACG AGGAAATCAC GCCGTCTCCC AAAGCGGCGG AAGAATGCCG TTCGGCCTTG TACTCGCCCA ATGGAGCATT CTTTGCTTAC ACCCAGCCCA ACGAGGTGAT AGTATTGAAT ACTAAGAGTA AGGCTGAGTT GTACCACAAG ATCGAGTTGC CCGAGGTGTT TGATATATAT TTTTCACCAC AAGGAACGTT CTTATGTTTA TGGTGCAAAC CCATTCAAAT CAACCGTGAA AACGGAACCT GGAATAACAA CTTGAAGATT TTCAACTTGA AGACCAAGTC CTTGATAGTG GAATGGCTGC AGAAACACCA GAGCGGATGG AAGCCCCAGT TTACCCAGGA CGAGAAACTC GTAGCCAAAA ACTTCAACAA CAAGGAAATC CATTTCTTTG ACATCAGCAC GTCTCTGCAG GAAACCATAA ATATAAACCA GCCTACCCAC AAGTACAAAG TGGCTGATGC CAAACAGCCG TTCCAGAACT TCCAGATTTC ACCAGGTTTA AACCCCTCTG TTGCTATCTT CATACCAGAA GCTAGTGGCA AACCAGCCTC GGTGTTGATC TATAACGTTC CAAACTTCAA CCAACCCACC TGTAGCAAGA ACTTCTTCAA GGCTGAACGG TGTCAATTGA AGTGGAACTC GTTGGGTACA GCATTGTTAG CATTGGCCTC TACCGACCAC GATACCAGTA ACAAGTCGTA CTACGGCGAA ACCAACTTGT ACCTCTTGGG AATTGCCGGT TCATATGATT CCAGAATCGA CTTGAAGAGA GAAGGACCCA TTCACGACAT CACCTGGTCT CCTTCTGCCA GAGAGTTTGC TGTTATCTAC GGTTACATGC CATCAGAAAC GACTTTCTTT GATGCTCGTG GTAACGCTAT CCACTCGTTA CCTACAGCTC CACGTAACAC CATCTTGTAT TCTCCTCACG CTCGTTACGT GTTGGTTGCC GGTTTCGGCA ATTTGCAGGG AACTGTAGAT GTATACGATC GTCAGAACAA 
GTTCAGTAAA GTCGTAACCT TTGAAGCTGC CAACACTTCT GTGTGCGAAT GGTCACCTTG TGGACGTTAC ATCTTGACTG CAACGACTTC TCCTCGTTTA CGTGTGGATA ACGGCTTGAA GGTGTGGCAT GCATCTGGTC AATTGGTTTA CTTGAAAGAG TACCAAGAGT TGTATGCAAT TGGATGGAAG CCTCAGACTA TCGCTGAGTT TCCTCCTTTG AAGCAATTGG AGCCTGCTCC ACCAGCCCAT GACTCGGCAC GTGAATACAT CGCCAAGAAA GCTGCTGCTT CAGCTACTGC TGCTTCCAAG CCTGCTGGTG CATACCGTCC TCCTCATGCC AGGGGCAGTT CTGCTCCAAG CACAGCTACT TCCTTGTACC AGAAGGAGCT TCAAAACAAC TTGAAGTTAC AACAACAGCA GCGTAATGGA ATCAACCCTA GTAGTGGCAG AGCCCGTGTT GTTCCAGGAG CCAACCCTGT AGAGATCAAG GAATCCAAGA CAGCTCAAAA AAACAGAAAG AAGAGAGAAG CTAAGAAGAA CCTGAAGGAA GACTCTCCCT CTATAGAAGG CTCTCCTGCA CCATCTGCTG CTCCTTTGGG TCCACCACCA GGACTTGGCC AATTATCGTC TGCCCCAGCT TCTGCCCCAG CTTCTACTCC AGCTGCTCCA GCTACTCCCT CACCAGTGGC TCCGGCTGTT GCAGCTACTT CCTCTCAGGG AGGTGTTGTT GTTGGCGGAG TTGCGCTGTT GGAAGAGAAG AAAATCAGAT CGTTGTTGAA GAAGTTGAGA GCCATCGAGA CCTTGAAGAT GAAGCAGGCC AGCGGCGAAC CTTTGGAAGA CACCCAGGTC AGCAAAATCA ACAAGGAGGA CGACATCAGA AAGGAGTTGA GTGCATTGGG CTGGAACGAT TAGGCGACCT CAGTCTACTA CACTCTACCA TCTATATAAA TAATGTAGCA TTGCAGGTGT CCAGACCACA ATTCAAAATA TAACAGAGTA GAAATGTCTC CTGTAAAATG GGACTAGAAC TATAGAGATG TACCACTACG GCGCATTGTA AAATCTTGAG TATTGCCACT CCACAATATA TGATAAAAGT C
|
Protein sequence | MSKSTEFFCR LPRSIELTHD YEEITPSPKA AEECRSALYS PNGAFFAYTQ PNEVIVLNTK SKAELYHKIE LPEVFDIYFS PQGTFLCLWC KPIQINRENG TWNNNLKIFN LKTKSLIVEW SQKHQSGWKP QFTQDEKLVA KNFNNKEIHF FDISTSSQET ININQPTHKY KVADAKQPFQ NFQISPGLNP SVAIFIPEAS GKPASVLIYN VPNFNQPTCS KNFFKAERCQ LKWNSLGTAL LALASTDHDT SNKSYYGETN LYLLGIAGSY DSRIDLKREG PIHDITWSPS AREFAVIYGY MPSETTFFDA RGNAIHSLPT APRNTILYSP HARYVLVAGF GNLQGTVDVY DRQNKFSKVV TFEAANTSVC EWSPCGRYIL TATTSPRLRV DNGLKVWHAS GQLVYLKEYQ ELYAIGWKPQ TIAEFPPLKQ LEPAPPAHDS AREYIAKKAA ASATAASKPA GAYRPPHARG SSAPSTATSL YQKELQNNLK LQQQQRNGIN PSSGRARVVP GANPVEIKES KTAQKNRKKR EAKKNSKEDS PSIEGSPAPS AAPLVAPAVA ATSSQGGVVV GGVASLEEKK IRSLLKKLRA IETLKMKQAS GEPLEDTQVS KINKEDDIRK ELSALGWND
|
| |
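Translation table 12 is the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the difference on a short fragment taken from the gene sequence above. It builds both codon tables from the compact 64-letter amino-acid strings (codons ordered over the bases T, C, A, G); the helper names are illustrative, and the fragment is assumed to be in frame, which is supported by its translation matching a run of the listed protein sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Table 12 differs from the standard code at one codon: CTG -> S.
TABLE12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(aas: str) -> dict:
    """Map each codon to its amino acid using the TCAG ordering."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, aas))


def translate(dna: str, table: dict) -> str:
    """Translate an in-frame DNA string codon by codon (no stop handling)."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # Fragment copied from the gene sequence in this record.
    fragment = "TTTGACATCAGCACGTCTCTGCAGGAAACC"
    print("standard code:", translate(fragment, codon_table(STANDARD_AAS)))
    print("table 12:     ", translate(fragment, codon_table(TABLE12_AAS)))
```

Under table 12 the fragment translates to FDISTSSQET, which appears in the protein sequence of this record; under the standard code the seventh residue would be leucine instead.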