Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_35884 |
Symbol | |
ID | 4838889 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 721536 |
End bp | 724445 |
Gene Length | 2910 bp |
Protein Length | 706 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640390204 |
Product | predicted protein |
Protein accession | XP_001384100 |
Protein GI | 150865048 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGTGA ATCCAGTCCA TTTAGAACGA AAAGAGTCAG CCTATCAGCC AGTGCAAGAA AGCCATGACC AAGCCAATGA CCAAGTTGAT ACTGTACTAA ACGAAGATGT CAATAGGGTT GAAAGTGTCG ATGCGCCAGA AGTCGTGGTT GCTACTTCAG AACAATATGA TCCCTTGGTC CAAGAAGAAG AGGACCTTGA AGACTATGAG CCTAGAATTG ACCATGAACA AGTAGGTGGT TCAGACAGCG AAGTATCGGT ATCCCATCCT GCCGAGGAAG CTCAGGACGA AGAAGAAGAG TACGATCCTG AAGCAGCCTT TAATGAGTTT GAGCAAGAGT TGAAACATGA GCAGAATCAG GAGACTGAAG TTCGTAAAAT CGAAAATCAT GAACCTCAAA GTGTAGTTGA AAACCCTGAA ACTGAAAGAA GTCATGAAGG AGAAAATTTT GAGACCCAAA CTAATTTTGC TAATCATGAA ACTGAACCTG TAAATGAGAA AGTAAATCAC GATGAAACAC ATGTAATCGA AAGGCACAAT ACTGATCACT TAACTGAAAG TTCTACGCCT GAAAATCAAA CTGAAGCTAA TAAACCAAAT CATGATCAAG AGCTTGATGG AACTCATAAC TTTGAGAATG TGATTGAAAA AGATGATGAT CTCCAAATGC GTAATCCTGT CGATGATGTT GAAGACGAGT TGTATGTTCG CAAAACTAAT GTTGAAGACA ATGCGGATGA AGAAAATGAT GAAGACGAAG ACGATTATGA TCCGGAAAGT GCCTTGGACA GAAACATTAA GCCATCCCAG TCTCCTGTGC CTATAAATGT AGCCTCTGAA CCTGAAAACA AACCGTTAAA CCCCATTCTT AAGGGAAAGA GCCCTGTAGA CAGTCTTCCA CCCAAACCTC AGGTAGGTGG TTTCAAATCT TCTCCTTCTC TTCCACCTGC TCCCACATCG CAACAAGATA TTAGACAAGC GTATGAGGCT ATTATGCAAA GCGATTTGGT TAAGGATCCC AATTTTGTCA ACTTGTCGCA GACAGAACAG ATGAAACTAA TTCAAGACCA GTTGGAGAAA AGGCAAATGA ACTTGGCTGG AAAAATTGAC CCAGATATGA ACTATGATCA AGTGTATTCG TACAATAAAC CCTATAAGAA CTTGAAAGAT CCCATCCCTC TTATACCTGT CAACAAGTTC TGTCGTAGAC CAAATATCAC TGCTCCTATG ACCCCCGAAG AGGAGGCAGC TTATAAGGAC TTTATTAAGA CAGAAGCTGA CTACTTGGAT CTGGTTACAT GGGAAGAGTT TCCTGACAAC CTGAGATTGT TCATAGGTAA CTTGCCAGCA AACACCATCT CCAAGCAAGA TTTGTTTCGT ATTTTCAGCA AGTATGGGGA GGTTATACAG ATAGCTATCA AAGCAGGATA TGGGTTTGCC CAGTTTAAGA CTGCCGAGGC ATGTTTGGAA TGCATTAAAG GAGAATCAAA TGTACCATTG CATAACAAGA TGATGAGACT CGATGCTTCT AAGCCTCAGA AAGTAAAGAA ACAAACAAAA CCTGAGTTCT CAGCTGGAGG TTCTGACGAG TCCTTCGGTA CAAAGAAATT CATTCCAGAC TGCCAATTGT ACATCACAGG AAAGTCTCCT GTATTTTTTA TTAGAAAGGT TAAGAAGACA TTTGCTAACT CTCAGATCAC CATTGATACG GAAGACGTCA CGCATAAGGA CATCGGTGAT GTCATCAGTG AAGCAGCGTA TTCTGGAGTT CTTGCTGCTT GTATCATCAA GGAGCTTAAG GTTGATGTAC AGACATTTGA AAGCACAGAA GATGGTGGCG TCAAGTTTGA CGAATACGCT GATATTGATC CAGAAGTAGC CGTAGAAATT TTGACAAAGG CAAAGGAAAA GAGATATGGA AACAATCTTC CACAATACAT CCCTCAAGAA GAGTCATATA ACGAAAAGTC ATATGGTGAA AAGTCATATG AAAAGCCACA TAACGAAACT TCTCTTCCGC AGCAGCCATA CGGATTTCGT CAACAACATG ACCAGAGTCA ATATGGTGGA TATGAGGACA ATCAAGGTTA TCGTAAGAGA TCAGGTCCTG GTTATTCCGC AGGCTCTTAC AAGAGGCAGC ATTATAACCA GAATTACAGA CAACATCGCC ATGGTGACTG GGGCAAACAT CATCAAAATA ATCAGCAGCA ACAGCCATAT GGCCAACCTC CACCATTTCA AGCACAGCTT CCGTATGGCC AACCTCCTCC TCCTCCAAGT AATAATTACA CTTCATCTAC TCAGAACTAC AGCCAACCAA ACCTGTTTAG CCAACAGAAT CAACAAAATC AGTATCATCG ACAACAAATT AATCAACATC ACCAAGGACC ACAAACCCAG TATAACCAAG GACCACAAAA TCAATATAAC CAAAGACCAA AACAGCAATA TCAAGAACCA CCACAAATTC AACACCAACC ACCACCACAA ATTCAATATC AACAATCACA AGTTCCAGTA AACTCGAACC TGTCAAACAT TGCGCAAGCA TTGCAAGGTT TAGATGCCTC ACAAGTGCAG AATGTAATTA ATATTTTACA ACAACAACAG CAACAGGCAC CACCTCCTCT ACAACCCCAA GTTCCTCAGC TACTTATTCA ACGTCAACCA TTATCATATG TGCAAGCTCC ACAGATTAAC TATGGAGGAT ATAACCAACA ACAGCAATCA CCTCAACCAG AGCAATATGG TAATGCTCCT CTTAATAACC CAGCCAGTAG CCAAGTAAAT GCTTTGCTTT CACAATTGAA CCGCAACAAT AGCAATATTC AAGGATATCA AGGCTCCTCT CAGTCTAATA GTACACTGAC TCTCATGGAG ACGTTGGCAC GTTTAAGCAG AAAACAGTAA
|
Protein sequence | MEVNPVHLER KEVESVDAPE VVVATSEQYD PLVQEEEDLE DYEPRIDHEQ VGGSDSEVSV SHPAEEAQDE EEEYDPEAAF NEFEQELKHE QNQETEVRKI ENHEPQKNDE DEDDYDPESA LDRNIKPSQS PVPINVASEP ENKPLNPILK GKSPVDSLPP KPQVGGFKSS PSLPPAPTSQ QDIRQAYEAI MQSDLVKDPN FVNLSQTEQM KLIQDQLEKR QMNLAGKIDP DMNYDQVYSY NKPYKNLKDP IPLIPVNKFC RRPNITAPMT PEEEAAYKDF IKTEADYLDS VTWEEFPDNS RLFIGNLPAN TISKQDLFRI FSKYGEVIQI AIKAGYGFAQ FKTAEACLEC IKGESNVPLH NKMMRLDASK PQKVKKQTKP EFSAGGSDES FGTKKFIPDC QLYITGKSPV FFIRKVKKTF ANSQITIDTE DVTHKDIGDV ISEAAYSGVL AACIIKELKV DVQTFESTED GGVKFDEYAD IDPEVAVEIL TKAKEKRYGN NLPQYIPQED SHTDFVNNMT RVNMVDMRTI KVIVRDQVSV IPQALTRGSI ITRITDNIAM VTGANIIKII SSNSHMANLH HFKHSFRMAN LLLLQNYSQP NSFSQQNQQN QYHRQQINQH HQGPQTQYNQ GPQNQYNQRP KQQYQEPPQI QHQPPPQIQY QQSQVPVNSN SSNIAQALQG LDASQVQNSN STSTLMETLA RLSRKQ
|
| |