Gene Information |
Locus tag | PICST_33495 |
Symbol | |
ID | 4840628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 785147 |
End bp | 787372 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640391943 |
Product | hypothetical protein |
Protein accession | XP_001386346 |
Protein GI | 150866678 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00756] pentatricopeptide repeat domain (PPR motif) |
|
|
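The coordinate fields above are mutually consistent if Start bp and End bp are read as 1-based, inclusive positions on NC_009047: 787372 - 785147 + 1 = 2226 bp, and 2226 / 3 = 742 codons, one of which is the stop codon, giving 741 aa. The sketch below illustrates that arithmetic and how a minus-strand feature could be pulled from a replicon sequence. The function names are hypothetical and the 1-based inclusive convention is an assumption inferred from the numbers, not something the record states.

```python
# Minimal sketch; function names are illustrative, not part of any database API.
# Assumption: Start bp / End bp are 1-based, inclusive coordinates.

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


def extract_feature(replicon: str, start: int, end: int, strand: str) -> str:
    """Return the feature sequence, reverse-complemented for the '-' strand."""
    segment = replicon[start - 1:end]  # convert to 0-based, end-exclusive slice
    if strand == "-":
        return segment.translate(_COMPLEMENT)[::-1]
    return segment


if __name__ == "__main__":
    length = feature_length(785147, 787372)
    print(length, protein_length(length))
```

Because this gene is annotated on the minus strand, the Gene sequence shown in the record would correspond to the reverse complement of positions 785147 to 787372 of the replicon under the same assumption.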
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.375404 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0265895 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCCCT TGAGACAGAC GGTGTTCAGG AACTTAACCA GGACAAATAT TGTGAAACTC AGAAGACTGG GGCAATTTTT GAGTCGTTTT CAATCTACAG CAGCTGACAC TTTGAAGGCT ACTACTTCTA CGAGCAGCCC AATAGTGAAA AATGACAAGA ATGTCTCAGA AAAGGCACCA GAAGGTTCAA CTAAAAACGG AAACCACTCC AAAAAGGCTC CTCATCGTAA TGTAAAACTC AGAGATATCA GTAACCAGAT CCAAGATTTG GTTAAGTCGT CCAAATCTGA TCTCACAGAA GCCATAGAGA TCTTGGAAGA AGGGTTATCG TACTTGAGAG AAATTCAATT GGCTGAAAAC ATCTCCGACA ACTCGATCTA CTTCCGTTTC CAGCCAATTG TGACCGAGTT ATTGTTTAAA GCTTTAGACC CCGCTACTTC GCTCGGAAAT AAATCAGTGG AGGACGTTTT GGAAATCTTC ACCAAGTATG GCGTCTGCCA CAAATACCAT TACACTGTAG TAGCTTCCAG GTACTTCAAG AGTGGAGAAG ATAAGGCTGT GATCTATCAG AATGTGCTTA AGCTCTGGCT CCAATTTTTG GAATTCGAAA AATCGAACAA TGCTACCGGA ATGGCTTCTG TCAAGGCTGG AGAAATAGAA TACCGTCCTT ACTACCTCCC CAACCTTGTT TATTTCGCAT ACAGCCAGAC ATGTAGTCTT CAGGGGGTCA AGTTTTCGTT TGAAGATGCT TCCAAGCTCT TGAATCAGAC GTTGCCCAAT CCAGTTTTGA TCAGAAACTC GCTTATGGAC TTGCGTATAT TCGACGGCTA CAAAAAGGAG TTCCAAAGTT TTCAAAAGAG CATCCACGAG TTGTCTGCTG AGTCACTTGA TCCTAACGGT CCTGAAGTCT ACAGGAAAAT CAGAGAAGCT GGTGAAAAGA AGAATCAGGT AGCCTTGAAT ATAATCCAAA AAGAAATCCA GGAAGCTGCA GCTAGAAACA ATAATCCAAT CAATGAAGAC ACCTTGATCA GATTAATGGA TGGATACTAT GAGGCTGATA GACCAGATGA GGTTTTTGCT ATTTTCCAGA ACTTGCTTCT GCACGGAATC GAGAAACCCT CCATTCGTGC CTGGGATGTA GTGTTGAGAA CAATGGGTAC TCCATCATAT ATCTCTAAGA TTTCATCTGC TCAAAGAGGC AAGCTTATTA AAAGTATTGA GCTGACCATC GAAACTATCT TGAACAATGG CACAGAGATC ACTGCTAAGA CCCTTTCTAT CATCATTGGT AGTTTTGCCA ACTTGAACAA GTTCGACAAG GTTGACGAAT ACTTGCAGAG ATTTAGCATA GAAGGCGAGG GTAAGCTTCC TGTGATTGCT CCAACCAAGA ATAACATTTT GATTGGATTA GCGTTGAATA AAAAGATTAG CGAAGCGGAA GAAAAGTTGA AGGAGTTTGT CAGAGCTGGT GGGTACGTTC CTTCTACGTC TGTGATGAAT ACATTTTTAG GCTACTATGC TAAAATTAAT AACTATGCTG CCGTGGAAGG TATTCTTGAG TTCATGAAGA AGCACAACAT TCCAGAAGAA GTTGGAACTT ACACTTCAGT TATCGACATC TACTTTAAGA TGCACCGTGA AAAGGGATTG GTAGCTGACG TTGACAAAGT TCTCGACAAC ATCTCTGCTT CCAAATCTAT ACCATTGAAC GACTTCACTT ACACCGCATT GATTGACGGT TTGGTGAAGA ACGGAGCCAA TATAGAAGCT GCTCGTTCCA TCTTCGAAAA GGCTTCGAAG 
AAGTACCCTG CTTCTGCTCA CTTGTACACT GCCATGTTGA GAGGTGAGTT GGACCAAGGC TCTGTCAGTT CAGCAGAAAA GTTGTTTGAT GTCTACATTA AGAAGATTAG AAACGACGCC AGAATCTGGA ACACAATGAT AAACTCGTTG TTGAGCAAAC GTGAAGAGCT TGCCTTGCAA TACTACGAAA ACTTGAAGAA TGATGCCCAT TCTTCGCCAA ACCACTTCAC ATACTACTTC TTGTTCCATC ACTTCATAAA GAGAGGAAAC AAAGAAACAG TGCAACACCT AATTGACGAC TTGTCACAGA AACCTCTCAG AGACTTCGGT AATGAGTTGC CCAAGATGTT GGGCAAATTA ACGGGAGAAT ACAAATTTGG TCCAGAACTC ATCAACATTC TCTCCAACCA AAAGAAGCAG AACTAG
|
Protein sequence | MIPLRQTVFR NLTRTNIVKL RRSGQFLSRF QSTAADTLKA TTSTSSPIVK NDKNVSEKAP EGSTKNGNHS KKAPHRNVKL RDISNQIQDL VKSSKSDLTE AIEILEEGLS YLREIQLAEN ISDNSIYFRF QPIVTELLFK ALDPATSLGN KSVEDVLEIF TKYGVCHKYH YTVVASRYFK SGEDKAVIYQ NVLKLWLQFL EFEKSNNATG MASVKAGEIE YRPYYLPNLV YFAYSQTCSL QGVKFSFEDA SKLLNQTLPN PVLIRNSLMD LRIFDGYKKE FQSFQKSIHE LSAESLDPNG PEVYRKIREA GEKKNQVALN IIQKEIQEAA ARNNNPINED TLIRLMDGYY EADRPDEVFA IFQNLLSHGI EKPSIRAWDV VLRTMGTPSY ISKISSAQRG KLIKSIESTI ETILNNGTEI TAKTLSIIIG SFANLNKFDK VDEYLQRFSI EGEGKLPVIA PTKNNILIGL ALNKKISEAE EKLKEFVRAG GYVPSTSVMN TFLGYYAKIN NYAAVEGILE FMKKHNIPEE VGTYTSVIDI YFKMHREKGL VADVDKVLDN ISASKSIPLN DFTYTALIDG LVKNGANIEA ARSIFEKASK KYPASAHLYT AMLRGELDQG SVSSAEKLFD VYIKKIRNDA RIWNTMINSL LSKREELALQ YYENLKNDAH SSPNHFTYYF LFHHFIKRGN KETVQHLIDD LSQKPLRDFG NELPKMLGKL TGEYKFGPEL INILSNQKKQ N
|
| |
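Translation table 12 is the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. This is visible in the record itself: the codons AGA AGA CTG GGG near the start of the gene correspond to RRSG in the protein. The sketch below shows one way to reproduce that. It builds the standard codon table and applies the CTG override; the names `build_codon_table`, `TABLE_12`, and `translate` are hypothetical, and start-codon differences between tables are ignored.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of genetic code tables.
BASES = "TCAG"
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_codon_table(overrides=None):
    """Standard genetic code as a dict, with optional codon reassignments."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    table.update(overrides or {})
    return table


STANDARD = build_codon_table()
# Table 12 (alternative yeast nuclear): CTG encodes Ser instead of Leu.
TABLE_12 = build_codon_table({"CTG": "S"})


def translate(dna, table=TABLE_12):
    """Translate a CDS, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 80 nt of the gene sequence, in the 10-nt blocks used by the record.
    head = ("ATGATCCCCT TGAGACAGAC GGTGTTCAGG AACTTAACCA "
            "GGACAAATAT TGTGAAACTC AGAAGACTGG GGCAATTTTT")
    print(translate(head))            # MIPLRQTVFRNLTRTNIVKLRRSGQF
    print(translate(head, STANDARD))  # MIPLRQTVFRNLTRTNIVKLRRLGQF
```

With the standard code, the same stretch would show L where the record's protein sequence has S, which is why the translation table field matters when re-deriving the protein from the gene sequence.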