Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_1171 |
Symbol | |
ID | 4838103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 1272892 |
End bp | 1275552 |
Gene Length | 2661 bp |
Protein Length | 887 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640389418 |
Product | predicted protein |
Protein accession | XP_001383529 |
Protein GI | 150864629 |
COG category | [S] Function unknown |
COG ID | [COG5644] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0269052 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0478294 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ACCAAAAAGC GTAGTTCCAA GAAAATCTTG GACGCTTTCC AGATAGCAGA ACGAGCTGAA AATGGCTCCG ACAGCGACAA CTACAACTCT GGAAGCGACG ATGATCTCAC AGTACAAGAC GGAGTTCTAG ATGCTTCGAA ATTCTTCAAG GAGCTGAGAA GACCCGACGG CAAGTTCGAC GATGAAGAAA TTGACTCTGA TGAAGCTTTG GGCTCGGACG ATGACTACGA CATTTTGAAC TCGAAGTTTT CGCAGACTAT AAGAGACAAA GCAAAAAAGA GAAAGACGAA GGAGAAGAAG AGAAGAAGAG GAGAACAAGT TAGCAGTGAT GAAAGTGAGG AAGAAGAAGA TGAAGATGAA GCTGGGTATG CCAGTATTGA CGAATCACAG TTGGTTTCGT TGTCCGAAGC TTGGGATATG GACGATGCTG ATTTGGAAAA ACATAGTAAG TCTAAGTCTA AGTCAAAGTC TGATTCCAAT AAGAATGAAT TGGTTTTTGA TGAAGATGCC TGGGAGACTG AATCATCACA AGAGTCTGAT TCTGATGAAG AAGACGAGGA AGATGAAGGT GAAGATTTAG AAGTGGAATC TGAAGAGGAT ATCTTCAGAA ATGAGTCTGG AGAAGAAGAA GACGACGAAC TTGATTTATC ACACACTGTT TCACACTTAA AATCTCAGAT GAAGAAGCCA GAAATACAAA AAAAACGATT ATTGACTGAA ACCAGAGAAG AGAACGAATT TGTTTTGCCT ACAGGGGGTA ACAAGTTATC TTTGAGTGAG ATGATGGCAG CTGTAGATTC CAGTGTTTCC AATGAAGCAT TTTTAATTGA CAAGGATGCT GAAAAATCCA AAGCTTTGGC CACTCCATTG CCCAAGAGAA TACAGGAGAG ACACGAGAGG AAAGCTGCTT ACGAAATCAC AAAAGAAGAG GTTAGTAAAT GGGAGGATGC GGTGCAGTCT AACAGGCAAG CAGAAGTGTT GAAATTCCCC TTGAATGCCA CTGTTACACA TAACGATACT GCAAGTACAT TCAAAACATC CACTGAGCCC ACTACTGAGT TGGAAAAGCG TATTCATGAT GTTTTGACAG AATCATCATT GTTGGACGAC AAGAAGGAAG CCACTTTCGA AGAGATAGCT GTTGCCAAAA TGTCTGCTGA AGAGATGAAA AAAAGAACCA ATGAGTTACG TTTGATGAGA GAATTGATGT TCAGAGATGA AAAGAGAGCT AAAAGAATAA AGAAGATCAA ATCCAAGCAG TACCACAAGA TCCAGAAGAA GGAAAGATTG AGAAATCAGG AATTAGTGGA AGGTTCTGAT GCTGAGAGCG ATGGTGAGGA CCACGACTTG AAGAGAGCAA GAGAGAGAAT GACTTTAAAA CACAAAACAC AGTCAAGTTG GGCCAAGTCA ATGATCAAAT CTGGATTATC CAAAGATTCT AATAACAGAG CTGAGTTGGA AGAAATGTTG AGACAGGGCG AAAAGTTACG GGCCAAACAA TTGGGCTATG AAGATGGAGA TCAGAGTGAC GAAGGAGTCT CAGATATCGA AAGGGATTAT TCCAACGACG ACGAGAATGC TGACGAAAGT ATCAAGAAGC TCGGAAAGGG TGTTTTGGGA ATGGACTTCA TGAGAAATGC CGAAGAAAGA AGTAGGCAGT CCAACTTAAA GGAGTTGGAA GAATTAAGAA GATTGGAAAA TGAAAAGGAT TTAGATTCAT TCGAAGATTC GAACAGTGCT CTAAGCTCTA GCAAGAACCA AGGAAGAAGA GTCTATGCTC CATCGGCTGC AAGTGCCAAT GCCGAGCTAA ACACTGTAAA CCAGATTACC ATTGCTGAAA TTACTGAAGA AGAATCTGGC AATTTGACGA ATAAACTCAG CAAGAAGTTC GATGTCGTGG ATTCGACTAA GCAAAAGCAA AAGTCTAAAG AAATCAAAGA ACAAGATGTA AACGAAAATG AAGAAGAAGA ATTAAATCCT TGGTTGGTTA CTTCGGCAAA TGAAACTTCT CAGAAGTCAC AGAAAATTAC TGTTGTAGAC AAATCGTCTT CGAAGTTGGC TAAGGCTGCT GCTAAGATTG CCAAATCCAA AAAGAGAAAG CAAGGCAAAA ATTATGAAGA CGAGGATGTG ATTGATATGG GCGAATCGAT GAAGATTGTG GATCATCATC CAGGCAGTGA GTCTGAAGAT GAAGACGTTA ACGAGGTTAG AATGTTCAAA CAAAAGGATT TGATCAAACA GGCCTTTGCT GGTGATGACG TGGTTCAAGA ATTTGAAATA GAAAAGAAAA GAGTCGTCAG AGAAGAAGAT GATAAGGAAG AAGATATGAC CTTACCCGGT TGGGGTGACT GGGCTGGAGG TAACAAAAAG TCCAAGAAGA GAAAAGTTGT TCGTAAGGTT GACGGTGTTG TTCAAAAGGA CAAGAGAAAG GATAAGAATT TGAAAAACGT GATCATCAAC GAAAAGGTCA ACAAGAAGAA CTTGAAGTAC CAGTCGTCTG CTGTCCCATT CCCATTTGAG TCCAGAGAAC AATACGAAAG ATCGTTGAGA ATGCCACTTG GACAAGAATG GACTTCCAGA GAAACCCATC AAAGAATGAC GATGCCTAGA ATTATCACCA AGCAAGGTAC AGTTATCGAT CCATTGAAAG CCCCATTTAA G
|
Protein sequence | TKKRSSKKIL DAFQIAERAE NGSDSDNYNS GSDDDLTVQD GVLDASKFFK ESRRPDGKFD DEEIDSDEAL GSDDDYDILN SKFSQTIRDK AKKRKTKEKK RRRGEQVSSD ESEEEEDEDE AGYASIDESQ LVSLSEAWDM DDADLEKHSK SKSKSKSDSN KNELVFDEDA WETESSQESD SDEEDEEDEG EDLEVESEED IFRNESGEEE DDELDLSHTV SHLKSQMKKP EIQKKRLLTE TREENEFVLP TGGNKLSLSE MMAAVDSSVS NEAFLIDKDA EKSKALATPL PKRIQERHER KAAYEITKEE VSKWEDAVQS NRQAEVLKFP LNATVTHNDT ASTFKTSTEP TTELEKRIHD VLTESSLLDD KKEATFEEIA VAKMSAEEMK KRTNELRLMR ELMFRDEKRA KRIKKIKSKQ YHKIQKKERL RNQELVEGSD AESDGEDHDL KRARERMTLK HKTQSSWAKS MIKSGLSKDS NNRAELEEML RQGEKLRAKQ LGYEDGDQSD EGVSDIERDY SNDDENADES IKKLGKGVLG MDFMRNAEER SRQSNLKELE ELRRLENEKD LDSFEDSNSA LSSSKNQGRR VYAPSAASAN AELNTVNQIT IAEITEEESG NLTNKLSKKF DVVDSTKQKQ KSKEIKEQDV NENEEEELNP WLVTSANETS QKSQKITVVD KSSSKLAKAA AKIAKSKKRK QGKNYEDEDV IDMGESMKIV DHHPGSESED EDVNEVRMFK QKDLIKQAFA GDDVVQEFEI EKKRVVREED DKEEDMTLPG WGDWAGGNKK SKKRKVVRKV DGVVQKDKRK DKNLKNVIIN EKVNKKNLKY QSSAVPFPFE SREQYERSLR MPLGQEWTSR ETHQRMTMPR IITKQGTVID PLKAPFK
|
| |