Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_70016 |
Symbol | |
ID | 4837190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 1158343 |
End bp | 1161763 |
Gene Length | 3421 bp |
Protein Length | 638 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640388505 |
Product | predicted protein |
Protein accession | XP_001382439 |
Protein GI | 150863830 |
COG category | [S] Function unknown |
COG ID | [COG4850] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.165048 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AAACTCCAAG CTCAAACTCA TCGCAGATAT CTATCGAGAC TACCGCGCTC TCAAGAAGCA CTACCCTGTT TCGCCCGAGC TTATCAGATT CTCCCGCAAA CGCTATGCCA AAATGACCAG TCCCTCTGAA TTCTCAAACA TCAATGGCAG AACGTCTTCT CCGCTAGATT CAGATCTCTC GCAGCCTCTT TCTCGTAGAC AAAGACTCTT GGGCCTAGCC AGAGCCACCA GAGACAACTA TATACCACGA CTCACTGGTC AAGTCACCCA GATAGCCTCA GGAGCTTCTC GCGCTTTTGC TACACCTGGA AGTGATCTCT ACGATGAACA GGGAAACGTC ATCTTCCCCA AAGATGCTTC TATTACCTTG TTCCCGTCGT ACACCAGACA GGTAGGCGAC AAATACTACA TAGACATCAA GGGCTGGGTC TCGTGCCCCG GTCTCATGAC TCGTAAAAAC AGATTGATCC TCTCATTAGT GAGGCAAGTT ACACGGTACA ACTCGGCCAA CTCAGATCAG GCTATACACC AGTTGGAAAG CGACAAGTTG AAACAGGATA TGCTCCAGGA TGACGTTTCT GATTTGGAAT CGTTTCATTC CGAAGCTTCT AAAGATTCGA ATCCAGACCA ACTTAGAAAT GTCACGAGCT CTAGCTCATC TGTACAGACT GGTCCGTCTG CTTTCAACAA CGAAGAGTTG ATGAAGGAGA GATTGGCATC ATTTATCGCC AGATCAATCC CTAATGCAGC ACTAACTGTG GTTATAGGCT CTCACGTAGT CCATTCTGAA ATCGCAGAAA AGGAAGTCTT CACAGATGCC AGTGGAAATT TCGAAACAAC AGTCCAGACC AGCTACCTTC CCTCAGTTAT TCAGGTCAAG GCTAACTCCG ATGATACTAT CTTTTCGTTT CAGGACGTCA TGTTCGTACC AGGTGAAGGG ATTGGTGTGA TCAGTGATAT CGACGACACT GTCAAATTGA CTGGTGTTAT AGGCGACAAA AGAGAATTGA TGACAAACCT TTTATTGAAG GAAGTGACAA CTTGGAGCAT TCCTCCCGTT ATTAGCTGGT ATGATAACAT CAAAAAACTA GACAACGTTT CGTTCCATTA TGTTTCCAAC TCACCCTGGC AATTGTTCAG TACAATCGAA CAATATTTTC GTGCCGTGAA ACTTCCCTAT GGGTCGTTCC ATCTAAAGCA CTACACCGGT AATATTATTT CCTCCCTTAT GGAGCCCTCT TCTTCTCGTA AAAAAAAGTC ATTAGACAAG ATCTTGAATG ATTTCCCTGA AAAGAAGTTT ATATGTGTTG GTGATTCAGG TGAAGCAGAT CTTGAAGCCT ACGTGGACTT GGCCAAATCT CACCCAGGGC ACATTTTGAG CATCAATATT CGTGTTGTAG AGGATTCTTT GTCTGACGTT GACGACAATA AGATCCTCAA TGAGCTTGTA AGAATATTGA CTACGAAAAG AAGAGTAACG TCTTCATCTG CAACTCCACA GCCGGTAGAA ATACCAAATT TGATTGACTT ATCTGATGAT AGCCCTGTAT CTACTCCACA GGCAGAAGAA AGAAGAGCCA AATTACCTCC AATGATACCG AAGAAACCAA CCAATTTGAA AGGAAATTCG CTTGAAAAGA AGCCACCCTT ACCTAGAAGA GATTATTTGG CAAGAGCTCA TACCGATTCT GAACTTGCTT CTAAACCTAC TGTGATTGAG TTAACAACTG TTGAACTGCC ACCTTTACCC AAACGACCTG ATGCCGTCTT GCATCACGCC AAAACTGAAA GCGATGAAGC TTTTTCACAG CATGAAAACA ATCTTTTTGA CAACTTGCAG AACATATACG ATTCGCCAAA TTTCTACGAA CTTGAAGAAA TGGATAGAAA AGGAGCCAAT TGGATTCGCA GAGTGATAAC ATCGTTACAA GATCTAGAAG GCTCCGGTAC GGAACTCAGG TTATTCTCTG ATGGAGACCA ACAGTTCTTT GCCAACAGCA CCGAGGACCT CCGAAATTTG AAGCGTTGAT GGTTACGAAA AAATTTCAAT GATGTTATAA TGTTATATAT AACGTAGGTA TATATAGATA TGGTACAAAC GTGAAAATGC TAGAGATCAC AATCAGGGAG GAAAATCAAT TCGTCTCGGA TCTGGAGATC AAGAATCTTT AAATGTATTA AAGACGAATA TCTTTAAGTA TTAAAGCTTA TGCCGACGAG ACGATGGGTT GCCCTTCTTC GGACACCGTC TCTTGAACAG AAGTATCCAC CTCTGTTTCT TTCGTGATTT CTGCTGTTTC TGAAGCAGCC TCTGGAGTAG ATCCTGGACC TGTCTCTGCC TCACTAGATT CAAGCACAGG CTCCTTTGGC TCTTCGGATT CCTTTGGTTC TTCAGATTCA TCGTTCTGTA AGTAAAGCTC CTGGATATGA GAACCATCGC CGTAATCTCT GACGTTTTCG TCGCTGTCTT CTGGTAAGTC CTTCCTGAAC TGCGACAAGT CCCATTGGTC TCCTTCATCC TCTTCGGTGT AATACTTGTC GGAGAACTTC TCCAATATGA CTTCTTTACC GGTCTTGGGT CTCTTTCCAG ACTTCTCGTC CTTCCTTTTG TCGTCTTCTC TCTTCTTCAA CTGATTGGCT TTCCATTTCT GGAATGTCTC GATAGTGATA GGCGTGAACT TGCTTTTGTC CAACTTGGTT CTTTCAAGCT CAAGGAACTC TTCCAAGGAA ATCTTGGGCT GTGACTCGGC AGCCAATCTT TCCAATCTCT TTTGTTCCTT TGTCTTCAAC ACAAATCCTG GAGGCAAAGA GTGTCTGTAC TTACATTCGT TACCACCATT GGGGCATACC CAGAACCAAC CATACTTTCC GTTTTCTACA GCTTCAATGA AATGCTTACA GACTTTATCA GTGGTAGTCT TGGGATTACC ATGCTTGGAC AGAATAACCT TACGTAATTT CTCTTCATCC CATTGGTCCA TGGTATCTTC TTCCTTTTCA GCTCTGGCAT CTGTGTACAA GTCCTTCTTG GCATCTTTTC TGCCGACATT GATATCGTGG GAAAACTTAC ACTTGTTACC TTTAGTACAA AGACCCTTTT TGAAAAATTC ACATAACACC GATTTGGGAT CCACACCAAA AGGAACTTTC TGCTGGACAA TACCAAAAAG TGCAGCTGCC TCTTTTTTTG CATCTTCCGC TGCCTTCTTC TCAGCTGCTA GTCTTTTAGC TTCAGCTTCC TTCTTTTTGG CCATTCCTCC ATCGATACCA GCCTGGATCT GCGAAATCTG CTGTTGCACT TTCTTGGATT TGTTCTTGTT CTTCAACCCG AACGTTTTAT CGTCCGCAGA CTTGGCCTTG GCCTTGGACT TGTTCTTGTC TGAAATCTGG GGCTGCTTCT TCTTTGGTGG CATCGCTGTT T
|
Protein sequence | MTSPSEFSNI NGRTSSPLDS DLSQPLSRRQ RLLGLARATR DNYIPRLTGQ VTQIASGASR AFATPGSDLY DEQGNVIFPK DASITLFPSY TRQVGDKYYI DIKGWVSCPG LMTRKNRLIL SLVRQVTRYN SANSDQAIHQ LESDKLKQDM LQDDVSDLES FHSEASKDSN PDQLRNVTSS SSSVQTGPSA FNNEELMKER LASFIARSIP NAALTVVIGS HVVHSEIAEK EVFTDASGNF ETTVQTSYLP SVIQVKANSD DTIFSFQDVM FVPGEGIGVI SDIDDTVKLT GVIGDKRELM TNLLLKEVTT WSIPPVISWY DNIKKLDNVS FHYVSNSPWQ LFSTIEQYFR AVKLPYGSFH LKHYTGNIIS SLMEPSSSRK KKSLDKILND FPEKKFICVG DSGEADLEAY VDLAKSHPGH ILSINIRVVE DSLSDVDDNK ILNELVRILT TKRRVTSSSA TPQPVEIPNL IDLSDDSPVS TPQAEERRAK LPPMIPKKPT NLKGNSLEKK PPLPRRDYLA RAHTDSELAS KPTVIELTTV ESPPLPKRPD AVLHHAKTES DEAFSQHENN LFDNLQNIYD SPNFYELEEM DRKGANWIRR VITSLQDLEG SGTELRLFSD GDQQFFANST EDLRNLKR
|
| |