Gene Information |
Locus tag | PICST_28190 |
Symbol | |
ID | 4850969 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 584270 |
End bp | 586303 |
Gene Length | 2034 bp |
Protein Length | 677 aa |
Translation table | |
GC content | 41% |
IMG OID | 640392677 |
Product | predicted protein |
Protein accession | XP_001387753 |
Protein GI | 126273930 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.458855 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.78983 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCCCA TAAGTCGAAA ACCGAGGGCA GTGTCCCGAA AGATTCTGAC TCCCTCACCC AGATTTGCAA GTCTAACACA AGCCAACTCA ACTTCACAAC AGCAGCCATT GCAGGCCGTA GACAAACAGG ATCTTCTTCA GAAATACCGG GAGGTAGGAA ATGGCCACGA AGCAGGTGAA GACATATTTG GTGGAGCTGA AGAGTCGGAT TTTGGCTTGG GTGACGATTT CAAAACTTTG AAAATTTCGA GCCTGGATCT CAATACGATC AGACAAAACC ATGCGGAGGC CAGGAAAGCG ATAAAATTCG ATGATGAACT TGGGCTACAA GACCAGGGCA CAATTTTTGG CAAGAATGGC TACTCTCCAC TAGACTCATT GTCCACCTCT TCACCAAAAC AGTCTGTTTT CACACCAGTT TCCCGTAATT CAGAGTCTAC GGATCGAAAT AATGGCATCA GAAGAGTGAC AAAGGAATCT TTATCGGATT TCAGTGAAGG AGAAGATACG GACATAACAT CAGAGTTCAA TGATAACGAT TTCGAAGACT TGGATAACAT CTTTGGAAAT GAAGAAAGCG GCATTTACGA TAAAATGAAC AAGATTCTTT CCAACAAAAA GCTGGCATTA CAGAAACAGG CTGATTCAGA AGAAATTGAG CTCAGAAGCC AGTTGGAAAA ACAGCAAGAG CAACAAAGAA ATACGCTCCT GGATGTGAAC GCTACGCTCA GATTAAGGGA TTTCAACAAA ATTCAGATAG ATGCCCCTAA AAACAACTTG ACTTCACAGA ATTTAAACAT ACTTGATCAG ATTGAAAATG AGAAGACTGT CAATTACGAA TATACCAGAG ATGATTTCGA AGAGTTCGAA ACTGGCTTTG AGGATAACTT TGAAAGCAAT CTCAAGAACA GCAGATCTGT GGGACCAAAA GCCACAAATA TGGCTACGGT TCGATCCAAA GCATCTATGC CTATTCTCAG CAGAAATAAC TCTTCTTCAG TAAGGCGATT CAAGTCAAAT ATGGATCTAG TAGGGAGTTA TGGCTTTGAG AACATTGATG AAGAGCTGAT GCATAATGAA CCAGAGTTCA ACTACAACAA CAACGTAATC CGTAAGTTAG ACAGAATACC GTCGTTCTAC AACAGCAACA GCAACAGCCG AAACAGCGAG CTTTCTTCAC GAAAATCACA GCTTCTTACC AAATACAAGG AGCAAGCACT TTCTGAAAAA GAGAAAAAGA GACAAAGTAG ACTTGCGAGG GCTGGACCTG AACAAAGCAA GCATCCCAAA CTAGGATTGG TGAAGTATTT AAATAATAAC TCGGTAATTA AAAACCCTTC TATACCGACA AACAACAAGA TGATGCGGTA CAACTCTGTA CGACAAGAAT GGGAAGGAAA CGAACACGAT CTTCTCCGAT TTGATAGCTT GAGCAAGCCT TCGCTTATAA CGATGAATGA GCTCCAAGAT CCGATTGATG ACGATTCCCT CAAACCGAAA ATAGGAAAGT TAGACGTCAA GGACAGTAGA AATCCTCATA TGGTGTACGA CAACGAGAAC AGAAGATGGA TCAACTTGAG GGAGGAAGAC GACTCCATTT TCAACGACAT TGAAGATCTT GTTGAGGATA ATGGAAATCT TGCCAAAAAG GAATACGTGT TGGCATCACC TCCACGCCAA ATTAAACCAA ACACACTAGC ATTCAAGGGT TCGATCTCTC CTTTTGTAGT ACCAAATATT ACGCATTTGC AATCACCCAT CACCACTCGG GGAATAAGTC AGTTCACCCA GAGGACAGCT 
TCCAGCAATA CAAATTCGTC TGCTACAGAA AGCTCAGAAG AGGAAGTAGA TGAAGCGTTC AAACTTTCCG CCAAGCTTAT AGACAAGTTC TACAAAGAAG AAGTGAAGAT TATCAAGAAA ACCCAGCATT GGTTCAATGC CAACGAGGCT TACGATTACA ATATCAAGAA AATGAATTTC ACAGATACCG AGTACTACTG GGAAATCCGC AAGATGGTTA TGGAGAATGA ATAA
|
Protein sequence | MNPISRKPRA VSRKILTPSP RFASLTQANS TSQQQPLQAV DKQDLLQKYR EVGNGHEAGE DIFGGAEESD FGLGDDFKTL KISSLDLNTI RQNHAEARKA IKFDDELGLQ DQGTIFGKNG YSPLDSLSTS SPKQSVFTPV SRNSESTDRN NGIRRVTKES LSDFSEGEDT DITSEFNDND FEDLDNIFGN EESGIYDKMN KILSNKKLAL QKQADSEEIE LRSQLEKQQE QQRNTLLDVN ATLRLRDFNK IQIDAPKNNL TSQNLNILDQ IENEKTVNYE YTRDDFEEFE TGFEDNFESN LKNSRSVGPK ATNMATVRSK ASMPILSRNN SSSVRRFKSN MDLVGSYGFE NIDEELMHNE PEFNYNNNVI RKLDRIPSFY NSNSNSRNSE LSSRKSQLLT KYKEQALSEK EKKRQSRLAR AGPEQSKHPK LGLVKYLNNN SVIKNPSIPT NNKMMRYNSV RQEWEGNEHD LLRFDSLSKP SLITMNELQD PIDDDSLKPK IGKLDVKDSR NPHMVYDNEN RRWINLREED DSIFNDIEDL VEDNGNLAKK EYVLASPPRQ IKPNTLAFKG SISPFVVPNI THLQSPITTR GISQFTQRTA SSNTNSSATE SSEEEVDEAF KLSAKLIDKF YKEEVKIIKK TQHWFNANEA YDYNIKKMNF TDTEYYWEIR KMVMENE
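The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the unspliced CDS ends in a single stop codon that is not counted in Protein Length. The variable names are hypothetical.

```python
# Values copied from the Gene Information table of this record.
start_bp = 584270
end_bp = 586303
gene_length_bp = 2034
protein_length_aa = 677

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
computed_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS is a whole number of codons,
# with the final codon being a stop that encodes no residue.
codon_count = computed_length // 3
computed_protein_length = codon_count - 1

print("length matches:", computed_length == gene_length_bp)
print("whole codons:", computed_length % 3 == 0)
print("protein length matches:", computed_protein_length == protein_length_aa)
```

Under these assumptions the three checks agree with the listed values, which is consistent with the record reporting the gene as not spliced.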