Gene Information |
Locus tag | PICST_40872 |
Symbol | |
ID | 4836774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 284076 |
End bp | 285833 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640388089 |
Product | predicted protein |
Protein accession | XP_001382289 |
Protein GI | 150863724 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
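The length fields in the table above can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table.
length = gene_length(284076, 285833)
print(length, protein_length(length))
```

Under these assumptions the result agrees with the Gene Length (1758 bp) and Protein Length (585 aa) rows.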
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.788268 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.345333 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGTCT TGGCTCCTTT AGTCAGTGCC AATCGTGCTC TATTTGATGA AGACACCAAC CTGCAAAGAT GTTCAGGCAT GTATGCCAAA CATGACTGGG GTGGTTCTTA CAAGCCGCAG ATTTCATTGC TGTTACTGCA ATTTGACAAA TACAAATACG ACTCCAAGAA GGACAATGCC GAAGAACACG ACGAAGACAT TTCTGTGAGC TTCATTATTT TCGAATACAA GGACCTCGGC AATATCGGCG TGGAATTGAG TGATGGATCC AGCAAGTACA TCTGCGATGA CTATGCGATT GATACCTTGG GAATCTGTGA AGCTAAGCAG AAGGGTAAAT TCTTGCTCAA TTCCAATAGC ACCAACTCTA CCATTATGAC TTCCCAGTTG ATCCATTTGG GGCCTTCTGA TATACATTAC TCCGTCAACA GGACTGGATA CTACTGTGTC TCGACGTATA ACTTCGATAA AAAGTACAGA GGAGTCATCA ACTTCCAGAA CGCTTTTGGC CAATTGAGTG CCTCTGAAAT TCCCAAATTG CCAGCTTACG GTATTCTCAC TTTGTGCTAT GCTATAGCTC TAGCTTTGTT CGGGTTCCAG TTCTTCAAAA AGAGAAAGGA AAACCAGATT CTCCCATTGC AGAGATACTT GTTGGCCATG TTGGGCTTTT TAGCTTTTGA CACTATGGTT GTGTGGTCCT ACTACGACTT GGTCAACCGA ACCAAGAACC CTTCCAACGC CTTCGTAACT TTCTACATGT TTTTCTTATC GCTAATGAAT GCTGCGAAAA TCACCTTCTC GTTCTTTTTG CTTTTGTGCA TTTCCTTGGG CTACGGTGTA GTCTTGTTGA AGTTGGACAA GAAAACAATG CTTAAATGTA AAATCTTGGG AGTTGTCCAC TTTGTGGCCT CTATAGTGTA TTTGGTAGCC ACTTACTATG GTGGATCCTC AAAGTCAACT ACTTCCGGAG GCAACATTGG TGAAGGAAGC ATGGGTAGCT TTTTGGGATT GTTACCTTTG ATCCCAGTCA CTATCACATT AACAATTTAC TACATTGCGA TCTTGGTGTC TATTAAAAAG ACCACTGCGA ACTTACACAA GCAACGCCAA ATCATTAAAT TGCAGTTGTA CGAAAACTTG TTCAGAATTA TTTTCTTTTC TGTCGTGTTA ACCTTCGGTG GGCTCATTTT GTCTTCCATA GTCTATTTGA GCATGTCCAC TACTGACATG ATCGAAGAAC ACTGGAAGAG TGCATTCTTT ATTTTTGAGT TCTGGCCCAG TGTGATTTTC TTCTTCGTTT TTATGGGTAT TGCCTGGTTG TGGAGACCTA CTGAAACAAG TTACATGTTG GCTATTTCTC AGCAATTATC CACTGGCGAA GGATTAGACG ACGAAGCAGA TGGACAGGGT TACCAACAAG GTGGGCACGA ATTCGAATTG GACGACTTGT CTTTAATCAG TCATAGTGAC GATGAAGCAA GGGGCCCTGG TAACGCTGAA CACGACAGTT TCGAATTGTC CAGAGAAGCT CAACCTTTCC CCAAGTCTAC CGATGGGCCT CCAGGATACA GTGAAGTGAA TGGAAAGGAA AACCCATTCA ATGATCCAGA GAATCCCTTT GAAGAAAATA GCTCAAGAAC CGAAGGTAAT ACATTATTTG AATTGGGAGA AGATGACGAG GACGATTCAC GTTTAGTTGA AGACAATGAT AATGAGGTTG TAGATGACAG ATTAAAGGAT GCTAGACACA AAGAGTAG
|
Protein sequence | MAVLAPLVSA NRALFDEDTN SQRCSGMYAK HDWGGSYKPQ ISLSLSQFDK YKYDSKKDNA EEHDEDISVS FIIFEYKDLG NIGVELSDGS SKYICDDYAI DTLGICEAKQ KGKFLLNSNS TNSTIMTSQL IHLGPSDIHY SVNRTGYYCV STYNFDKKYR GVINFQNAFG QLSASEIPKL PAYGILTLCY AIALALFGFQ FFKKRKENQI LPLQRYLLAM LGFLAFDTMV VWSYYDLVNR TKNPSNAFVT FYMFFLSLMN AAKITFSFFL LLCISLGYGV VLLKLDKKTM LKCKILGVVH FVASIVYLVA TYYGGSSKST TSGGNIGEGS MGSFLGLLPL IPVTITLTIY YIAILVSIKK TTANLHKQRQ IIKLQLYENL FRIIFFSVVL TFGGLILSSI VYLSMSTTDM IEEHWKSAFF IFEFWPSVIF FFVFMGIAWL WRPTETSYML AISQQLSTGE GLDDEADGQG YQQGGHEFEL DDLSLISHSD DEARGPGNAE HDSFELSREA QPFPKSTDGP PGYSEVNGKE NPFNDPENPF EENSSRTEGN TLFELGEDDE DDSRLVEDND NEVVDDRLKD ARHKE
|
| |
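The record lists translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. The following is a minimal sketch of how that affects the first 30 codons of the gene sequence above. The codon table is built from the standard genetic code with the single CTG override. The helper names (`build_table`, `translate`, `PREFIX`) are illustrative and do not come from any database tool.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_table(aas: str) -> dict:
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, aas))


STANDARD = build_table(STANDARD_AAS)
# Table 12 assigns CTG to Ser; all other amino acid assignments are unchanged.
TABLE_12 = {**STANDARD, "CTG": "S"}


def translate(dna: str, table: dict) -> str:
    """Translate in frame 0, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 90 nt of the gene sequence, copied from the record above.
PREFIX = (
    "ATGGCCGTCT TGGCTCCTTT AGTCAGTGCC AATCGTGCTC TATTTGATGA "
    "AGACACCAAC CTGCAAAGAT GTTCAGGCAT GTATGCCAAA"
)

print(translate(PREFIX, TABLE_12))  # MAVLAPLVSANRALFDEDTNSQRCSGMYAK
print(translate(PREFIX, STANDARD))  # MAVLAPLVSANRALFDEDTNLQRCSGMYAK
```

Only the table 12 translation matches the first 30 residues of the listed protein sequence. The two outputs differ at residue 21, where the CTG codon falls.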