Gene Information |
Locus tag | PICST_34689 |
Symbol | |
ID | 4851933 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 3228746 |
End bp | 3230689 |
Gene Length | 1944 bp |
Protein Length | 574 aa |
Translation table | |
GC content | 40% |
IMG OID | 640393641 |
Product | predicted protein |
Protein accession | XP_001386942 |
Protein GI | 126276063 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000996882 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGTGA CTAGTAACGC CTACTACTCT CCACAATACC TAGATAAGCA GAAGCGATCC ACTCGAAGTC AATACAGTGG GTATTTTCGG GGCTCGCGGG ACAATTCGTA CAATGACTTT ATTAATCAAG CCCAAATTGA TGATTGGAAT GACAGGGTTT CACTGGACAG AATTTCTGGC TACAAACCAG TTTCTGAAGG CAGCTCTAGC AGAAACAGAA ATCGTCAAGC AACAGCAATA GATTCCATTA GAGAACCCGA AGGCTCTGTC AAAAGTTCCA CTCGAGTTTC CAAAGCTCCT GTGCTCAATA GAGCCAATTC TACTCCTCTT TCTGTAGTGC AATCTACAAT TGCGGCAGAC GATTCCATAG AATCTGATCT TGAAGATGAA GAAAGCTATG ATCCAGACGC ATTGGTACAT GAAATCACAC CGGCTCAGCT CATCATACAA CAACTGAATA CAGTAACTCC TCACAGAGTT GAGAGAAGAG GTGCTTACAA ACCTCGAGAA CAAACCCCTG AGATTAAAGG CTATGATGGG TACTACATAA ACCAGAAGAA AAGATATGAA GAGTCTCCCT ATAAGCCCAA ATTGTACACA CACAAGACAT TTCGTGATGT GTTTAACGAT AAGGAGGAAA GTACAGATAA GTACAACCCT ATGGAATTTG TATTTCCAGA AGGAGAGAAG AAGAGCAAAT TTGCCAAGAA CGTACAATTT GTTCTAGGAA AGGACAACTA TGACGAATAT AATTATTACG ACCACAACAA GGGAGAAAAG AAGAAAAAGA GGAAGAAGAA AATCAATGAA CCGACTGAAG TTTTTGTTAA AGAACTTAGC GATGACGACG AAGAAGATTA TACTGAAAAT GGCCCTAAAA TATATTTAAC TGAGGAAGAA CAGCAGAAAG CTAAAAAGAA CAGGAAATTC ACTAAGGTGT TCAAGTCTAA AATGAAGAGA GCTAGAAAGG AATTGGGCAA AGATTTTGTG AACAATGCGA TAAAGCAGCA GGAACTTGAG CTGAGACGAA AGGAAGAGAA ACTGGAAAAG AAAAGAGCTG AAGAAGAAGA CAAACAAATA GCTTTGAAGG AAGAGGCCGA GAGAATACGT GCATTAGAAG AAGAGCAAAG GAGAATAGGT CAAAATCCAG AGTTTCATCC AATTTGGAAT TATATATTGT CGTGGTTGGT ATACGATGCT TCAATATCCA AGACACCAGC TGTTGATTCT CATATTGAAG AACTCGACTC TTATGAAATC CACGAAAAGG CTGACGAAGA ACCTGTGGAG AACAAAGAAC AGCAGGAAAC ATCTAAGTCC AAGAAGTTAA TTATTTCCTC TAAGAATTTT AAGAACATAA AGAAGAACTA CCTCAACTTG GTGCACAAAT GGAACGAACC TGTCTCACAT GTTTTTAACG AGCCTCCTCC TCCATACCCA ACGTCTCGAT CCATAAAAGC AAAAACGCTA AGGTCCTTCG AATCTTCAGC TTTTGATGAA GGTGATGGTG ACTCAAAAGA GTTCGTTATT GAATACGACG ACGATGGAAC TGAAATCACA CAAGAGTTAT ACTACAATCC TGTAACCAAA CAGCTCGAAG CCACACCACC AACGTCATAT TCGTCATTGG ACCCTTCAGC CAAATCTGTA AGCTCCTCTA TGATGGGCTA TGGCATTGAT ACTACAGGAA GCGCTGTAGC CATTATCTCC AACATCAATG CTTTGATCAA GAGCATCAAG ATAATGAAAA TCCTTTTCGC ACCCATCGAT GTAGTCTCAG AATATTTCCC CAATCTCCAG 
ACGATTGTCA TCTTGGTGGA GTTGGTGATT TTTGTGTGGA TCTTGTACGA AGTCAGCCTC TTAATCGATG CCTTATGTAT GATGGTCAAG GCTGTATGTG CGCCCATGAT AGCTATGGGA AGGTTTATGA ACAGAATAAT GTAA
|
Protein sequence | MNVTSNAYYS PQYLDKQKRS TRSQYSGEPE GSVKSSTRVS KAPVLNRANS TPLSVVQSTI AADDSIESDL EDEESYDPDA LVHEITPAQL IIQQLNTVTP HRVERRGAYK PREQTPEIKG YDGYYINQKK RYEESPYKPK LYTHKTFRDV FNDKEESTDK YNPMEFVFPE GEKKSKFAKN VQFVLGKDNY DEYNYYDHNK GEKKKKRKKK INEPTEVFVK ELSDDDEEDY TENGPKIYLT EEEQQKAKKN RKFTKVFKSK MKRARKELGK DFVNNAIKQQ ELELRRKEEK LEKKRAEEED KQIALKEEAE RIRALEEEQR RIGQNPEFHP IWNYILSWLV YDASISKTPA ADEEPVENKE QQETSKSKKL IISSKNFKNI KKNYLNLVHK WNEPVSHVFN EPPPPYPTSR SIKAKTLRSF ESSAFDEGDG DSKEFVIEYD DDGTEITQEL YYNPVTKQLE ATPPTSYSSL DPSAKSVSSS MMGYGIDTTG SAVAIISNIN ALIKSIKIMK ILFAPIDVVS EYFPNLQTIV ILVELVIFVW ILYEVSLLID ALCMMVKAVC APMIAMGRFM NRIM
|
| |
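The length fields in this record can be cross-checked against each other. The sketch below is only an illustration, not part of the source database. It assumes that the start and end coordinates are inclusive, that the coding sequence is three bases per residue plus one stop codon, and that any remaining bases are intronic (the record marks the gene as spliced). The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 3228746
end_bp = 3230689
protein_length_aa = 574

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: coding sequence = 3 bases per residue plus one stop codon.
cds_length_bp = 3 * protein_length_aa + 3

# Assumption: the remainder is intronic, since the gene is marked as spliced.
non_coding_bp = gene_length_bp - cds_length_bp

print(gene_length_bp, cds_length_bp, non_coding_bp)  # prints: 1944 1725 219
```

Under these assumptions the computed span matches the listed gene length of 1944 bp, and roughly 219 bp of the gene sequence would not be represented in the 574 aa protein.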