Gene Information |
Locus tag | PICST_29713 |
Symbol | |
ID | 4836878 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 941537 |
End bp | 943930 |
Gene Length | 2394 bp |
Protein Length | 770 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640388193 |
Product | predicted protein |
Protein accession | XP_001382951 |
Protein GI | 150864217 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.585457 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAAG CTCCCAACGA TGTCATCAAG TTAACCCGAC TGTAAGTACA AACTCTTTTT CTACTCTAGC CTGCTATATT CTATTCTTAT CTTCTGTGCA ACCGTACTAA CAATAATTCT AGCCACACCA ACCCAAAGAA GGGTCACAAG CTCGGCTCAG CTTTGGCAAC ATGGAAGAAG ATCAAATCGG CTGCCGCTGC TACTGGTACT GCTGCTGCTG AAACTGAGTC AACTTCTACT GATGAGCTTC TAAGTGATCT AAGTACGGAA AACTCACTCT TTGAGTACGG TAAGAATGGC GAGATCATCA TCGTCTACCG TGACAGCCGA ATCTTGCAAG TCTGGGACTG GAAAGCGCCT TCCGAGCAAA CGAGAACTGA ATCGGAAACA TTGTACTTGA ACTTCCAGAC CAAGATCGAA GTAATGCCTT CAAAGAACCA GTTATCGATC ATAACGTTCA AGCTACTCGA CCTACCTGTA AATGATCCCA ACCTGATCGC GGTCGTTTTA CTTGTCAAAA CACAAACTGA AACAACAGTT AAGTATTCAT TGATCACTAA GAAAATCAAC TTTTCTTCAT CTTTCCACAA CTCCCAAGCT GTAGAATTGT CCCCGGTTTT TCAAAACCTG ACAGATTTCT CGTTGAAAGC CAGCAAGAAG TTTGTCGTGG TCGCCAACAA CGAAGGATTT ATCTATATAT ATCGCTATAA TGTTGCTGAT TTCAAATTAA CTCCAGCCAA CGCCGACTTC AGCTTGCCGT CTATACGAAA GAGCTCAACT CAACCCGTGA TCAGCAGACA CCATCTTCAG TCTGTAGGTG ATGGAGATAT CCTTTTGCAG ACCAGTTGTA ACCAAGATAA CTGTCCAATA TTTGATATCG AAGACAACTG GCTCGTCTAC TCGCCTACAA AGTTCGAGTA CAAACACTTG AAAGCCATCA GTAGTTCCGC GCCTAGTGTT TCAGCGAACC CCATGGCTCA GGATCCAGTG ATAACTCTTC CGCTGAATAA CGAAACCGTA TCCACTCATA GCAATCTCTA TACTCCGGTG AAATTGCCGG CTTCAGGGCC ATTGTTGAAC AAACTATTGT CAACAATATC AAACACCGCA TTAGATGGAC TTTTCCGGTT ATCTGAAATC AGTTCTTCCA AGGTAAAGTC GTATATGAAC TCAAAGAGTA AAGAAACCTA TAAGGCACCA ACGATTAATT CTATCAGCAA ATCGCTAGGT AAACTCTTGT ATTCTACAGC TTCTACTACA GCTACAACAT TAGAGAATAG CACGAGAAGC TTAAAACCCA ACAACAACCA GATTATCAAA GTCATAGATC TTTCCAACGA CAAGGTTTTA GGCGTCTTCA AGCCTTTGGG AGGTGTTTCC AACGTTTCGC TCTCGCCTTA CGACTTACAT CTTGTACATT CGAACTATAG AGGAGACACT TTGTTCATGT GGGATTTGTA TAGATTGCCC AGTGAAGTGT CCTTGATAGG CAAGTTCACC AGAGGAAAGA CATCAGCTAT CATTGAAGAA ATCTTCTGGT TCAACAACAA CTACGGAGAT CAAATTAATA GTAGTAGTTC TGGAAACTCA AATAACGAGC CTAGCATCAA GGGTATGAAC TCGGGCTTTG GGTGTATTAC CAAGTCTACT GGTTCTGTTC ATTGGTTCAA CATCAACTAC TTGTCTGGTA ATATGAACAA CAATTTCCCC AACAGTCTAA ATAAGGAGAA GGTGCGAAGA AATCTGCAAT CGAGCCAGTT CTTGGACTCG TGGATTTTGT CGTCGTTGAA AGCTCGCAGA 
TTTGTTGCTT TGCCTGAACT TTGTAACTCC ATTGCACCCA CTGCATCCTT CGGGGACTCT GATCCTGGAT GTGTAGCAAA TCGTCTTGCT ATTAACCAGT TGGCAATTAT CGATAGCGAT AACCAGTTGA AGCTCATCTC GACTTTGAAT GGAAGACATC TCTACAAGTA CGAGTTGCCT ATTGCGCCAG TAGCTGAATC GTTCATACCT TTTAATTCTC GACGGGCAGA AGCTAAAGTG GAAGATAGCA AGGATCGCGT CAATCCGTTA TCACAGGCTG AGATCGAAAC GAGTGTCCCT TTCTTGAACT TGATCAACAA CAAGAACATC GAGTTTGCTG TGTTTTCTTT TGAAGGAGAG GAAGGAGATA GAAACAACTT TTTCCACTGC TTCAAGGAGT TTGGCAACGA TGTTCCAGAA AAGGTGATCA AGTTTGAGAA TGGAAACCAT CGGTCAAATA AGATAATCTT TGACTTGAAG AAGGACGAAG ACGTAAAGCC CGAGGACAGA TTGGCATTGC TCGATGGGTT GTATATTGAC CAAGGTGAAG GAAGCATCGA AGCCCAGAGC AGTCCTGTTG TAGATGACCA GTAG
|
Protein sequence | MPEAPNDVIK LTRLHTNPKK GHKLGSALAT WKKIKSAAAA TGTAAAETES TSTDELLSDL STENSLFEYG KNGEIIIVYR DSRILQVWDW KAPSEQTRTE SETLYLNFQT KIEVMPSKNQ LSIITFKLLD LPVNDPNSIA VVLLVKTQTE TTVKYSLITK KINFSSSFHN SQAVELSPVF QNSTDFSLKA SKKFVVVANN EGFIYIYRYN VADFKLTPAN ADFSLPSIRK SSTQPVISRH HLQSVGDGDI LLQTSCNQDN CPIFDIEDNW LVYSPTKFEY KHLKAISSSA PSVSANPMAQ DPVITLPSNN ETVSTHSNLY TPVKLPASGP LLNKLLSTIS NTALDGLFRL SEISSSKVKS YMNSKSKETY KAPTINSISK SLGKLLYSTA STTATTLENS TRSLKPNNNQ IIKVIDLSND KVLGVFKPLG GVSNVSLSPY DLHLVHSNYR GDTLFMWDLY RLPSEVSLIG KFTRGKTSAI IEEIFWFNNN YGDQINSSSS GNSNNEPSIK GMNSGFGCIT KSTGSVHWFN INYLSGNMNN NFPNSLNKEK VRRNSQSSQF LDSWILSSLK ARRFVALPEL CNSIAPTASF GDSDPGCVAN RLAINQLAII DSDNQLKLIS TLNGRHLYKY ELPIAPVAES FIPFNSRRAE AKVEDSKDRV NPLSQAEIET SVPFLNLINN KNIEFAVFSF EGEEGDRNNF FHCFKEFGND VPEKVIKFEN GNHRSNKIIF DLKKDEDVKP EDRLALLDGL YIDQGEGSIE AQSSPVVDDQ
|
| |
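As a quick consistency check on the coordinate and length fields in this record, the following minimal Python sketch recomputes the gene length from Start bp and End bp and compares it with the coding length implied by the protein length. It assumes that the coordinates are 1-based and inclusive and that the gene sequence includes the stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 941537
end_bp = 943930
reported_gene_length_bp = 2394
protein_length_aa = 770

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding sequence is one codon per residue plus one stop codon.
cds_length_bp = (protein_length_aa + 1) * 3

# Bases within the gene span that are not accounted for by the coding sequence.
non_coding_bp = gene_length_bp - cds_length_bp

print(f"gene length from coordinates: {gene_length_bp} bp")
print(f"matches reported length: {gene_length_bp == reported_gene_length_bp}")
print(f"implied coding length: {cds_length_bp} bp")
print(f"non-coding bases within gene span: {non_coding_bp} bp")
```

Under these assumptions the span is longer than the coding length, which would be consistent with the "Is gene spliced | Yes" field, since an intron would account for the difference.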