Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_89047 |
Symbol | |
ID | 4838555 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 859594 |
End bp | 862559 |
Gene Length | 2966 bp |
Protein Length | 837 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640389870 |
Product | predicted protein |
Protein accession | XP_001384466 |
Protein GI | 150865309 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TATCCAAGAG TCTTGATAGG GTGCTGTTAC TTATAAATAT ATCTCGTATC CAAGAGAATT CTCTCAACAA AAGCAAAACT GAAACTCTCT TGATCTTCTA CTGTCGTTCA GTAAACTACT GAAAAATCTT GGATCTCGTC TCAGATCTTC TCCACAACGA TTCGAAGGCA TTCACGAAAC CCAATACTGA AGATATTCCA CAACTATCAC TGAAATTTTC CAGTTAAGAA TCTAAGCAAC GGTTTATCTT TCACACTATG GAAACATTGG AGATCCATTC CAAGGACTTC TTGGTCAAAT GGGTCCATGC GCCCGACAAC TGTGTCATTG ACTGGCAAGT TAAACCGCTC AAAAAGTCCA TCAACTTCGC CATTTACAAG TTGAATGACG AGACGCCCAG TGAAAACTCA GTAGAGTCTT TCCAGGCTCC ACCTCCTATA GGCCATAACG ATTCTTCAAC CAGTGTAAAT GGTTCTAGCA GAATCAGATC AAGTTCGGTC ACTTCGGTCA ACCAGATCAC CAATAACAAC AACCCCTACA AAACCAAGTC AAGGTCGTCG ACCTTCTCCA CGAATTTGAT CAACTCGGAC TTGACGTTGT TGAAGAACTA CAACAAGTTG ATTGCGGGGG AGTTGGTCCA CGGCAAGTTT GACGTAGCCA AAGACGGAAT GTACGCCTTT GTCTTTGACA ACTCCTTCTC TAAAACGACT GGAAAGAAGG TCTTTTTCAG TAGCAAAATC GTTTCTGACA ATGCTGCTGT TTCCAGAAGA AAATCCGTCG CAAGACTGTC TAGTTTCCGA GGCAATGGTG TTGATCCAGG TGCCCCTTCT ACTATACCTG CGCTGACGAT TCAGTTTCAA ACACCATTTG AAGCTGACGA AAATGAAGTT GAGGGCGATG TTCCTCTTGA TAAGAGAGGC AACATTCTTC GTCCCAAGAA CGGAGAGTTG TTACAAAGTA TCTTGTTGAA AAAGAGAAGA AAGAAGCTTC AGGGCTTCAC CAAGAGGTAC TTTGTTCTTA ACTTCAAGTA CGGCACCTTA TCGTACTTCC GAGTCAAGGA CAATAAGTTG AGAGGTCAAA TGCCAATCAA ACATTCCATC GTCAGTGCCA ATGCGAAATC AAGAGAAATA TTTATAGATT CAGGTATGGA AGTGTGGAAT TTAAAGGCTT TGAACGAGAA AGAATTCAAT GCTTGGGTTG ACGCTTTCAA CCAAATCAAG AAATCCAGCG ACGAAACACC AACTGAAGAA GCCTTTTATG AAGAAGAAGA ACAGGGCATT CTTGCTCTGG AATTGGAATC GATTTCGACA AAGTTGACCC AGCTCAAAAT GACTACAGGG GATAACGCTC CAGCTGCTAA ATTGGTCGAT AGCATCTCGT TAGATATCAA CAGCTTGTTA GCCAGAGTGA TACCAGCCAA CAGAAACTCA CTACATGATC TAACATCAGT TAAATCGTCT TCTGAGTTCT ACGATGCCCA GGAGTACCTC GATGTAATGA GCTCTGGTGT TGTTCTTTTG GACACGCCAA TACCACCATT AGAGAGCAAG GTTATCGGCC AGCTAGAAAC GCCATCTGAT GAAAGCATAG ATGAAAACTT AGATGGCTTG TCGTTGTCGT CGTCTTCTTC AGAAGAAGAT GAAGACATTG AGCCAACCAA ACCTGTTGAA GTAATCCAGA AAGTCAAGCT GGCAGATGAC AGTGACGATA CTTTATATCC CTTACCTCAT GACCCAATCG AGAGAGAATC GGATATTCCC GTGTGTAACC ATACCCCTCC TAGTATATTG GCCTTTGTAC GTAAGAATGT CGGTAAGGAC TTGTCCACTA TTGCCATGCC GGTGACAATG AACGAGCCTA TTACTTTCTT GCAAAAGTAT GCCGAAATAT TTGAGTATAG CGATTTGATC AACAACGCTT TGCAGCCCAG TTTTTCCGAC GAGTCCGGTG AAAAGATCTT GAGAATCGCT GCCTTTGCTC TCAGTTACCT TTCCAGTGCC AGAGTCAAAG AAAGAAACAA CCGTAAACCC TTCAACCCAT TGTTAGGAGA AACGTTTGAG TTGGTCAGAG AAGATCGTGG AATCCGTGTA GTCAGTGAAA AGGTTAGCCA CAGGCCACCT GTATTTGCTT TCTTTGCCGA ATCAGAAAAG TGGGACTTGT CGTTTAATCC AGCTCCTAAC CAGACTTTCT GGGGTAAGAA TGCTGAAATT GTAACGAAGG GTACTGCCAA GTTAACCATT AAGTCAACCG GTGAGGTGTT CACTTGGTCT CATCCAGCTA CTTTGTTAAA GAATATCATC GCTGGTGAAA AGTATTCCGA GCCCTCAGCT CCTATGACGA TCAAGTCATC TTCTGGTTAC AAGGCTGTTG TAGAGTTTGC TAAGGGAGGT TTGTTCAGCG GCAGATCTGA GGATTTGACC ATCAAGGCAT TCAACCCCAA CAAGAAGCAA TTAGCATATA CTGTCAGCGG AAAGTGGACC GAGTCCTTGA CGTTGAAAAC TAACACCACT GAAAAGTTGA TCTGGGAAGT TGGTGACTTG TTGCCTAACT CCAACAAGAA GTTTGGTTTC ACTGCATTTT CTGGTACTTT GAACAAAATC TATGCCATTG AAGATGGTAA ATTGCCACAC ACAGATTCTA GGTTGAGACC AGACATACAT ACCTACGAGA AAGGTGACGT CGACAAGGCT GAAGCACAAA AGGTTGAATT GGAAGAGAAG CAGAGAGAAA GAAGAAAAGA ATTGGAAGAA AGCGGGAAGT CTCATGTACC CAACTTCTTT ACCCAAGTTA GTGGCGACAC TCCTGACTCG GGTGAATGGG CTTACATCAG AGGAAAGAAG AGTTATTGGA ATAGAAGAAA GCATGGCGAT TGGGATGACA TCACCAGACT CTGGTAGTCA TGAATTAGCT TGGAATGTCT AAAAGTTATC ATATTTATGA AGTTGTACAA TTCTATATAT CATAGC
|
Protein sequence | METLEIHSKD FLVKWVHAPD NCVIDWQVKP LKKSINFAIY KLNDETPSEN SVESFQAPPP IGHNDSSTSV NGSSRIRSSS VTSVNQITNN NNPYKTKSRS STFSTNLINS DLTLLKNYNK LIAGELVHGK FDVAKDGMYA FVFDNSFSKT TGKKVFFSSK IVSDNAAVSR RKSVARSSSN ILRPKNGELL QSILLKKRRK KLQGFTKRYF VLNFKYGTLS YFRVKDNKLR GQMPIKHSIV SANAKSREIF IDSGMEVWNL KALNEKEFNA WVDAFNQIKK SSDETPTEEA FYEEEEQGIL ASELESISTK LTQLKMTTGD NAPAAKLVDS ISLDINSLLA RVIPANRNSL HDLTSVKSSS EFYDAQEYLD VMSSGVVLLD TPIPPLESKV IGQLETPSDE SIDENLDGLS LSSSSSEEDE DIEPTKPVEV IQKVKSADDS DDTLYPLPHD PIERESDIPV CNHTPPSILA FVRKNVGKDL STIAMPVTMN EPITFLQKYA EIFEYSDLIN NALQPSFSDE SGEKILRIAA FALSYLSSAR VKERNNRKPF NPLLGETFEL VREDRGIRVV SEKVSHRPPV FAFFAESEKW DLSFNPAPNQ TFWGKNAEIV TKGTAKLTIK STGEVFTWSH PATLLKNIIA GEKYSEPSAP MTIKSSSGYK AVVEFAKGGL FSGRSEDLTI KAFNPNKKQL AYTVSGKWTE SLTLKTNTTE KLIWEVGDLL PNSNKKFGFT AFSGTLNKIY AIEDGKLPHT DSRLRPDIHT YEKGDVDKAE AQKVELEEKQ RERRKELEES GKSHVPNFFT QVSGDTPDSG EWAYIRGKKS YWNRRKHGDW DDITRLW
|
| |