Gene Information |
Locus tag | PICST_58162 |
Symbol | |
ID | 4838623 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 978361 |
End bp | 980466 |
Gene Length | 2106 bp |
Protein Length | 656 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640389938 |
Product | predicted protein |
Protein accession | XP_001384142 |
Protein GI | 126135236 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0479106 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.311392 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACT ACTACGAACC TACACTTTTG TTCAGGCAAA ACGCCTTAAG AAGATACTGT CCAAGTCTTT CCCCAATTTC GAGTGTAGAA ACCTTGAGCT CCAGTATCTT GACAAATGAA AACATTAAGG GAAATGTATC TTCTTGGATG TTCAATAGTG CCAATCCTAA AGATACAGAG GTGTTGAACC AGTCCTGTTC CTTGAACAGA ATGAATATCA AGTCTAACTA CTGGAAGATC CCAGACACTA ATATGAACCT TACGGCAATG GCCATCACAG ACACACATAC CGACAACCCG TTATTTTCTG TGTCGAGTGC CAACAATGAG TCCAACTTGT TCATCTATGA ATTGGATCTT CTTGGCAATT ATTTGACCCA CCACAACACA ATCAGTCTTC CAAATATCAA CGGAATGAAA TGGTTACCAA ATAGCAATAG GCATTTGGTC ACTGGCAATA GCAAAGGCTA TGCTCATTTA GTTTCCATCC CTGAAGTAAA CAGAACGGGC AACGAAGATT CTGAAGAACA ATCGGCCGAG ATCTGCAAGA GATTCAACCA CAGAAAGCAC ATCAAGAGCA AACAAGACAT CAACAAACAT AGTACAATTT CCAAACTCGA TTTCATGAAT AACGACAACA GCAGCCTCCT TTCCATTTAC AACAACAACC TCTTCTACTG GGACATGAAT GATGCCGAAG CCCAAAGAAG ACCTACTCCA ATATCCATAT CGTCCATATC TGGTCTTGCC AATTTCGACC CATTACCTAC TCACAATGCC AACTTGGTAG GAATTTGCGG TAAGTTTGGT GTCTCTTTGT TTGACTTGAG ACAGCCCAAG TTCACTGTTC CTCCTTCCAT TTTGGAGTAT GCATCCAAGA AGAAATTAGG TGCAAACCAA ATGAGATGGA ATCCCAACAA TGAAAATGTT TTTGCAGCAG CTCACAGAGA TGGAGTTGTA CGGTTGTGGG ATATCAGAAA GCAAGACAAC TTTGCCAATT TGAGTGGACA CACCGATAAG ATCAGCAGTT TGGAGTGGAA CGATGGTGAT TTGTTCAGTG GATCCAGAGA CGGTAATATA GTGCATTGGG ACTTGACCAG TGATTTAAGT GCCAACAACC AATTCATGAA CTGTGGATTG AAAGAAGGCT TGGATAGCGT TCATTTCAAC CCACATATGA ACAGGTTGGA GAGAGCCATC AACGAAAGAC AATGTGGTAC AGTTTTGCCA GCTTCTAACA CCAACATCAT AAGCATGTGT TCTGTAACCG GCAGTGACAA TTCTAAAGAC GACATGAAAG TGCTATCTAT CGATGGTAGT TCGTTCTTTG GTGTGCACTC CAAGATATTC GATGCTGTGA ATATTTCCAT GACTTCAGAC AAGTTGTACT ATACTGAATC TGATATTCAA TTGATGATGA AGAGCGAGAA TTCCAACAAT ACATTAGTAG GCTCTACCGA TAGCATCAAC GAGCAAGTAA CTGCTCCTCT TGCCATTACT AGAAAGTCTA CTTTGAAGGA TTTCGCTCAA GCCGCCGATG CTGCCAGACC ATCCAACCTT TCCAAAGACA CTTTGTTGGG ATCAGTTGAA GATTTGAAAT TGGCTCCTGA GCCTATTGTG GTGGATGATG ATGATTTGAA AATCACCAAG GAGATTGAAG TTATCGATGT AGATGCTGAA GCTGGAGAAG CGCAAGAAAT AATAGCAGAG GACGATTTAG AAGATTATAA CGATTTCACA TTTGCTCCAC CTTCGTTCAT TCCTATACAG AATGGCAATG TGTCTACGTG CTCTGTCGAC 
TACAGTGAAA CAAGTTCACG TAATGCTAAA GAGATGTTTA ACGACTCCAC TGATACTCTT ACTACAGACC CTACCGAGCA CGAGCTTGAT AGTGAAGAAG ACAGCGGGAT CTCTTCCGTC GAATCGTCGC CTTTAAAGAG GGAAGCTTCA TTCAAGTTCC AGTTATTGGA TTCTCTTGAT TTCGAGGAGA AGAAGTTGCC TCGTGACGAT TCGTTTAATA CTGAAATGTT TAATGACTTA AGAATGGCCA GGCAGGCATC TGTCAGAACC ATCGGGACAC ACTATCGCAA TGTTTACAAT GGTTAG
|
Protein sequence | MTDYYEPTLL FRQNALRRYC PSLSPISSVE TLSSSILTNE NIKGNVSSWM FNSANPKDTE VLNQSCSLNR MNIKSNYWKI PDTNMNLTAM AITDTHTDNP LFSVSSANNE SNLFIYELDL LGNYLTHHNT ISLPNINGMK WLPNSNRHLV TGNSKGYAHL VSIPEVNRTG NEDSEEQSAE ICKRFNHRKH IKSKQDINKH STISKLDFMN NDNSSLLSIY NNNLFYWDMN DAEAQRRPTP ISISSISGLA NFDPLPTHNA NLVGICGKFG VSLFDLRQPK FTVPPSILEY ASKKKLGANQ MRWNPNNENV FAAAHRDGVV RLWDIRKQDN FANLSGHTDK ISSLEWNDGD LFSGSRDGNI VHWDLTSDLS ANNQFMNCGL KEGLDSVHFN PHMNRLERAI NERQCGTVLP ASNTNIISMC SVTGSDNSKD DMKVLSIDGS SFFGVHSKIF DAVNISMTSD KLYYTESDIQ LMMKSENSNN TLVGSTDSIN EQVTAPLAIT RKSTLKDFAQ AADAARPSNL SKDTLLGSVE DLKLAPEPIV VDDDDLKITK EIEDDLEDYN DFTFAPPSFI PIQNGNVSTC SHELDSEEDS GISSVESSPL KREASFKFQL LDSLDFEEKK LPRDDSFNTE MFNDLRMARQ ASVRTIGTHY RNVYNG
|
| |
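
The coordinate and length fields in the record above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the source database: the function names (`gene_length`, `min_coding_length`, `gc_fraction`) are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates, that the protein length excludes the stop codon, and that the sequence fields are written in space-separated blocks of ten characters as shown above.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def min_coding_length(protein_aa: int) -> int:
    """Coding bases needed for the protein plus one stop codon (assumption)."""
    return 3 * (protein_aa + 1)


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring the whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(978361, 980466)   # Start bp, End bp from the record
    coding = min_coding_length(656)        # Protein Length from the record
    print(length)           # 2106, matching the Gene Length field
    print(coding)           # 1971
    print(length - coding)  # 135 bases not accounted for by coding sequence
```

Under these assumptions the gene span is 135 bp longer than the coding sequence needed for a 656 aa protein, which would be consistent with the "Is gene spliced | Yes" field. `gc_fraction` can be applied to the Gene sequence field to compare against the reported GC content.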