Gene Information |
Locus tag | PICST_43636 |
Symbol | |
ID | 4837842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 904748 |
End bp | 906706 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 12 |
GC content | 38% |
IMG OID | 640389157 |
Product | predicted protein |
Protein accession | XP_001383805 |
Protein GI | 150864824 |
COG category | [R] General function prediction only |
COG ID | [COG2102] Predicted ATPases of PP-loop superfamily |
TIGRFAM ID | [TIGR00289] conserved hypothetical protein TIGR00289 [TIGR00290] MJ0570-related uncharacterized domain |
| |
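The coordinate and length fields in the Gene Information table can be cross-checked against one another. The sketch below is a minimal illustration, not part of the original record: it assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information table.
START_BP = 904748
END_BP = 906706


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # subtract the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1959
print(protein_length(length_bp))  # 652
```

Under those assumptions the results agree with the reported Gene Length (1959 bp) and Protein Length (652 aa). Because the gene is on the minus strand, the start and end values are positions on the replicon, not the direction of transcription.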
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.26747 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTCG TAGCCTTAAT TTCTGGTGGC AAGGATTCGT TCTTCAATAT TCATCATTGT CTTTCCAATG GACACGAATT GGTAGCACTT GCAAATTTAT ATCCTGAGGA ACAGAACAAG GATGAGATTG ATTCATTCAT GTTTCAGACC GTAGGTCATG ATATCATAGA TAGCTACTCT GAATGCTTAG GTGTTCCAAT TTATAGACAG GCCATCACGG GAGGTTCAAC AAATCAGCTG TTGGAGTATT CCAAGACTGA AAATGACGAG ATCGAAGACT TGTACAAACT ATTGAAGCTG GTCAAAGAAG CTCATCCAGA TGTAGTAGCC GTCAGTTGTG GAGCCATTCT TTCACATTAC CAGAGAACCA GGGTCGAAAA TGTATGTGGA AGATTGAATC TTACTTCTTT GGCTTATCTT TGGCAGCGAG ACCAGTACGA ATTGATGCAA GAGATGATCA GATACCAATT GGATGCTAGA CTTATAAAGG TTGCAGCCAT TGGTCTTAAC TCAACCATGT TGGGAAAGTC AATAACAGAA ATGTTTCCTA CGTTGGTCAA GTTGAATCTG ATGTACGACG TTCATATATG TGGTGAAGGA GGAGAATTTG AAACAATTGT TTTAGATTCT CCTATATTCA AGAAGAAATT GGAGATTACT GACAGAGAAG TGATCGACCA TTCCAGTGAT GATGTTTCAT ACTTGAGAGT AAAGGTCAAA GTTCTTGACA AAGAACATTT CCAATGGACT AAGATCGCTT GTCCTCCTTT GTTAAAAGAA GAATTTTCCA ATATTCTCGG TTCAGCACCA GTTCTTGATA TCCTGACTCT TCAGATTAAG GAAACAGAAA CTCAACCGAA GAGCTTAACT TCCGAAAAGC TTAATTTGGA TGTAGTCATT AAGTCTACGG AGACAAAATT GTATATATCG AATCTAATGT CCGACAAAGA TAGCCCAGAA GAGAATACGG CTGATATTTT CATGAAACTA GCCTCATTAT TGGAAGACGC TAAGCTCAGC TTCAACAACA TTCAGCATAT CACTTTGCTT TTGTCAGATA TGTCTCTATT CGAAAAAGTA AACGGAATAT ATAGCAAGAG CTTCGAAAAC CTATATCTTC CACCATCAAG AATCTGTATT GAGACAGAAC TTCCCTCTTC TATAATGTTA TCTTGCATAG TGTTGAAAGA TCAAAACGCA GACAAGAAGA CAGGTTTACA TATTCGCTCT CGGTCTTACT GGGCACCGCA AAACATTGGC CCATATTCAC AGACAACAGT AGAACAAAGA GAAACTTACA AATTAGCGAC ACTATCTGGA CAAATCCCAT TAGTCCCAAG TAGTATGGTT CTTAATGAGG CTGATATCAC ATATAATTCA TTGTTATCTT TGGAACACCT TCACAAAGTC AAGAGTTTGG TGGGAGTAAA GAAGCTTGCA CAGGTAATAT GCTTTGTCAC TAAAAACAGT TATGTTCCAA CGGCATCTTG GGCATGGGAT GCTTACAACT CTGACTTTGA AAGTTCTTCG AACTCCCCTC AAATGGTAAT TGTCAAAGTC AAATCTTTGC CTAAGGGAGC AAATATAGAA TGGGGTGGGC TCAGTTATGA GAAGTTGGTG GATATGTACC ACGATTCAGA TGATGACGAA GATAATGATC AAGGCAAGGA AGTATTGTTG GAAGATGTCA GTAAATTCGA CACAAGTAGC GTTGTAAATG TCAGTAATAG TGAAAGAATA GCTACTTTGT TCACTGATGA TAGCACTGTT GCAACTGACT TTATCGCTAA GTACAACAAA 
ACTAATTATA TTGAGGTTTT ATCGACTCAA AATGACTTTA TTGACATCTT CTCAATTGTT GGGAAGAATG CTGTTGGTGT ATTGCCAGTT CAAGCAGTGT TTGATAACAA AGGTAAACCC TTCAGATATG CCTTGATTGC GAAATTAGAA AAACAATAG
|
Protein sequence | MKFVALISGG KDSFFNIHHC LSNGHELVAL ANLYPEEQNK DEIDSFMFQT VGHDIIDSYS ECLGVPIYRQ AITGGSTNQS LEYSKTENDE IEDLYKLLKS VKEAHPDVVA VSCGAILSHY QRTRVENVCG RLNLTSLAYL WQRDQYELMQ EMIRYQLDAR LIKVAAIGLN STMLGKSITE MFPTLVKLNS MYDVHICGEG GEFETIVLDS PIFKKKLEIT DREVIDHSSD DVSYLRVKVK VLDKEHFQWT KIACPPLLKE EFSNILGSAP VLDISTLQIK ETETQPKSLT SEKLNLDVVI KSTETKLYIS NLMSDKDSPE ENTADIFMKL ASLLEDAKLS FNNIQHITLL LSDMSLFEKV NGIYSKSFEN LYLPPSRICI ETELPSSIML SCIVLKDQNA DKKTGLHIRS RSYWAPQNIG PYSQTTVEQR ETYKLATLSG QIPLVPSSMV LNEADITYNS LLSLEHLHKV KSLVGVKKLA QVICFVTKNS YVPTASWAWD AYNSDFESSS NSPQMVIVKV KSLPKGANIE WGGLSYEKLV DMYHDSDDDE DNDQGKEVLL EDVSKFDTSS VVNVSNSERI ATLFTDDSTV ATDFIAKYNK TNYIEVLSTQ NDFIDIFSIV GKNAVGVLPV QAVFDNKGKP FRYALIAKLE KQ
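The record lists Translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below shows how the Protein sequence could be derived from the Gene sequence under that table. It is an illustration only: the helper name `translate_table12` is hypothetical, the codon table is written out by hand instead of being taken from a library, and the input is assumed to be the coding strand (the Gene sequence row starts with ATG and ends with TAG, so it appears to be given in coding orientation already).

```python
from itertools import product

BASES = "TCAG"
# NCBI translation table 12 in TCAG codon order. It differs from the
# standard code at a single codon: CTG encodes Ser (S) instead of Leu (L).
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate_table12(dna: str) -> str:
    """Translate coding-strand DNA, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE_12[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the Gene sequence row.
print(translate_table12("ATGAAGTTCG TAGCCTTAAT TTCTGGTGGC"))  # MKFVALISGG
# A CTG codon is translated as S under table 12.
print(translate_table12("CAGCTGTTG"))  # QSL
```

The first call reproduces the opening ten residues of the Protein sequence row. Translating the same CTG codons with the standard code would give L where this record shows S, so the translation table matters when re-deriving the protein.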