Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_30161 |
Symbol | |
ID | 4837264 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 2104136 |
End bp | 2105467 |
Gene Length | 1332 bp |
Protein Length | 379 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640388579 |
Product | predicted protein |
Protein accession | XP_001382630 |
Protein GI | 150863969 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0648] Endonuclease IV |
TIGRFAM ID | [TIGR00587] apurinic endonuclease (APN1) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCCTA AAGTGTAAGT TCGTGTTTAA TTTAAGTACT GGGCAGTAGA AATGCCGCAA TTGTGTAACT GATACGAGAA GATGGTCTCC ATATTCTATA TTTCACGCCT CTGCAAGTTT TCTGAAGTGA AAGCTTCTCT ATGAAAATGC ATTGTATTCT TGCGTTCAGT TTTCCTCTTC TCTTGTACAT ACTAACAGGT TTCAAGTAAA ACTGAGGTAG CCAAAGCTGT CAAGGAAACT GTCAAAAAGA CGGTCAGAGC CAAAAAGGTT AAGAAGGTCG AGTTCAAGTA TGAAAGACTG ACTACGAGCA AATTCAAATT TGGAGCTCAC GTAGGTGCAT CTGGTGGAGT TTCAAATTCC GTCATCAATG CCAGAAACCT TGGTGCTAAC AGTTTTGCCT TGTTTTTGAA GTCTCCTAGG AAATGGGTGA GTCCCCCCAT CTCTGCTGAA GAAATTGACA AATTCAAGCT GTTGTGCGAA GAACACGGAT ATGATCCCAG AACCGACGTT TTACCTCATG GATCGTATTT CATTAACTTA GCCAATCCAG ACCCTGAAAA GGAGGAAAAA GCGTTTGACG GATTTCTAGA TGATTTGCAC AGATGTGAAC AATTGAACAT TGGATTATAT AACTTTCATC CAGGCTCGAG TTTAGACGGG GACCATAGGG AAGCTCTTGA GAGGTTGGCC AAAAACATCA ACAGGGCTAT TAAAGAAACA AGCTTTGTCA AAATCGTGAT TGAAAATATG GCTGGCCACG GTAACTTGAT CGGATCCAAT CTACAAGATA TCAGAGACGT CATAGACATC GTAGAGGACA AGCTGAGAGT CGGAGTTTGC GTCGACACTT GCCACACATT TGCTGCTGGG TACGATATTT CGACTGAGGA GAAGTTTGAA GCGTTCTGGA AGGAGTTTGA CAACATTGTG GGTGCCGAAT TCTTGAGTGC CATCCATTTG AATGACTCCA AAGCTCCTTT GGGTGCCAAC AGAGATTTAC ACCAATTCTT GGGACAAGGA TTTTTGGGCT TGGAAGCATT CAGAGTCGTG GCTAACTCGC CCAGATTGCA CAATATTCCC ATCATCTTGG AAACCCCCGT AGGTAACGAC GATAGTTACT ATGGAGAAGA GATTAAACTC TTGGAACTAT TAGAGGATAA AACCATTGAC GACACGGAGT TTGTAGAGAA GAAAGAAAAG CTCTCGAAGT TGGGAGCAAA AGAGAGATCC GAGCATGAGA AGAAGTTCGA GACCAAGAAG GCTAAGACGG CTAAGAAGAC TGCTGGTGAT GATATCGCTT CGTTGGTTAC AAAGAGACCC AAAAGGAAGT AG
|
Protein sequence | MPPKVKTEVA KAVKETVKKT VRAKKVKKVE FKYERSTTSK FKFGAHVGAS GGVSNSVINA RNLGANSFAL FLKSPRKWVS PPISAEEIDK FKSLCEEHGY DPRTDVLPHG SYFINLANPD PEKEEKAFDG FLDDLHRCEQ LNIGLYNFHP GSSLDGDHRE ALERLAKNIN RAIKETSFVK IVIENMAGHG NLIGSNLQDI RDVIDIVEDK SRVGVCVDTC HTFAAGYDIS TEEKFEAFWK EFDNIVGAEF LSAIHLNDSK APLGANRDLH QFLGQGFLGL EAFRVVANSP RLHNIPIILE TPVGNDDSYY GEEIKLLELL EDKTIDDTEF VEKKEKLSKL GAKERSEHEK KFETKKAKTA KKTAGDDIAS LVTKRPKRK
|
| |