Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_35111 |
Symbol | OGG1 |
ID | 4837452 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 1610062 |
End bp | 1611180 |
Gene Length | 1119 bp |
Protein Length | 336 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640388767 |
Product | 8-oxoguanine DNA glycosylase |
Protein accession | XP_001382534 |
Protein GI | 150863900 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase |
TIGRFAM ID | [TIGR00588] 8-oxoguanine DNA-glycosylase (ogg) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.678319 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGTAG ACATATTATG GAGACAGATA CCTATACGGG AAGCAGAGTT GAGTCTAACT AAGGTCCTTC GCTGTGGACA GACGTTTCGC TGGAAGAATA TCAATAATGT ATGGTCTTTC GCAATCCACG ATCGAGTAGT ACTACTTAAG CAAGATGAAG AGTATCTTCA TTATTCACAT TTAATGAAAG AGAGCTTGAC TACACAAAAA TCAACTGCAG AATCAGAGAA ACAGACTCTT GAGTTTGTCA AAGATTACTT CAATCTTTCT GTCAACTTAG TCGATCTCTA CGACCATTGG TCATCCCAGC ACGAGCCATT CAAGAAATCG AAGCTAACCC CATTTGAGCA ATTCAAAGGT ATTAGGATTC TTCGGCAGGA TCCTTGGGAA ACTGTCGTAT CATTTATCTG CTCTTCCAAC AATAATGTCA AAAGAATTTC CAAGATGTGT GACAGTTTGT GTCTGGAGTT CGGAGACTAT ATCAACGAAC ATGATGGAGT GCAGTATCAT TCGTTTCCAA GTGCTGAAAA ATTAGCATCT TCAAGTAAGA TAGAGACTCG TTTAAGGGAA TTGGGGTTTG GTTACAGAGC TAAGTACATT TATCAAACAG CGGTCAAATT TGTAGATAAC AAAGGTTTTC CTGATATCAC CATAGAGAAA TTGAACAGCC TTAGAAACGA AGAGTATGAG CTCTCCCACA ATTTCTTGTT GCAATTAACG GGTGTGGGTC CAAAAGTTGC TGATTGTATC TGCCTCATGG CTTTGGACAA ACACGATTGT GTTCCGGTGG ATACCCATGT GTTTCAGATT GCTGTAAGGG ACTATAAGTT TAAGGGAAAG AAAGACATGA AGACTATGAA CAAGGTCACT TACGATGCGA TTAGACTCTT CTTTAAGGAC TTGTTTGGAG AGTATGCTGG TTGGGCGCAA TCGGTGCTAT TTGCGTCTGA TCTTGGTGAC TTGAACAATG GTACGAACAG AATAGAAGAC ACCGAAGTCA AGAAAGAAGA AGTTGATGTT AAGAAAGAAA AGGTTCAGGA AAACAGAAGA AAATTGACTA CTACTAGCAC CAACAGAGTC ACGAAAAGGG CCAAGACTCT CGTAAAAGTA GGAGCCTGA
|
Protein sequence | MTVDILWRQI PIREAELSLT KVLRCGQTFR WKNINNVWSF AIHDRVVLLK QDEEYLHYSH LMKESLTTQK STAESEKQTL EFVKDYFNLS VNLVDLYDHW SSQHEPFKKS KLTPFEQFKG IRILRQDPWE TVVSFICSSN NNVKRISKMC DSLCSEFGDY INEHDGVQYH SFPSAEKLAS SSKIETRLRE LGFGYRAKYI YQTAVKFVDN KGFPDITIEK LNSLRNEEYE LSHNFLLQLT GVGPKVADCI CLMALDKHDC VPVDTHVFQI AVRDYKFKGK KDMKTMNKVT YDAIRLFFKD LFGEYAGWAQ SVLFASDLGD LNNVTKRAKT LVKVGA
|
| |