Gene PICST_35111 details

Gene Information

Locus tag: PICST_35111
Symbol: OGG1
ID: 4837452
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 1610062
End bp: 1611180
Gene Length: 1119 bp
Protein Length: 336 aa
Translation table: 12
GC content: 40%
IMG OID: 640388767
Product: 8-oxoguanine DNA glycosylase
Protein accession: XP_001382534
Protein GI: 150863900
COG category: [L] Replication, recombination and repair
COG ID: [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
TIGRFAM ID: [TIGR00588] 8-oxoguanine DNA-glycosylase (ogg)
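
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are invented here, and it assumes the gene span runs exactly from the start codon through the stop codon, which the record does not state explicitly.

```python
# Values copied from the Gene Information fields above.
start_bp = 1610062
end_bp = 1611180
protein_length_aa = 336

# Coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding sequence is one codon per residue plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Whatever remains of the span is not translated (for example, intron sequence).
non_coding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, non_coding_bp)
```

Under that assumption the span matches the listed gene length of 1119 bp, and the 108 bp not accounted for by coding sequence is consistent with the gene being marked as spliced.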


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.678319
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGTAG ACATATTATG GAGACAGATA CCTATACGGG AAGCAGAGTT GAGTCTAACT 
AAGGTCCTTC GCTGTGGACA GACGTTTCGC TGGAAGAATA TCAATAATGT ATGGTCTTTC
GCAATCCACG ATCGAGTAGT ACTACTTAAG CAAGATGAAG AGTATCTTCA TTATTCACAT
TTAATGAAAG AGAGCTTGAC TACACAAAAA TCAACTGCAG AATCAGAGAA ACAGACTCTT
GAGTTTGTCA AAGATTACTT CAATCTTTCT GTCAACTTAG TCGATCTCTA CGACCATTGG
TCATCCCAGC ACGAGCCATT CAAGAAATCG AAGCTAACCC CATTTGAGCA ATTCAAAGGT
ATTAGGATTC TTCGGCAGGA TCCTTGGGAA ACTGTCGTAT CATTTATCTG CTCTTCCAAC
AATAATGTCA AAAGAATTTC CAAGATGTGT GACAGTTTGT GTCTGGAGTT CGGAGACTAT
ATCAACGAAC ATGATGGAGT GCAGTATCAT TCGTTTCCAA GTGCTGAAAA ATTAGCATCT
TCAAGTAAGA TAGAGACTCG TTTAAGGGAA TTGGGGTTTG GTTACAGAGC TAAGTACATT
TATCAAACAG CGGTCAAATT TGTAGATAAC AAAGGTTTTC CTGATATCAC CATAGAGAAA
TTGAACAGCC TTAGAAACGA AGAGTATGAG CTCTCCCACA ATTTCTTGTT GCAATTAACG
GGTGTGGGTC CAAAAGTTGC TGATTGTATC TGCCTCATGG CTTTGGACAA ACACGATTGT
GTTCCGGTGG ATACCCATGT GTTTCAGATT GCTGTAAGGG ACTATAAGTT TAAGGGAAAG
AAAGACATGA AGACTATGAA CAAGGTCACT TACGATGCGA TTAGACTCTT CTTTAAGGAC
TTGTTTGGAG AGTATGCTGG TTGGGCGCAA TCGGTGCTAT TTGCGTCTGA TCTTGGTGAC
TTGAACAATG GTACGAACAG AATAGAAGAC ACCGAAGTCA AGAAAGAAGA AGTTGATGTT
AAGAAAGAAA AGGTTCAGGA AAACAGAAGA AAATTGACTA CTACTAGCAC CAACAGAGTC
ACGAAAAGGG CCAAGACTCT CGTAAAAGTA GGAGCCTGA
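
The sequence block above is displayed in space-separated groups of ten bases, so it has to be cleaned before any computation such as GC content. The following is a minimal sketch with hypothetical helper names (`clean_sequence`, `gc_percent`); it is applied here only to the first displayed line, not to the full gene, and how the database itself computes or rounds its GC content field is not known from this record.

```python
def clean_sequence(block: str) -> str:
    """Drop spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence block, exactly as displayed.
first_line = "ATGACGGTAG ACATATTATG GAGACAGATA CCTATACGGG AAGCAGAGTT GAGTCTAACT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the whole gene sequence block through `clean_sequence` and `gc_percent` gives a value that can be compared with the GC content field.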
 
Protein sequence
MTVDILWRQI PIREAELSLT KVLRCGQTFR WKNINNVWSF AIHDRVVLLK QDEEYLHYSH 
LMKESLTTQK STAESEKQTL EFVKDYFNLS VNLVDLYDHW SSQHEPFKKS KLTPFEQFKG
IRILRQDPWE TVVSFICSSN NNVKRISKMC DSLCSEFGDY INEHDGVQYH SFPSAEKLAS
SSKIETRLRE LGFGYRAKYI YQTAVKFVDN KGFPDITIEK LNSLRNEEYE LSHNFLLQLT
GVGPKVADCI CLMALDKHDC VPVDTHVFQI AVRDYKFKGK KDMKTMNKVT YDAIRLFFKD
LFGEYAGWAQ SVLFASDLGD LNNVTKRAKT LVKVGA
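
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below shows the effect on a short fragment, bases 451 to 480 of the gene sequence as displayed, which corresponds to residues 151 to 160 of the protein sequence. The function names are hypothetical, and the codon table is built by hand rather than taken from any library.

```python
BASES = "TCAG"
# Standard-code amino acids for all 64 codons, in TCAG order
# (first base varies slowest, third base fastest).
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg: str = "L") -> dict:
    """Build a codon table; pass ctg="S" for translation table 12."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD))
    table["CTG"] = ctg  # the only sense-codon change made for table 12
    return table


def translate(seq: str, table: dict) -> str:
    """Translate seq in frame 0, ignoring whitespace and any trailing partial codon."""
    seq = "".join(seq.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


# Bases 451-480 of the displayed gene sequence; contains one CTG codon.
fragment = "GACAGTTTGT GTCTGGAGTT CGGAGACTAT"

print(translate(fragment, codon_table("S")))  # table 12
print(translate(fragment, codon_table("L")))  # standard code
```

With CTG read as serine the fragment translates to DSLCSEFGDY, matching the protein sequence above; under the standard code the fifth residue would be leucine instead. Translating the full gene this way would also require removing the intron first, which this sketch does not attempt.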