Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2068 |
Symbol | |
ID | 4810666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2460560 |
End bp | 2461447 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107475 |
Product | 8-oxoguanine DNA glycosylase-like protein |
Protein accession | YP_001038468 |
Protein GI | 125974558 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase |
TIGRFAM ID | [TIGR00588] 8-oxoguanine DNA-glycosylase (ogg) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATACA AGGGATATAA AATTGAGCAA AAGGATAACA AGGTCATTGT TGAGAAAGTG GAGGATTTTA ACCCTGTCCA TGTATTTGAC TGCGGGCAGT GTTTCAGATG GATAAGACAG CCGGACGGAA GTTATACGGG GGTTGCCTGC GGCAGGGCTT TAAACGTAAA GTACCGGGAT GGAGTTTTGG AGTTGTCGAA CACCGGTATA GAGGATTTTA AAAGTATATG GTTTGACTAT TTTGATCTCG GCAGGGACTA TTCGCATATT AAAGAGAAAG TAATGAAAGA TGAGATAATG AGGGAAGCGG TAAAATTCGG CTCAGGGATA AGGCTTCTTA AGCAGAATAT ATGGGAGACC TTGATATCTT TCATTATCTC TGCCAACAAC AGAATTCCCA GGATAATGAA AACCGTGGAT GAAATATCCC GGCTTTATGG CTGTGAAATA GAAATGGACG GTGAAAAGTA TTATGCGTTT CCCTCTGCCA AGCAGCTTTC GCACGCAACT TTGGAAGAAT TGGAACAAAC CGGAGCAGGA TTCAGATGCA AATACATAAT GAATGCGGCA AAAATGGTTA ACGAGGGGAA AATAAACCTT GAGGACGTTT GCTCCATGGA TACCGTTGAG GCGAGGGACT TTCTTATGAG ATTTCAGGGG GTGGGGCCTA AAGTTGCGGA CTGCACACTT TTGTACAGCG GGACAAAATA TGATGTGTTT CCCACCGACG TCTGGGTGAA AAGAGTAATG GAGGAGCTGT ATTTTAAAAG CGAGGCAAGC TTTGGCGAGA TACAGGAATT TGCCCGGGAT TATTTTGGCA AGTACGCGGG GTTTGCCCAG CAATATCTTT TTTATTATGC CCGGGAGAAC AGAATCGGGG CAAAATGA
|
Protein sequence | MEYKGYKIEQ KDNKVIVEKV EDFNPVHVFD CGQCFRWIRQ PDGSYTGVAC GRALNVKYRD GVLELSNTGI EDFKSIWFDY FDLGRDYSHI KEKVMKDEIM REAVKFGSGI RLLKQNIWET LISFIISANN RIPRIMKTVD EISRLYGCEI EMDGEKYYAF PSAKQLSHAT LEELEQTGAG FRCKYIMNAA KMVNEGKINL EDVCSMDTVE ARDFLMRFQG VGPKVADCTL LYSGTKYDVF PTDVWVKRVM EELYFKSEAS FGEIQEFARD YFGKYAGFAQ QYLFYYAREN RIGAK
|
| |