Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0468 |
Symbol | |
ID | 7978619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 520936 |
End bp | 522036 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644797445 |
Product | A/G-specific adenine glycosylase |
Protein accession | YP_002948645 |
Protein GI | 239826021 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR00586] mutator mutT protein [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000000736881 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGACAC GAACGTTGCT GGATGGCTTT AACATCGAAC AATTTCAGTT AGATTTAATC GGCTGGTTTG AAAAAGAGCA GCGTGACCTG CCGTGGCGCA AAGATAACGA CCCGTATAAA GTATGGGTGT CGGAAATTAT GCTTCAGCAA ACAAAAGTAG ACACCGTTAT TCCGTATTTT AATAAGTTTA TCGAGCAATT TCCGACACTG GAGGCGCTTG CGGAAGCGGA CGAAGAAGAG GTGCTGAAAG CGTGGGAAGG ACTCGGTTAC TATTCGCGCA TTCGTAATTT GCATGCGGCA GTAAAAGAAG TGAAAGAACA ATATGGCGGG AAAATTCCCG ACAACCGAGA ACAGTTTTCC AAATTAAAAG GGGTTGGGCC ATATACGACA GGAGCGGTAT TAAGCATCGC CTATGGCATT CCGGAACCTG CCGTCGACGG CAATGTGATG CGGGTATTAT CACGGATTTT TCTTGTCTGG GAAGATATCG CCAAAACAGG AACGAGAAAG TTATTCGAAG CGATTGTCCG GCAAATTATT TCACGCGAAA ACCCTTCGTA TTTCAACCAG GCGTTGATGG AGTTAGGAGC GCTTATTTGT ACACCGCGTA ATCCGGCTTG TCTCCTTTGC CCGGTACAAG CGCATTGCCG GGCGCTGCAA GAAGGGGTAC AGACGGAGCT TCCGGTAAAA ACGAAAAAAA CGAGTGTTAA ACAAGTGGCG ATTGCAGCTG CGGTGCTGAA AGATGAACAT GGCAAAGTAC TGATCCATAA GCGGGACAGC GATGGATTGC TTGCGAATTT ATGGGAATTT CCGAACTGCG AAGTAGCCCA TTCACGGGAA AATCCGGAAA GACAGTTGGA AAAGTTTTTG AAGGAAGAAT ACGGGGCAAT AGTCCAGCTT GAGAAACCTT TTGCGGTTTT AGAACATGTG TTTTCTCACT TAGTTTGGAA TATTACCGTT TATGACGGCA AACTAGTTAA TGGTTTTACG GAAACAGAAC AGCTGAAACT TGTTGATGAG CGAGAAATCA GTTTATACGC CTTTCCTGTT TCCCATCAAC GAATTTGGCG AGAGTATAAA GAGAAAAAAA CAGGTGGATA A
|
Protein sequence | MKTRTLLDGF NIEQFQLDLI GWFEKEQRDL PWRKDNDPYK VWVSEIMLQQ TKVDTVIPYF NKFIEQFPTL EALAEADEEE VLKAWEGLGY YSRIRNLHAA VKEVKEQYGG KIPDNREQFS KLKGVGPYTT GAVLSIAYGI PEPAVDGNVM RVLSRIFLVW EDIAKTGTRK LFEAIVRQII SRENPSYFNQ ALMELGALIC TPRNPACLLC PVQAHCRALQ EGVQTELPVK TKKTSVKQVA IAAAVLKDEH GKVLIHKRDS DGLLANLWEF PNCEVAHSRE NPERQLEKFL KEEYGAIVQL EKPFAVLEHV FSHLVWNITV YDGKLVNGFT ETEQLKLVDE REISLYAFPV SHQRIWREYK EKKTGG
|
| |