Gene GWCH70_0468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_0468 
Symbol 
ID7978619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp520936 
End bp522036 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content45% 
IMG OID644797445 
ProductA/G-specific adenine glycosylase 
Protein accessionYP_002948645 
Protein GI239826021 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR00586] mutator mutT protein
[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000736881 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGACAC GAACGTTGCT GGATGGCTTT AACATCGAAC AATTTCAGTT AGATTTAATC 
GGCTGGTTTG AAAAAGAGCA GCGTGACCTG CCGTGGCGCA AAGATAACGA CCCGTATAAA
GTATGGGTGT CGGAAATTAT GCTTCAGCAA ACAAAAGTAG ACACCGTTAT TCCGTATTTT
AATAAGTTTA TCGAGCAATT TCCGACACTG GAGGCGCTTG CGGAAGCGGA CGAAGAAGAG
GTGCTGAAAG CGTGGGAAGG ACTCGGTTAC TATTCGCGCA TTCGTAATTT GCATGCGGCA
GTAAAAGAAG TGAAAGAACA ATATGGCGGG AAAATTCCCG ACAACCGAGA ACAGTTTTCC
AAATTAAAAG GGGTTGGGCC ATATACGACA GGAGCGGTAT TAAGCATCGC CTATGGCATT
CCGGAACCTG CCGTCGACGG CAATGTGATG CGGGTATTAT CACGGATTTT TCTTGTCTGG
GAAGATATCG CCAAAACAGG AACGAGAAAG TTATTCGAAG CGATTGTCCG GCAAATTATT
TCACGCGAAA ACCCTTCGTA TTTCAACCAG GCGTTGATGG AGTTAGGAGC GCTTATTTGT
ACACCGCGTA ATCCGGCTTG TCTCCTTTGC CCGGTACAAG CGCATTGCCG GGCGCTGCAA
GAAGGGGTAC AGACGGAGCT TCCGGTAAAA ACGAAAAAAA CGAGTGTTAA ACAAGTGGCG
ATTGCAGCTG CGGTGCTGAA AGATGAACAT GGCAAAGTAC TGATCCATAA GCGGGACAGC
GATGGATTGC TTGCGAATTT ATGGGAATTT CCGAACTGCG AAGTAGCCCA TTCACGGGAA
AATCCGGAAA GACAGTTGGA AAAGTTTTTG AAGGAAGAAT ACGGGGCAAT AGTCCAGCTT
GAGAAACCTT TTGCGGTTTT AGAACATGTG TTTTCTCACT TAGTTTGGAA TATTACCGTT
TATGACGGCA AACTAGTTAA TGGTTTTACG GAAACAGAAC AGCTGAAACT TGTTGATGAG
CGAGAAATCA GTTTATACGC CTTTCCTGTT TCCCATCAAC GAATTTGGCG AGAGTATAAA
GAGAAAAAAA CAGGTGGATA A
 
Protein sequence
MKTRTLLDGF NIEQFQLDLI GWFEKEQRDL PWRKDNDPYK VWVSEIMLQQ TKVDTVIPYF 
NKFIEQFPTL EALAEADEEE VLKAWEGLGY YSRIRNLHAA VKEVKEQYGG KIPDNREQFS
KLKGVGPYTT GAVLSIAYGI PEPAVDGNVM RVLSRIFLVW EDIAKTGTRK LFEAIVRQII
SRENPSYFNQ ALMELGALIC TPRNPACLLC PVQAHCRALQ EGVQTELPVK TKKTSVKQVA
IAAAVLKDEH GKVLIHKRDS DGLLANLWEF PNCEVAHSRE NPERQLEKFL KEEYGAIVQL
EKPFAVLEHV FSHLVWNITV YDGKLVNGFT ETEQLKLVDE REISLYAFPV SHQRIWREYK
EKKTGG