Gene M446_6939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_6939 
Symbol 
ID6130715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp7636292 
End bp7637509 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content78% 
IMG OID641647010 
ProductA/G-specific adenine glycosylase 
Protein accessionYP_001773607 
Protein GI170744952 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0201229 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAGCGC CCGCCGCGTC GCACTCAAGG CCGGACGCAG CCGACCTCCT GGTCTGGTAC 
GACCGCCACC GCCGCACGCT GCCCTGGCGC GCCGCGCCGG GCAGCGTGCC CGATCCCTAC
CGGGTCTGGC TGTCGGAGAT CATGCTGCAG CAGACCACCG TCGCTGCGGT GAAGCCCTAT
TTCGCCCGCT TCCTGGAGCG GTTCCCGACC GTGGCGGCGC TCGCCGCGGC CCCCGAGGAG
GCGGTGATGA GCGCCTGGGC GGGGCTCGGC TACTACTCGC GGGCCCGCAA CCTCCACGCC
TGCGCCAAGG CCGTGGCGGC GGCGGGCGGC TTCCCCGACA CGGTGGAGGG GCTGCGCCGG
CTCCCGGGCA TCGGCGCCTA CACGGCCGGG GCCATCGCGG CGATCGCCTT CGACCGGCCG
GCCGCCGCCG TGGACGGCAA CGTCGAGCGG GTGGTGTCGC GGCTCTTCGC GATCGAGACG
CCGCTGCCGG CGGCGCGGGC GGAGATCCGG GCGCTCGCCG AATCGCTGGT GCCGCGGACG
CGGCCGGGCG ATTTCGCGCA GGCGGTGATG GATCTCGGCG CGACGCTGTG CACGCCCAAG
CGGCCGGCCT GCGCCCTCTG CCCCTGGATG GCGCCCTGCC GGGCCCGCGC CGAGGGGCTG
CAGGAGAGCT TCCCGCGCAA GGTCCGGCGG GAGCCCGGCC TCCTGCGCCG CGGCGCCGCC
TTCGTGGCGG TGCGGGCCGG CGACGAGGCG GTGCTGCTGC GCACCCGCCC GCCCGAGGGG
CTGCTCGGCA GCATGGCGGA GCCGCCGACG AGCGCCTGGA CGCCGGATTA CGACCCCGCC
CACGGCCTGC TCGACGCGCC GCTCGATGCC CGCTGGAAGC GGCTGCCCGG GGTGGTGCGC
CACACCTTCA CGCATTTCCC GCTCGAACTG ACGGTCTTCC TCGCCCGCGT CGCGGCCCGC
ACCGAGGCGC CGGAGGGCAT GCGCTTCACC CCCCGCGACG CGCTCGCGGA CGAGCCCCTC
CCCGGGGCGA TGAGGAAGGT GCTGGCCCAT GCCCTGGAGC CGCGGCCCGC GCCGGCGCCG
CCGCCCGCCC CGCCGCCGGA CCTCGCCACG CCGCCCGAGC CCCCGGCCGC CCCGCGGCGG
GGCCCGCTGC CGAAGGTGCT GCCGCGCCGC CCGGCCTCGG CCGCGCCCAT CCGCAAGGAC
GGGCCGCGCA AGCGCTGA
 
Protein sequence
MVAPAASHSR PDAADLLVWY DRHRRTLPWR AAPGSVPDPY RVWLSEIMLQ QTTVAAVKPY 
FARFLERFPT VAALAAAPEE AVMSAWAGLG YYSRARNLHA CAKAVAAAGG FPDTVEGLRR
LPGIGAYTAG AIAAIAFDRP AAAVDGNVER VVSRLFAIET PLPAARAEIR ALAESLVPRT
RPGDFAQAVM DLGATLCTPK RPACALCPWM APCRARAEGL QESFPRKVRR EPGLLRRGAA
FVAVRAGDEA VLLRTRPPEG LLGSMAEPPT SAWTPDYDPA HGLLDAPLDA RWKRLPGVVR
HTFTHFPLEL TVFLARVAAR TEAPEGMRFT PRDALADEPL PGAMRKVLAH ALEPRPAPAP
PPAPPPDLAT PPEPPAAPRR GPLPKVLPRR PASAAPIRKD GPRKR