Gene Mchl_2046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_2046 
Symbol 
ID7118746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp2143300 
End bp2144658 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content71% 
IMG OID643524796 
Productguanine deaminase 
Protein accessionYP_002420821 
Protein GI218530005 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02967] guanine deaminase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.177828 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.651599 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGG CACCGGCCCG GATGCGGGCG ATCCGCGGAC AGGCGGTGAG CCTGACCGGC 
AACCCGTTCC TGAGCGAGGG CTGCCTCCAG CACGTGGCCG ACGCCCTGAT CCTGATCGAG
GACGGGCGCA TCACCGCCTT CGGCGACTTC GCCGACCTGG CGGACCGGGT CCCGGCGGGC
GTGGCGGTGA CCGTCTACGA GAACGCCCTG ATCCTGCCCG GCCTGATCGA CACCCACGTC
CATTATCCGC AGCTGCAGAT GATCGCCTCC TACGGCGAGC AGCTGCTGGC CTGGCTCGAG
AAGTACACCT TCCCCGCCGA GTTGCAGTTC GCGGATCAGG CCCATGCCGA GCGGGTGGCC
AAGCTGTTCT TCCGCGAGAT CCTGGGGGCG GGCACCACCA CGGCGGTGGT CTATTGCACG
GTCCATCCCG GCTCGGTCGA AGCGTTCTTC GCCGAATCGG CCCGCTTCAA CACCCGGATG
ATCGCCGGCA AGGTCCTGAT GGACCGCAAC GCCCCGGCCG GGCTCCTCGA CACGGCCCAG
CGCGGCTACG ACGAGAGCAA GGCCCTGATC GACCGCTGGC ACGGGCGCGG GCGCCAGCAT
TACTGCGTCA CCCCGCGCTT TGCCCCCTCC TGCACGCAAG CCCAGCTCGA CGCCGCCGGC
GCGCTGATGC GGGAGCACGA CGACCTCTTC CTCCAGACGC ACCTGTGCGA GAACACCGAC
GAGATCGCCT GGGTGCGCGA GCTGTTCCCC GACCGGGCGA GCTACCTCGA CGTCTACGTG
CAGTCCGGCC TCGTCGGCCC GCGCACGGTG CTCGGCCACG CGATCCACCT GTCGGAGGAG
GATTTCTGCG CCTGCCACGC CAGCGGCGCG GCGATCGCCC ATTGCCCGAC CTCGAACGGC
TTCCTCGGCA GCGGCCTGTT CCGGCTGTTC GACGCCCTCG ATCCGCGCCG GCCGGTGCGG
GTCGGGCTTG GCACCGATGT CGGCGCCGGC ACGACCCTGT CGCTGCTGAA GACGCTCGGC
GAATCCTACA AGGTCGCGGC GCTCCGCGGC ACCCGGCTCG ACGCGGTCCG GGCGTTCTGG
CTCGCGACGC TCGGCGGCGC CGAGGCGCTG CGGCTGGACG ACCGGATCGG ACGGATCGCG
CCGGGACACG ATGCGGATCT CTGCGTGCTC GACCTCGCCG CGACGCCGCT CCTGGGTTTC
CGCACCGGCA CCTGCTCCAG CATCGAGGAG CTGCTGTTCG TGCTGATGAC GCTCGGCGAC
CACCGCACGG TGCGCGCGAC CTGGGTGGCA GGCGAGCCGG TCTACGACAA CCGGCGCGCG
GGCGACCCAC TCGTCTATCC GGAGACGGCG CGGGCCTGA
 
Protein sequence
MTAAPARMRA IRGQAVSLTG NPFLSEGCLQ HVADALILIE DGRITAFGDF ADLADRVPAG 
VAVTVYENAL ILPGLIDTHV HYPQLQMIAS YGEQLLAWLE KYTFPAELQF ADQAHAERVA
KLFFREILGA GTTTAVVYCT VHPGSVEAFF AESARFNTRM IAGKVLMDRN APAGLLDTAQ
RGYDESKALI DRWHGRGRQH YCVTPRFAPS CTQAQLDAAG ALMREHDDLF LQTHLCENTD
EIAWVRELFP DRASYLDVYV QSGLVGPRTV LGHAIHLSEE DFCACHASGA AIAHCPTSNG
FLGSGLFRLF DALDPRRPVR VGLGTDVGAG TTLSLLKTLG ESYKVAALRG TRLDAVRAFW
LATLGGAEAL RLDDRIGRIA PGHDADLCVL DLAATPLLGF RTGTCSSIEE LLFVLMTLGD
HRTVRATWVA GEPVYDNRRA GDPLVYPETA RA