Gene Mchl_2042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_2042 
Symbol 
ID7118742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp2139760 
End bp2141127 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content75% 
IMG OID643524792 
Productcytosine deaminase-like protein 
Protein accessionYP_002420817 
Protein GI218530001 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.670818 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAGC CGCCGTTCCT GCCCGAGGCC CACGCCTACC GCCTGCGCGA CGCGCGCGTG 
CCCGGAGCTT TCCTGACGGG CGGCGTGCCG GCCGGCGCGA CACTCGACGG GGACGGCTGC
GCGCTCCTCG ACATCGTCGT CGCGGACGGC GTCGTCGCCC GTCTCCTGCC GGCCGGGGGG
ACGCCGGATA CGCTGCCCGC CGCCGATCTC GCCGGGCGGC AGGTCTGGCC CCGCCTCGTC
GATGCGCATA CCCACCTCGA CAAGGGCCAC ACCGTCGCTC GCACGCCCAA TCCGGCCGGC
GACTTTCCCG GCGCCCGCGA CGCCACCACG GCGGACCGTA CCCGCCACTG GGACGCGGAG
GATCTGCGCC GCCGCATGAC CTTCGGCCTC GCCTGCGCCT TCGCGCACGG CACGGGCGCG
ATCCGCACGC ATCTCGACAG TCAGGAGAGC GGGGAGGGCG CGCCGCGCGA CCAGGCCGCG
ACGACCTGGG CGGTGTTCCG GGAGATGCGG GCGGCCTGGG CCGGGCGGAT CGCGCTGCAG
GGCGTCGGCC TGACCCCGAT CGACGCCTAC GCCACCGATT ACGGGCGCCG CCTCGCCGAC
CTGATCGCCG ATTCGGACGG GCTGATCGGC GGCGTGACCC GGCCGACCGG CGGCCTGCAT
GGCGGGGCGC TGGCCGAGAT CGACGCCCTG CTCGACCGCC TGTTCGGCCT CGCCCGCGCC
CGCAACCTCG ATGTGGACCT CCATGTCGAC GAGACCGGCG ATCCCGCGGC GGCCTCCCTC
GACGCGGTCG CGCGGGCGAC CCTGCGCCAC GGCTACGAGG GCCGCGTCAC CTGCGGGCAT
TGCTGCAGCC TCGCGCTCCA GCCCGAGGCG CAGGCTTCGG CCACGATCGC GCGGGTCGCG
CAGGCGGGAA TCCGCATCGT GACGCTGCCG ACCGTCAACA TGTACCTCCA GGACCGGCAG
CGGGGGCGCA CTCCGCGCTG GCGCGGCGTC GCCCCAGTTC AGGAACTGAT GGCAGCCGGC
GTGCCCGTGA TGGTCGCGGG CGACAATTGC CGCGACGCGT TCTACGCCTA CGGCGACCAC
GACATGCTCG ACACATTCCG GGCGTCGGTG CGGATCCTCC ATCTCGATCA TCCACTGGCC
GGCGCGCCCG CGCTCGCCGG GCCGGTGCCG GGGGCGATGA TGGGGCTCCC CCATGCCGGC
ACGATCCGCG AGGGCGCCCC CGCCGACCTG ATGCTTCTGG CCGCGCGCAG CCTCAACGAG
GTCGTCGCGC GGCCGCATGC GGACCGAATC ATCGTGGTCG CGGGCAGGCC CGTCGCGACG
CGGCTGCCGC CCTACGAGGC CCTGACCGGC GAGGCCGCCC CCTGGTAG
 
Protein sequence
MSEPPFLPEA HAYRLRDARV PGAFLTGGVP AGATLDGDGC ALLDIVVADG VVARLLPAGG 
TPDTLPAADL AGRQVWPRLV DAHTHLDKGH TVARTPNPAG DFPGARDATT ADRTRHWDAE
DLRRRMTFGL ACAFAHGTGA IRTHLDSQES GEGAPRDQAA TTWAVFREMR AAWAGRIALQ
GVGLTPIDAY ATDYGRRLAD LIADSDGLIG GVTRPTGGLH GGALAEIDAL LDRLFGLARA
RNLDVDLHVD ETGDPAAASL DAVARATLRH GYEGRVTCGH CCSLALQPEA QASATIARVA
QAGIRIVTLP TVNMYLQDRQ RGRTPRWRGV APVQELMAAG VPVMVAGDNC RDAFYAYGDH
DMLDTFRASV RILHLDHPLA GAPALAGPVP GAMMGLPHAG TIREGAPADL MLLAARSLNE
VVARPHADRI IVVAGRPVAT RLPPYEALTG EAAPW