Gene Mchl_0404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_0404 
Symbol 
ID7116412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp391759 
End bp392877 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content71% 
IMG OID643523204 
Product2'-deoxycytidine 5'-triphosphate deaminase 
Protein accessionYP_002419268 
Protein GI218528452 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0717] Deoxycytidine deaminase 
TIGRFAM ID[TIGR02274] deoxycytidine triphosphate deaminase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.37704 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGAG ACGTGCCGGC AGCATCCGGT ATCCTGCCGG CGCAGGCCAT CACGGCGCTG 
ACGCAAGCCG GGGCGATCCG CCCCGCGACC GCCTATGCCG CAGACCAGAT CCAGCCCGCC
AGCCTCGACC TGCGGCTCGG CCGCCGCGCC TACCGGGTGC GCACGAGCTT CCTGCCCGGC
AGCGGGCGCT CGGTGGCGAC CTGCGTCGAG GCCTTCGCGC TGCACGAGAT CGACCTGACG
CAAGGCGCCG TGCTGGAGAC AGGCTGCGTC TACATCGCCG AATTGCAGGA AACGCTGGCG
CTCCCCCCCG ATCTCAGCGC CAGCGCCAAC CCCAAAAGCT CGACCGGGCG CATCGACGTG
TTCACCCGCG TCATCACCGA CCGCGCCAGC GCCTTCGACC AGATCGAGGC GGGCTATGCC
GGCCAGCTCT ACGCCGAGAT TTCGCCGCGA ACCTTCCCGG TGCGGGTGCG GACCGGCTCG
CGCCTGTCGC AGATCCGCTT CCGCCAGGGC GACCCGCGGC TGCGGGAGAC GGAACTGGCC
GCGCTCCACG CCAGCGACCC GCTGATCGAT GCCGCAACCC CCTCGCTTCA GGGCGGCGTG
CCGGTCTCGG TCGATCTCGC GGGCTTCGAG GGGCTGATCG GCTACCGGGC CAAGCGCCAT
ACCGGCTTGA TCGACGTGGA CCGGCCGCGC GGTCACCGCA CCCGCGACTT CTGGGAGCCG
CTGCCGGCCG ACGGCAGCCG CACGCTGATC CTCGATCCCG GCCAGTTCTA CATCCTCGCC
TCGAAGGAGG CGGTGCGGGT GCCGGCCGAC TATGCAGCCG AGATGGTGCC GTTCGATCCT
CTCGTCGGCG AGTTCCGCGT CCATTATGCC GGCTTCTTCG ATCCGGGCTT CGGCCTCAGT
GAAGCGGGCG GGGCCGGCGC CCGCGCGGTG CTCGAAGTGC GCTCGCGCGA CGTGCCGTTC
CTTCTGGAAG ACGGCCAGAT CGTCGGCCGC CTCGTCTACG AGCGCATGCT GGAGCGGCCC
GCGACCCTCT ACGGCGCGGG CGCCGGCTCG AACTATCAGG CGCAAGGCCT GAAGCTCTCG
AAGCATTTCG CCAGCGAGCC GGAGCCGCCC GCGGCCTGA
 
Protein sequence
MSGDVPAASG ILPAQAITAL TQAGAIRPAT AYAADQIQPA SLDLRLGRRA YRVRTSFLPG 
SGRSVATCVE AFALHEIDLT QGAVLETGCV YIAELQETLA LPPDLSASAN PKSSTGRIDV
FTRVITDRAS AFDQIEAGYA GQLYAEISPR TFPVRVRTGS RLSQIRFRQG DPRLRETELA
ALHASDPLID AATPSLQGGV PVSVDLAGFE GLIGYRAKRH TGLIDVDRPR GHRTRDFWEP
LPADGSRTLI LDPGQFYILA SKEAVRVPAD YAAEMVPFDP LVGEFRVHYA GFFDPGFGLS
EAGGAGARAV LEVRSRDVPF LLEDGQIVGR LVYERMLERP ATLYGAGAGS NYQAQGLKLS
KHFASEPEPP AA