Gene M446_3603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3603 
Symbol 
ID6132913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4018008 
End bp4019306 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content75% 
IMG OID641643770 
Productamidohydrolase 3 
Protein accessionYP_001770418 
Protein GI170741763 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.893721 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0414709 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTTCG ATCACCTGTT CACCAACGCC CGCCTGCCCG GGCGCGACGG CCTCGTGGAC 
CTCGCGGTGC GGGACGGGCG CTTCGCGGCC GTCGAGCCCG GGCTGCCGCC GAACGGGCCG
AGCGAGGATC TCGGCGGGCG GCTGGTGATC CCGGGCTTCG TCGAGACGCA TCTCCACCTC
GACAAGGCCT GCCTCCTCGG GCGCTGCGAT TGCGGCGCCG GCAGCGTCGG GGAGGCGATC
GCGGCCGTCA CGGCGGCCAA GCGCGGCTTC ACCGAGGCGG ACGTCTACGA GCGGGCGCGC
CGGGTGCTGG AGCGCGCCGT CGCGCAGGGC ACCACCCGCA TCCGCACCCA CGTGGAGGTG
GATCCCCGCA TCGGGCTGAC GAGCTTCCGG GCGCTGAGGC GCCTCAAGGC CGACTACGCC
TGGGCGGTCG ATCTCGAACT CTGCGCCTTC CCGCAGGAGG GACTGCTCGA CGATCCGGGC
TGCGAGGACG TGCTGGTCGC GGCCCTTGAG GAGGGGGCGG ACCTCGTCGG CGGCGTCCCC
TACATCGACC GGGACGCGGA CGGCCACGTC GCGCGGATCT TCGCGCTCGC CCGGCGCTTC
GACGTCGACA TCGACTTCCA CCTCGACTTC GACCTCGACC CGACCTGGCT GCGCCTCGAC
GAGGTCTGCC GCTGGGCCGA CCGGACCGGC TGGGGCGGGC GCGTCGCCAT CGGCCACGCG
ACCAAGCTCT CGGCCCTGCC ACCGGAGGCC TTCGGACGGG CGGCGCGGCG CCTCGCCGGG
GCGGGCGTCG CCGTCACGGT CCTGCCCGCG ACCGACCTGT TCCTGATGGG GCGCGAGGCC
GCCTGCAACG TGCCGCGCGG CGTGGCGGCG GCGCACCGGC TGGCGCGGGC CGGCGTCACC
TGCTCGATCG CCACCAACAA CGTCCTCAAC CCGTTCACGC CCTACGGCGA CGCCTCGCTG
CTGCGGATGG CGAACCTCTA CGCCAACGTC GCCCAGGTCT CGTCCGAGCC GGACCTCGCC
CTCTGCCTCG ACCTCGTCAC CGACCAGGCG GCGCGGCTGA TGCGGTGTGC CGATTACGGC
CTCGCCCCGG GCCGCCGGGC CGACCTCGTC GTGCTCGACG CGCGCAGCCC CGCCGAGGCG
GTCTGCACCC TGGCCTGGCC GCTCCAGGGC ATGAAGAACG GGCGCCGGAG CTTCGCGCGG
CCGATGCCGC TCCTGTCGCC GCCCGGCGAG GCCGCCCGCG CCCCGGGCCC GCTCCTGGGT
CCGCCCCTGG GTCCTGCCCT GGGTCCTGCC CTGGGCTGA
 
Protein sequence
MDFDHLFTNA RLPGRDGLVD LAVRDGRFAA VEPGLPPNGP SEDLGGRLVI PGFVETHLHL 
DKACLLGRCD CGAGSVGEAI AAVTAAKRGF TEADVYERAR RVLERAVAQG TTRIRTHVEV
DPRIGLTSFR ALRRLKADYA WAVDLELCAF PQEGLLDDPG CEDVLVAALE EGADLVGGVP
YIDRDADGHV ARIFALARRF DVDIDFHLDF DLDPTWLRLD EVCRWADRTG WGGRVAIGHA
TKLSALPPEA FGRAARRLAG AGVAVTVLPA TDLFLMGREA ACNVPRGVAA AHRLARAGVT
CSIATNNVLN PFTPYGDASL LRMANLYANV AQVSSEPDLA LCLDLVTDQA ARLMRCADYG
LAPGRRADLV VLDARSPAEA VCTLAWPLQG MKNGRRSFAR PMPLLSPPGE AARAPGPLLG
PPLGPALGPA LG