Gene M446_3497 details


Gene Information

Locus tag: M446_3497
Symbol:
ID: 6129106
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 3902487
End bp: 3904028
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 72%
IMG OID: 641643668
Product: amidohydrolase
Protein accession: YP_001770316
Protein GI: 170741661
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
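
The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the final codon is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Start bp and End bp fields of this record.
start_bp = 3902487
end_bp = 3904028

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1542 513
```

Both results agree with the Gene Length (1542 bp) and Protein Length (513 aa) fields.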


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.377886
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.816035
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGAAC TCATCGTCCA GGCCGGCTGG GTCGTCACCG GGATCCGGGA CCGCCACACC 
CCGATCATCG TCGAGGACGG CGCCGTCCTC TCCCGCGACG GGATCATCGC GGCGGTCGGC
CCCGCGGAGG CGCTGCAGCG GCAGGCGCCG CGGGCGGAGC TGCGCCGCTA TCCCGGCCAC
GTCATGCTGC CGGGCTTCGT CAACAGCCAC CACCATGTCG GGCTCACCCC GCTGCAGCTG
GGCTCGCCCG ACTACGCGCT GGAATTGTGG TTCGCGAGCC GGCTGCCCGC CCGGCGCATC
GACCTCACCC TCGACACGCT CTACTCGGCC TTCGAGATGG TGGCCTCCGG CGTCACCACG
GTGCAGCACA TCCAGGGCTG GATGGCCGGC GGCTACGAGG CGATCCACGC GGCCGCGACG
CAGACGCTGA ACAGCTACCG CAGCCTCGGC ATGCGCGCCT CCTACTGCTA CGCGGTGCGC
GAGCAGAACC GCCTCGTCTA CGAGGCGGAC GAGGCGTTCT GCGCCCGGCT GCCCGCCGAT
CTCGGCGCCG AACTGGCGGC CCACCTGAGG GCGCAGGCGA TGCCCTTCGA CGACTTCCTG
CGCCTGTTCG ACGCGCTGCG GGAGGAGAAC GAGGGCCAGC GCCTCACCCG CATCCAGCTC
GCCCCGGCCA ACCTGCACTG GTGCACCGAC GAGGGGCTGG TGGCCCTGGA CGCGCGCGCC
CGCGCCGCCG GCGCGCCGAT GCACATGCAC CTGCTGGAGA CCGCCTACCA GAAGGAATAT
GCCCGCCGCC GCACCGGCAA GACGGCCCTG CGCCACCTGC ACGATCTCGG CGTGCTCGGG
CCGCACATGA CCCTCGGCCA CGGGGTCTGG CTCACCGCCG AGGACATCGA CATCGTCGCG
CAGACCGGCA CCTGCCTGTG CCACAATTGC TCCTCGAATT TCCGCCTGCG CTCGGGCCTC
GCCCCGCTCA ACACCTGGGA GCGCAAGGGC ATCACGGTCG GGATGGGGCT CGACGAGGCC
GGTCTCAACG ACGACCGCGA CATGCTCCAG GAGCTGCGCC TGGCGCTGCG GGTCCACCGG
GTGCCGGGCA TGGACGACGA CGACGTGCCG ACGCCCGCGC AGATCGTCCG GATGGCGACA
GAATCGGGGG CGATGACCAC GGCCTTCGGC GCCACGATCG GGCGGCTGGA GCCCGGGCGC
TTCTTCGACG CCTCGCTGAT CAACTGGCGC CGGGCCACCT ACCCGTTCCA GGACCCGGAC
ATCCCGATGC TCGACGCGCT GGTGCAGCGG GCCAAGATCG ACAGCGTGGA CGCGGTCTAC
ATCGACGGCG ACCTCGTCTA CGCCGAGGGC CGCTTCACGC GAATCGACCG GGACGCGGTG
CTGGCCGAGA TCGCCGCGAT CCTGACCCGC CCGCGCACGC CCGAGGAGGA GACCCGCCGC
CGCCTCGGCC GGGCGGTCTT CCCGCACGTG AAGGCCTTCT ACGACGGCTA CCTGCCGGAG
ACGCCGCGGA GCCCCTTCTA CGCCGCCTCG TCCGCCGTCT GA
 
Protein sequence
MAELIVQAGW VVTGIRDRHT PIIVEDGAVL SRDGIIAAVG PAEALQRQAP RAELRRYPGH 
VMLPGFVNSH HHVGLTPLQL GSPDYALELW FASRLPARRI DLTLDTLYSA FEMVASGVTT
VQHIQGWMAG GYEAIHAAAT QTLNSYRSLG MRASYCYAVR EQNRLVYEAD EAFCARLPAD
LGAELAAHLR AQAMPFDDFL RLFDALREEN EGQRLTRIQL APANLHWCTD EGLVALDARA
RAAGAPMHMH LLETAYQKEY ARRRTGKTAL RHLHDLGVLG PHMTLGHGVW LTAEDIDIVA
QTGTCLCHNC SSNFRLRSGL APLNTWERKG ITVGMGLDEA GLNDDRDMLQ ELRLALRVHR
VPGMDDDDVP TPAQIVRMAT ESGAMTTAFG ATIGRLEPGR FFDASLINWR RATYPFQDPD
IPMLDALVQR AKIDSVDAVY IDGDLVYAEG RFTRIDRDAV LAEIAAILTR PRTPEEETRR
RLGRAVFPHV KAFYDGYLPE TPRSPFYAAS SAV
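
The sequences above are printed in blocks of 10 characters separated by spaces, so they need cleaning before use. The following is a minimal sketch, not part of the original record, that strips the spacing, computes a GC fraction, and translates DNA to protein. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in which codons may act as start codons).

```python
from itertools import product

# Standard codon table in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from this record.
first_line = "ATGGCAGAAC TCATCGTCCA GGCCGGCTGG GTCGTCACCG GGATCCGGGA CCGCCACACC"
dna = clean(first_line)
print(translate(dna))  # prints: MAELIVQAGWVVTGIRDRHT
```

The printed peptide matches the first 20 residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare with the GC content field.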