Gene M446_0933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_0933 
Symbol 
ID6131978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp1055240 
End bp1056538 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content74% 
IMG OID641641242 
Productcytosine deaminase 
Protein accessionYP_001767916 
Protein GI170739261 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.253377 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATCG ACCTCATCCT GCGCCGCGCG ACCCTGCCGG ACGGGCGCCG GGACCACGAC 
ATCGCGGTCG CGGGCGGGCG GATCGTCGGG ATCGCGCCGG GCATCCCGGG CCCGGCCGGC
GAGGAGATCG ACGCGGCCGG CCAGCTCGTG ACGCCGCCCT TCGTCGATTG CCACTTCCAC
ATGGACGCGA CGCTCTCCCT CGGGCACCCG CGCCTCAACC TGTCCGGCAC GCTCCTCGAA
GGCATCGCCC TCTGGGGCGA GCTGAAGCCG CTCCTCACCG AGGAGGCGGT GATCGCGCGG
GCGCTGCGCT ACTGCGACCT CGCGGTGGCG CAGGGGCTGC TCGCGGTGCG CTCCCACGTC
GACGTCTGCG ACGACCGGCT GCTCGCGGTC GACGCCCTGC TCGCCGTCAA GAAGCAGGTC
GCGCCGTACC TCGACCTCCA GCTCGTCGCC TTCCCGCAGG ACGGCTATCT GCGCGCGCCC
GGCGCGGCGC GCAACCTGGA GCGCGCCCTC GACCGCGGCG TCGAGGTGGT GGGCGGCATC
CCGCATTTCG AGCGCACCGC GGAGGAGGGG GCCGAATCCC TGCGCCGCCT GTGCCGGATC
GCGGCGGAGC GGGGGCTGCG CGTCGACATC CACTGCGACG AGACTGACGA TCCCCTGTCG
CGCCACGTCG AGACGCTCGC CGCCGAGACG GTGCGGCACG GGCTGCAGGG GCGGGTGGCG
GGCTCGCACC TCACCTCCAT GCACTCGATG GACAATTACT ACGTCTCGAA GCTGCTGCCC
CTGATGGCGG AGGCGCAGCT GCGGGTGGTG GCGAACCCGC TCATCAACAT CGTGCTCCAG
GGCCGGCACG ACAGCTACCC GAAGCGCCGC GGCCTCACCC GCGTGCCCGA GGCGCTGGCG
GCGGGGCTCA CCGTCGCCTT CGGCCAGGAT TGCTGCATGG ACCCCTGGTA CAGCCTCGGC
GCGGCCGACA TGCTCGACGT CGCCCATATG GGCCTGCACG TGGCGCAGAT GACCGGGCGC
GAAGCGATGC GGGCCTGCTT CGCGGCCGTG ACGACCCAGG CCGCCGCCGT GATGGGGCTT
GAGGATTATG GGCTGCATGT CGGCGCCCAC GCGGATCTGG TGCTGCTGCA GGCCCGCGAC
CCGATCGAGG CGATCCGGCT GCGCGCGACG CGGCTCGCGG TGATCCGCCG CGGCCGGGTG
GTGGCCCGCA CCCCCGCCCG GGCGGCCGCC CTCGCCCTGC CGGGACGGCC CGAGCGGGTC
GATCCGGCGG CCTACGCGCC GGAGGCGGCC GGGGCGTAG
 
Protein sequence
MDIDLILRRA TLPDGRRDHD IAVAGGRIVG IAPGIPGPAG EEIDAAGQLV TPPFVDCHFH 
MDATLSLGHP RLNLSGTLLE GIALWGELKP LLTEEAVIAR ALRYCDLAVA QGLLAVRSHV
DVCDDRLLAV DALLAVKKQV APYLDLQLVA FPQDGYLRAP GAARNLERAL DRGVEVVGGI
PHFERTAEEG AESLRRLCRI AAERGLRVDI HCDETDDPLS RHVETLAAET VRHGLQGRVA
GSHLTSMHSM DNYYVSKLLP LMAEAQLRVV ANPLINIVLQ GRHDSYPKRR GLTRVPEALA
AGLTVAFGQD CCMDPWYSLG AADMLDVAHM GLHVAQMTGR EAMRACFAAV TTQAAAVMGL
EDYGLHVGAH ADLVLLQARD PIEAIRLRAT RLAVIRRGRV VARTPARAAA LALPGRPERV
DPAAYAPEAA GA