Gene M446_2623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_2623 
Symbol 
ID6134440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp2914677 
End bp2916041 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content73% 
IMG OID641642837 
Productguanine deaminase 
Protein accessionYP_001769496 
Protein GI170740841 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02967] guanine deaminase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.19525 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCCCG AGCCACGGCC CCGCCGCGCC GCCCTGCGCG GGCAAGCGAT CAGCTTCTCG 
GGCAATCCCT TCCTGCAGGC GCCGCGGGAC TGCTTCGTCC ACCACGAGGA CGCGCTGATC
CTGATCGAGG ACGGGCGGAT CGCCGCCTTC GCGCCCTACG ATCCCGCCCT GCTGCCGCCG
GGCGTGACGC CGGTCCACTA CGAGAACGCC CTGATCACGG CGGGCTTCGT CGACGCGCAC
GTGCACTACC CGCAGGTCGG GATGGTGGGC GCCTTCGGCG AGCAGCTGCT CGCCTGGCTC
GAGCGCTACA CCTTCCCGGC CGAGCGCGCC TTCGCGGACC CGGCCCATGC CGAGGCCGCC
GCCAGGATCT TCCTGCGCGA GATCCTGCGG GCCGGCACCA CCACGGCCTC GGTCTACTGC
ACGGTCCATC CCCACTCGGT CGACGCCCTG TTCGCCGAAT CCGAGCGGTT CAACACCCTC
ATGGTGGCCG GCAAGGTGCT GATGGACTCG CACGCGCCGG ACGACCTGCG CGACACGGTC
GAGAGCGGCT ACGAGGACAG CCTCGCGCTG ATCCGCCGCT GGCACGGGCG CGGCCGGCAG
CATTACGGGG TGACGCCGCG CTACGCCGGA AGCTGCAGCG CGGCGCAGCT CGACTCGGCC
GGGGCGCTGC TGCGCGGCCA TGAGGGCCTG TTCCTGCAGA CCCACCTGAG CGAGAGCCCG
GCCGAGGTCG CCTGGGTGCG CGACCTGTTT CCGGCCCGCT CCAGCTACCT CGACATCTAC
GCCCATGCGG GGCTGGTGCG GCCGCGCGCG ATCTTCGGGC ACGGCATCCA CGTCGGCGAG
GAGGAGTTCT GCACCTGCCA CGCGGCCGGC GCGGCGCTCG CCCATTGCCC GACCTCGAAC
CTCTTCCTCG GCAGCGGCCT GTTCCGGCTC TTCGACGCCC TCGATCCCCG CCGCCCGGTG
CGGGTCGCGA TCGGCACCGA TATCGGGGCG GGGACCAGCT TCTCCGCCCT GCGCACGCTC
GGCGAGGCCT ACAAGGTCGC CGCCCTGCGC GGCGACGCCC TCGACGGGCT GAGGGCCTTC
CACCTCGCGA CGCTTGGGGG CGCGGCCGCG CTCCACCTCG ACGACCGGAT CGGCCGGATC
GCGCCGGGCT ACGACGCCGA TCTCTGCGTG CTCGGCCTCG ACGCGACGCC GTTCCTGGCC
TTCCGCACCG CCCGCTGCGA GCGCGTCGAG GACCTGATGT TCGTCCTGAT GACGCTCGGC
GACGAGCGCG CGGTGCGGGC GACCTGGGTG GCGGGCGAAT GCGTCTACGA CGCCGCCCGC
CGGCCGGACC CGTTCCGCTA CGTGGACGGC GCGGCGGCCG GATGA
 
Protein sequence
MPPEPRPRRA ALRGQAISFS GNPFLQAPRD CFVHHEDALI LIEDGRIAAF APYDPALLPP 
GVTPVHYENA LITAGFVDAH VHYPQVGMVG AFGEQLLAWL ERYTFPAERA FADPAHAEAA
ARIFLREILR AGTTTASVYC TVHPHSVDAL FAESERFNTL MVAGKVLMDS HAPDDLRDTV
ESGYEDSLAL IRRWHGRGRQ HYGVTPRYAG SCSAAQLDSA GALLRGHEGL FLQTHLSESP
AEVAWVRDLF PARSSYLDIY AHAGLVRPRA IFGHGIHVGE EEFCTCHAAG AALAHCPTSN
LFLGSGLFRL FDALDPRRPV RVAIGTDIGA GTSFSALRTL GEAYKVAALR GDALDGLRAF
HLATLGGAAA LHLDDRIGRI APGYDADLCV LGLDATPFLA FRTARCERVE DLMFVLMTLG
DERAVRATWV AGECVYDAAR RPDPFRYVDG AAAG