Gene Smed_4007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4007 
Symbol 
ID5318287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp462221 
End bp463519 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content65% 
IMG OID640775815 
Productamidohydrolase 3 
Protein accessionYP_001312748 
Protein GI150376152 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATACGG TTATTGGAGC CCGTCAAGGC GGCGCCCGCC GCGCACTTCT CTTGCGGGGA 
TGCCGGATCG CCGGCGAAAT TGCGGCCGAG GCCGGGCGCG ACATCCTCGT TGGAGAGAAT
GGCCGCATCG CAAGGATCGG CTCCCGGCTG GACGTGGGTC CGGATGTGGC CGTCGTCGAG
GTCCGCGGTG CACTCATCTC ACCGGGATTT GTCGACGTAC ATCAGCATCT GGACAAGACC
GGCGTTCTCA GGTTCACGCC GAATCCCTCG GGAACATTGC AGGGCGCGCG GGAGGCATTC
GCCGAATATG CCCGCAAGGC GCCGGAAGAG GATGTTACGC GACGGGCTGC GCGGACCATG
GCGCGCTGCC TTGGCCGGGG CACCACGGCG ATCCGGAGCC ATATAAATGT CGACAAGGAT
GCCGGCTTCA ACGGGATCAA CGCCCTGGCC CGGCTGCGCT CTGAATGGGC CGACCGGCTC
ACGCTGCAGA TAGTCGCATT CATGACGCCG CATCCCAACC AGGATCTCGC CTGGCTGGAG
AGCAACATCG ATGCCGCCGT TGAACAGGCG GACGCGGTGG GAGGCACACC GGCCGTCGCC
GAGGACCCGA TCCGCTATCT CGACATTCTG TTTGCAGCCG CCAAGCGGCA TGGCCGGCCC
ATCGACCTGC ACCTCGACGA ACACCTGAAC CCCGAACGGC CGCTTTTCGA TGCGGTGTTC
GAACGCGTCC GCAAATTCGG CCTGCAGGGA CGGACCGTCC TCGGGCACGC CTCCGTCCTG
AGTGCACTTC CGAGGACGGA ATTCGAGCGC ATCCGCGACC GCATGATCGA CCTCGACATC
GCGGTCGTGA CCTTGCCTGC CGCCAACCTC TATTTGCAGG GCCGAAGCCA CGACATGTTA
CCGCCCCGAG GCTTGACGCG CGTCGCCGAG CTGATCCGCT CGGGCGTGGC AATCGCAACC
GCGTCGGACA ACATTCAGGA TCCGTTCGTG CCGACGGGAT CGGGCGACAT GCTCGAGATC
GCGCGCTGGA CGCTGCTCGC CGGTCATCTG CGCGGCGACG AGCTCGCCAC AGCCTATGAC
ATGATCACCA AGATTCCGGC ACGCATGATG AATTTGGGCG CTGACTACGG CATTCGCGAG
GGCGCCTGGG CGGATCTCGT CATCAGCGAT TGCGAGGATG TGAGCGCGCT CGTCAGCGCA
GGTCCGGACT GCATGCAGGT CCTTGCAAAA GGCCGGCCGA TCGCAGCACC CGCATTCCCT
GCCATGGCAG CCATATGCGC ATTGGAAAAT GTCCCCTGA
 
Protein sequence
MHTVIGARQG GARRALLLRG CRIAGEIAAE AGRDILVGEN GRIARIGSRL DVGPDVAVVE 
VRGALISPGF VDVHQHLDKT GVLRFTPNPS GTLQGAREAF AEYARKAPEE DVTRRAARTM
ARCLGRGTTA IRSHINVDKD AGFNGINALA RLRSEWADRL TLQIVAFMTP HPNQDLAWLE
SNIDAAVEQA DAVGGTPAVA EDPIRYLDIL FAAAKRHGRP IDLHLDEHLN PERPLFDAVF
ERVRKFGLQG RTVLGHASVL SALPRTEFER IRDRMIDLDI AVVTLPAANL YLQGRSHDML
PPRGLTRVAE LIRSGVAIAT ASDNIQDPFV PTGSGDMLEI ARWTLLAGHL RGDELATAYD
MITKIPARMM NLGADYGIRE GAWADLVISD CEDVSALVSA GPDCMQVLAK GRPIAAPAFP
AMAAICALEN VP