Gene Smed_3843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3843 
Symbol 
ID5318571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp300408 
End bp301808 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content65% 
IMG OID640775655 
Productamidohydrolase 
Protein accessionYP_001312588 
Protein GI150375992 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.865403 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGATT TGCGCGACAG GTTTCTGGGC AACGGTGATT TCCTGCTTCA CCCCGGCAAG 
GTCCTCTTGC CGGAAGGCCC TCGATCCGGA ATCGGGATAG TGGTCCGCAA GGGGCGCTTC
TCGGAAATCG GTGCCGCAGG ACTTGTCGGC AGCCGGAATC CCGACCTGAG GCCGATAGAG
TTGCCGCATC ATCTTGTGAT GCCAGGCTTC ATCGACACCC ATACCCATCT GACCCAGTCC
CTGGGGAAGT CGCTCGTCTT CGGCGAGCCG TCCGAGATCT TTCGCCGTAT CTGGGTGCCG
CTCGAAGGCA GTCTGGATGA ACGGATGGTC TATCTTTCGG CAAAGCTGGC GGCGCTTGAA
TGCCTGCGCG GCGGCTTTAC CGCCGCAGTC GATGCGGGCA CGCGTTCCGC GGGCCACATG
GACGCGCTGA TACGGGCGGC GCGTGAAACC GGCTTGCGTA GCGTCATAGG ACTTATCTGC
AACGATCTCG GCGGAGCCGC GGTGGTCCCC GACCGCACGA CGATCCTCAG GAATGCGGCC
GGACATCTGG CCGCATTCGA GGGCGATTCG CTCGTACATC CCTCACTCGC CATTTCCATT
CCCGAGGCGG CCAGCGACCA CATGCTGGCT GACGTCTCCA GCATGGCGCG GGAAGCGGGT
GTCATATTCC AGACCCATGT CAACGAACAC CTCGTCGCAG TCGAGCGCTC GCTGGTTGCA
AACGGCCGCC GTCCGCTGGA GCATCTCGCT CATCTCGGCG CACTCGGCCC GCATGTGCTG
ATCGCCCATT CCACGCTGGT GACACCGCAC GAACTGAACC TGTTGCGCCA CAGCGATACG
GCGGTCGCAT ACAATCCGGT GGCGAGCTTG TGGAAAGGCA ATGCCATCGC ACCCGCGCTG
CAAATGGCCG CACTCGGGAT CCGCTTCGGA CTGGGAACCG ACGGCACCCG CGCAGACGGT
TTCCGCCTCA TGGATGCCGC CGAGGGCCTG CAGCGCGCCG GCTTCGGGCT TGCGACGGGC
GACTCTTCCT GTGGAGGCGG CTGGCTCTGG ATCGACCGGG CAACAGCCCA GGCGGCGGAT
GCCGCAGGTC TTGGCTGCGT GACCGGCGCG ATCCGCGAGA AGCTTGCGGC CGATTTCCTC
CTGGTGGATC TCGACCGTCC CGAATTCACG CCCTCCCACG ATCTCATGTG GGAACTCGTG
CGCTACGGCA ACCGCGACCA GATCGACGCC GTCTTCACCG CCGGAATGCT TCGCCTCTGG
CAAGGCTGGC CGGTCCAATG GGATGCACGC GCGCTTCTTG CCGAGGTGCG CGAGGTCACG
GCCGATGCCA TAGCAAGGGC GCCGATCCAG CGCGTACACA AGCCATCGGC GGAGCACCGG
GCGCTGGGGC ATTTCGCATG A
 
Protein sequence
MTDLRDRFLG NGDFLLHPGK VLLPEGPRSG IGIVVRKGRF SEIGAAGLVG SRNPDLRPIE 
LPHHLVMPGF IDTHTHLTQS LGKSLVFGEP SEIFRRIWVP LEGSLDERMV YLSAKLAALE
CLRGGFTAAV DAGTRSAGHM DALIRAARET GLRSVIGLIC NDLGGAAVVP DRTTILRNAA
GHLAAFEGDS LVHPSLAISI PEAASDHMLA DVSSMAREAG VIFQTHVNEH LVAVERSLVA
NGRRPLEHLA HLGALGPHVL IAHSTLVTPH ELNLLRHSDT AVAYNPVASL WKGNAIAPAL
QMAALGIRFG LGTDGTRADG FRLMDAAEGL QRAGFGLATG DSSCGGGWLW IDRATAQAAD
AAGLGCVTGA IREKLAADFL LVDLDRPEFT PSHDLMWELV RYGNRDQIDA VFTAGMLRLW
QGWPVQWDAR ALLAEVREVT ADAIARAPIQ RVHKPSAEHR ALGHFA