Gene Information |
Locus tag | Smed_4007 |
Symbol | |
ID | 5318287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 462221 |
End bp | 463519 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640775815 |
Product | amidohydrolase 3 |
Protein accession | YP_001312748 |
Protein GI | 150376152 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
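
The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the stated gene length agrees with that reading) and that the final codon is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 462221
END_BP = 463519


def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length):
    """Codon count minus the terminal stop codon, for an unspliced CDS."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 1299 432
```

Both results match the "Gene Length" and "Protein Length" rows of the record.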
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATACGG TTATTGGAGC CCGTCAAGGC GGCGCCCGCC GCGCACTTCT CTTGCGGGGA TGCCGGATCG CCGGCGAAAT TGCGGCCGAG GCCGGGCGCG ACATCCTCGT TGGAGAGAAT GGCCGCATCG CAAGGATCGG CTCCCGGCTG GACGTGGGTC CGGATGTGGC CGTCGTCGAG GTCCGCGGTG CACTCATCTC ACCGGGATTT GTCGACGTAC ATCAGCATCT GGACAAGACC GGCGTTCTCA GGTTCACGCC GAATCCCTCG GGAACATTGC AGGGCGCGCG GGAGGCATTC GCCGAATATG CCCGCAAGGC GCCGGAAGAG GATGTTACGC GACGGGCTGC GCGGACCATG GCGCGCTGCC TTGGCCGGGG CACCACGGCG ATCCGGAGCC ATATAAATGT CGACAAGGAT GCCGGCTTCA ACGGGATCAA CGCCCTGGCC CGGCTGCGCT CTGAATGGGC CGACCGGCTC ACGCTGCAGA TAGTCGCATT CATGACGCCG CATCCCAACC AGGATCTCGC CTGGCTGGAG AGCAACATCG ATGCCGCCGT TGAACAGGCG GACGCGGTGG GAGGCACACC GGCCGTCGCC GAGGACCCGA TCCGCTATCT CGACATTCTG TTTGCAGCCG CCAAGCGGCA TGGCCGGCCC ATCGACCTGC ACCTCGACGA ACACCTGAAC CCCGAACGGC CGCTTTTCGA TGCGGTGTTC GAACGCGTCC GCAAATTCGG CCTGCAGGGA CGGACCGTCC TCGGGCACGC CTCCGTCCTG AGTGCACTTC CGAGGACGGA ATTCGAGCGC ATCCGCGACC GCATGATCGA CCTCGACATC GCGGTCGTGA CCTTGCCTGC CGCCAACCTC TATTTGCAGG GCCGAAGCCA CGACATGTTA CCGCCCCGAG GCTTGACGCG CGTCGCCGAG CTGATCCGCT CGGGCGTGGC AATCGCAACC GCGTCGGACA ACATTCAGGA TCCGTTCGTG CCGACGGGAT CGGGCGACAT GCTCGAGATC GCGCGCTGGA CGCTGCTCGC CGGTCATCTG CGCGGCGACG AGCTCGCCAC AGCCTATGAC ATGATCACCA AGATTCCGGC ACGCATGATG AATTTGGGCG CTGACTACGG CATTCGCGAG GGCGCCTGGG CGGATCTCGT CATCAGCGAT TGCGAGGATG TGAGCGCGCT CGTCAGCGCA GGTCCGGACT GCATGCAGGT CCTTGCAAAA GGCCGGCCGA TCGCAGCACC CGCATTCCCT GCCATGGCAG CCATATGCGC ATTGGAAAAT GTCCCCTGA
|
Protein sequence | MHTVIGARQG GARRALLLRG CRIAGEIAAE AGRDILVGEN GRIARIGSRL DVGPDVAVVE VRGALISPGF VDVHQHLDKT GVLRFTPNPS GTLQGAREAF AEYARKAPEE DVTRRAARTM ARCLGRGTTA IRSHINVDKD AGFNGINALA RLRSEWADRL TLQIVAFMTP HPNQDLAWLE SNIDAAVEQA DAVGGTPAVA EDPIRYLDIL FAAAKRHGRP IDLHLDEHLN PERPLFDAVF ERVRKFGLQG RTVLGHASVL SALPRTEFER IRDRMIDLDI AVVTLPAANL YLQGRSHDML PPRGLTRVAE LIRSGVAIAT ASDNIQDPFV PTGSGDMLEI ARWTLLAGHL RGDELATAYD MITKIPARMM NLGADYGIRE GAWADLVISD CEDVSALVSA GPDCMQVLAK GRPIAAPAFP AMAAICALEN VP
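
The relationship between the two sequence rows can be illustrated with a small translation routine. This is a minimal sketch, not the pipeline that produced the record: it builds the codon table from the amino-acid string published for NCBI translation table 11 (whose codon assignments are the same as the standard code), ignores the spaces that split the sequence into 10-character groups, and stops at the first stop codon. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The `translate` function name is an assumption made for illustration.

```python
from itertools import product

# Codon order T, C, A, G at each position, as in the NCBI genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is removed,
    and a trailing incomplete codon is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base groups of the gene sequence (two trailing bases are ignored).
print(translate("ATGCATACGG TTATTGGAGC"))  # MHTVIG
# Last nine bases of the gene sequence, ending in the TGA stop codon.
print(translate("GTCCCCTGA"))  # VP
```

The two outputs agree with the start (MHTVIG...) and end (...VP) of the protein sequence row.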