Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4625 |
Symbol | |
ID | 5318936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1128002 |
End bp | 1129345 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640776424 |
Product | guanine deaminase |
Protein accession | YP_001313356 |
Protein GI | 150376760 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR02967] guanine deaminase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.430513 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGATC TCTTGATCCG CGGCCGGGTG CTGACCTTTC TCAAAGAGCC CCAGGGCATC GACGATACGG CTTCTTACCG CTTCATCGAG GACGGCGCCG TTTTCGTAAG GAGCGGCAAG ATTGTACACA TCGGCCCTTA CGTTGACGTG GCGAAGCAGG CAGGACCTGA CACAGAGGTC GCCGACCACC GGCCGCATCT GGTCCTGCCC GGCTTAATCG ACACGCACCT GCATTTTCCG CAGACCCAGG CGATCGCCTC CTACGGCGCG CAGCTTCTGG AATGGCTGAA CACCTATATC TTCGTGGAAG AGCAGAAGTT CAAAGCTCCG GAGCATGCCG CCTTCATCGC GGGCCGCTTC ATGGACGAGC TCCTGTCCAA CGGCACGACC ACGGCCGCCG CCTATTGCTC GGTGCATCCG GAAAGCGTCG ACGCCTTCTT CGCGGCGGCC GAAGAGCGCG ACATGCTCAT GATCGGCGGC AAGGTGATGA TGGACCGCAA CGCCCCGGAC GCGCTGCAAG ACACGTCTCA AAAGGGCTAT GACGAGACCA AGGCGCTCAT CGCGAGATGG CATGGCCGCG GCCGCGCGCA TTATGCGATC ACCCCGCGCT TCGCCATCAC CTCGACGCCG GAGCAGATGG AGATGAGCCG CGCCCTCGCC GCAGAGCATC CGGACTGCTA CGTTCAGACG CATCTGTCCG AAAACCGGGA CGAAATCACC TTCGCCACTT CGCTCTATCC GGAAGCCAAG GACTATACCG ATATTTATGC TCGCTATGAG CTTCTCGGCC GCAAGACCCT GCTCGGCCAT TGCATCCACC TGAGCGACCG CGAAATATCG GCGCTCGCCG AGACGGAGGC GGTGGGCGTC TTCTGCCCAA CGTCCAACCT CTTCCTCGGC AGCGGCCTCT TCGATCGCGA CCGCTTCGAC AAGCTTGGAG CACGCCATGC GGTCGCCACC GATGTCGGCG CCGGCACAAG CTTCTCGATG CTCGAAACAA TGGATGAAGC CTACAAGGTA CTGCACCTGC AGGGGCAGCG GCTCTCCCCT CTCAATTCCT TCTATATGAT GACGCTCGGC AACGCGCGCG CGCTCGATCT CGAGGATCGC ATCGGCTCGC TGCACGCCGG CGCGGATGCG GACATCGTTG TTCTCGACAG CCGAGCCAAG CCGGCCATGG AACTCAGGAT GCAGGTCGCC TCGTCGCTTG CCGAAGAGCT GTTCATCGTG CAAACGATGG GGGACGACCG TTCGGTGGCT GAAGTCTACG TGGCCGGTAA GCCGATGAAA TGCCGCCGCC GGCAGACGAA AGCTCAGGCG GACGTCGAAC TGGCAACGGC ATGA
|
Protein sequence | MRDLLIRGRV LTFLKEPQGI DDTASYRFIE DGAVFVRSGK IVHIGPYVDV AKQAGPDTEV ADHRPHLVLP GLIDTHLHFP QTQAIASYGA QLLEWLNTYI FVEEQKFKAP EHAAFIAGRF MDELLSNGTT TAAAYCSVHP ESVDAFFAAA EERDMLMIGG KVMMDRNAPD ALQDTSQKGY DETKALIARW HGRGRAHYAI TPRFAITSTP EQMEMSRALA AEHPDCYVQT HLSENRDEIT FATSLYPEAK DYTDIYARYE LLGRKTLLGH CIHLSDREIS ALAETEAVGV FCPTSNLFLG SGLFDRDRFD KLGARHAVAT DVGAGTSFSM LETMDEAYKV LHLQGQRLSP LNSFYMMTLG NARALDLEDR IGSLHAGADA DIVVLDSRAK PAMELRMQVA SSLAEELFIV QTMGDDRSVA EVYVAGKPMK CRRRQTKAQA DVELATA
|
| |