Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5791 |
Symbol | |
ID | 5320093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 762592 |
End bp | 763785 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640777496 |
Product | cytosine deaminase |
Protein accession | YP_001314428 |
Protein GI | 150377833 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.159101 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.000445304 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACGACC TTTTGCTCCG CAACGTCAGG CCCATGGCAG GCGAAAGCTG CGATATCCTG ATTAGGGACG GAAAGATCGC CGGTTTCGGG CGTTTTGAAG CGGAACCAGG CATGGCCGTG GAAGATGGCG GCAACGCCAT CGCCGCCCCC GGGCTGATCG ATGCGCATAC CCATCTCGAC AAGACGACCT GGGGCATGCC GTGGCATGTC AACAACCGCG CCGCAGTCCT GCGTGAGCGT ATCGATTTCG AACGCGAGCA TCGTCTGGAG ATCGGCATCG ATCCGCACCG CCAGTCGATG CGTCATGCGA TCGGTCTGGC CGCGCATGGC GCAACGCATA TCCGAAGCCA TGTCGATATC GATCCGGTTC ATCGCCTGTC GCTGGTCGAG GGCGTCTGGG AAACGCGCGA GAAGCTCAGG GGCATCATCG ACATCGAAAT CGTCGCGTTT CCCCAATCAG GCCTGATGGT CATGCCCGGC ACGAAGGAGT TGCTCGACGA GGCGCTGCGT CAGGGCTGCG AAGTGCTGGG CGGCATCGAT CCGTGCGGGA TAGACCGCGA TCCGAAGGGC CAGCTCGACA TTCTGTTTGC ACTCGCCACC AAGCATGGCG TTCCGATCGA CATTCACCTG CATGAGACGG GCGATCTCGG CGCCTTCACC ATGGAACTCA TCTTCGAGCG GATCCGCGCC AACGGCATGG AAGGCAAGGT GGCAATCAGC CACGCCTTTG CGCTCGGCAT GAACGACTAT CTGCGCGTCG GCCAGCTGAT CGAGCAGCTC GCTATTCTCG ACGTCGCGAT CCTCACCACC GGCGCGCCTT CGGCCACGGT GCCCTCGATC AAGCGCCTGA AGGAAGCGGC CGTGCGCGTC GGCGGCGGCT GTGACGGTAT CCGTGACACC TGGGGACCAT GGGGCCAGCC GGACATGCTG GACCGCGCCA AGGTTATCGG CATGAAGAAC GGCGTGCGCT CGGATCACGA TCTGGAGCAT TTGCTGCACA TCGTCTCGCA AGGCGGTGCG GATATCATGC GGCTTGAAAA TTACGGCCTT GAAGTCGGCC GCGATGCGGA CTTCACCCTG TTGACCGGCG AGACGCTGGC GCATGCCGTG GTCGATGTCG CCCCGCGTCC GCTGGTCGTC AAAGGGGGTC GCGTCACGGC CCGTCAGGGT GTCGCCGTCG TGGAGATGCC GTAA
|
Protein sequence | MNDLLLRNVR PMAGESCDIL IRDGKIAGFG RFEAEPGMAV EDGGNAIAAP GLIDAHTHLD KTTWGMPWHV NNRAAVLRER IDFEREHRLE IGIDPHRQSM RHAIGLAAHG ATHIRSHVDI DPVHRLSLVE GVWETREKLR GIIDIEIVAF PQSGLMVMPG TKELLDEALR QGCEVLGGID PCGIDRDPKG QLDILFALAT KHGVPIDIHL HETGDLGAFT MELIFERIRA NGMEGKVAIS HAFALGMNDY LRVGQLIEQL AILDVAILTT GAPSATVPSI KRLKEAAVRV GGGCDGIRDT WGPWGQPDML DRAKVIGMKN GVRSDHDLEH LLHIVSQGGA DIMRLENYGL EVGRDADFTL LTGETLAHAV VDVAPRPLVV KGGRVTARQG VAVVEMP
|
| |