Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5333 |
Symbol | |
ID | 5319635 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 293945 |
End bp | 295243 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640777106 |
Product | cytosine deaminase |
Protein accession | YP_001314038 |
Protein GI | 150377443 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGATC TGATCATCCG CAATGCAAAC CTTCCCGACG GGCGCGAGGG CCTGGACATA GGCGTTTCCA GCGGCAAGAT CGCCGCCATC GGGAAATCCA TCGTCGTGGC CGCCGGCGAG GAGATCGACG CGGCCGGCCG GCTGGTGAGC CCGCCCTTCT GCGATCCGCA TTTCCACATG GATGCGACCC TGTCGCTCGG TTTGCCGCGC ATGAATGTTT CCGGGACGCT GCTCGAGGGC ATCGCGCTCT GGGGCGAGTT GCGCCCGCTC CTCACGAAGG AAGCCCTGGT CGAGCGGGCC CTGCGCTATT GCGACCTCGC CGTCACACAA GGCCTCCTTT ATATTCGCAG CCATGTGGAC ACGTCCGATC CGCGCCTGGT GACGGCCGAG GCGCTGCTCG AGGTGAAGGA AAAGGTCGCC CCCTATATCG AGTTGCAACT CGTCGCCTTC CCCCAGGATG GCTACTACCG TGCGCCGGAC GGCGTCTCCT CGCTCGATCG CGCTCTCGAC ATGGGCATCG GCATCGTCGG GGGCATTCCG CATTTCGAGC GGACGATGGA GGACGGCGCG CGCTCGGTCG AGGCGCTGTG CCGGATCGCC GCCGACCGCG GCCTGCCGGT CGACATGCAT TGTGACGAGA CCGACGATCC CATGTCGCGC CACATCGAGA CGCTGGCGGC GCAGACCGTG CGCTTCGGCC TCCAGGGGCG CGTGGCAGGT TCGCATCTCA CCTCGATGCA TTCGATGGAC AACTATTACG TCTCGAAGCT CATTCCGTTG ATGGCCGAGG CGGAAATCAA CGTGATCCCC AATCCGCTGA TCAACATCAT GCTGCAGGGG CGCCACGACA GCTATCCGAA ACGGCGCGGC ATGACGCGGG TGCGGGAGCT GATGGCGGCG GGCCTCAACG TCTCCTTCGG CCATGACTGC GTCATGGACC CCTGGTACTC GATGGGCTCT GGCGACATGC TGGAGGTCGG TCACATGGCC ATCCATGTCG CGCAGATGGC CGGCATCGAC GACAAGCGCA GGATATTCGA CGCGATCACC GTCAATTCGG CAAAGACCAT GGGGCTTGAA GGCTACGGCC TGGACGTCGG GTGCAAGGCA GATCTGATCG TGCTGCAGGC CGCTGATGTC ACCGAGGCGT TGCGGCTGAA GCCGAACCGG CTGTTCGTCA TCAAAGCGGG CAAGGTCGTC GCCAGGACCG CGCCGCGCGT CGGCGAGCTC TTCCTCGCCG GGCGGCCTGC CTCGATCGAC ACGGGACGCG ACTACGTTCC CCCGGTGCTG CAGCGCTGA
|
Protein sequence | MFDLIIRNAN LPDGREGLDI GVSSGKIAAI GKSIVVAAGE EIDAAGRLVS PPFCDPHFHM DATLSLGLPR MNVSGTLLEG IALWGELRPL LTKEALVERA LRYCDLAVTQ GLLYIRSHVD TSDPRLVTAE ALLEVKEKVA PYIELQLVAF PQDGYYRAPD GVSSLDRALD MGIGIVGGIP HFERTMEDGA RSVEALCRIA ADRGLPVDMH CDETDDPMSR HIETLAAQTV RFGLQGRVAG SHLTSMHSMD NYYVSKLIPL MAEAEINVIP NPLINIMLQG RHDSYPKRRG MTRVRELMAA GLNVSFGHDC VMDPWYSMGS GDMLEVGHMA IHVAQMAGID DKRRIFDAIT VNSAKTMGLE GYGLDVGCKA DLIVLQAADV TEALRLKPNR LFVIKAGKVV ARTAPRVGEL FLAGRPASID TGRDYVPPVL QR
|
| |