Gene Smed_5333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5333 
Symbol 
ID5319635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp293945 
End bp295243 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content65% 
IMG OID640777106 
Productcytosine deaminase 
Protein accessionYP_001314038 
Protein GI150377443 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGATC TGATCATCCG CAATGCAAAC CTTCCCGACG GGCGCGAGGG CCTGGACATA 
GGCGTTTCCA GCGGCAAGAT CGCCGCCATC GGGAAATCCA TCGTCGTGGC CGCCGGCGAG
GAGATCGACG CGGCCGGCCG GCTGGTGAGC CCGCCCTTCT GCGATCCGCA TTTCCACATG
GATGCGACCC TGTCGCTCGG TTTGCCGCGC ATGAATGTTT CCGGGACGCT GCTCGAGGGC
ATCGCGCTCT GGGGCGAGTT GCGCCCGCTC CTCACGAAGG AAGCCCTGGT CGAGCGGGCC
CTGCGCTATT GCGACCTCGC CGTCACACAA GGCCTCCTTT ATATTCGCAG CCATGTGGAC
ACGTCCGATC CGCGCCTGGT GACGGCCGAG GCGCTGCTCG AGGTGAAGGA AAAGGTCGCC
CCCTATATCG AGTTGCAACT CGTCGCCTTC CCCCAGGATG GCTACTACCG TGCGCCGGAC
GGCGTCTCCT CGCTCGATCG CGCTCTCGAC ATGGGCATCG GCATCGTCGG GGGCATTCCG
CATTTCGAGC GGACGATGGA GGACGGCGCG CGCTCGGTCG AGGCGCTGTG CCGGATCGCC
GCCGACCGCG GCCTGCCGGT CGACATGCAT TGTGACGAGA CCGACGATCC CATGTCGCGC
CACATCGAGA CGCTGGCGGC GCAGACCGTG CGCTTCGGCC TCCAGGGGCG CGTGGCAGGT
TCGCATCTCA CCTCGATGCA TTCGATGGAC AACTATTACG TCTCGAAGCT CATTCCGTTG
ATGGCCGAGG CGGAAATCAA CGTGATCCCC AATCCGCTGA TCAACATCAT GCTGCAGGGG
CGCCACGACA GCTATCCGAA ACGGCGCGGC ATGACGCGGG TGCGGGAGCT GATGGCGGCG
GGCCTCAACG TCTCCTTCGG CCATGACTGC GTCATGGACC CCTGGTACTC GATGGGCTCT
GGCGACATGC TGGAGGTCGG TCACATGGCC ATCCATGTCG CGCAGATGGC CGGCATCGAC
GACAAGCGCA GGATATTCGA CGCGATCACC GTCAATTCGG CAAAGACCAT GGGGCTTGAA
GGCTACGGCC TGGACGTCGG GTGCAAGGCA GATCTGATCG TGCTGCAGGC CGCTGATGTC
ACCGAGGCGT TGCGGCTGAA GCCGAACCGG CTGTTCGTCA TCAAAGCGGG CAAGGTCGTC
GCCAGGACCG CGCCGCGCGT CGGCGAGCTC TTCCTCGCCG GGCGGCCTGC CTCGATCGAC
ACGGGACGCG ACTACGTTCC CCCGGTGCTG CAGCGCTGA
 
Protein sequence
MFDLIIRNAN LPDGREGLDI GVSSGKIAAI GKSIVVAAGE EIDAAGRLVS PPFCDPHFHM 
DATLSLGLPR MNVSGTLLEG IALWGELRPL LTKEALVERA LRYCDLAVTQ GLLYIRSHVD
TSDPRLVTAE ALLEVKEKVA PYIELQLVAF PQDGYYRAPD GVSSLDRALD MGIGIVGGIP
HFERTMEDGA RSVEALCRIA ADRGLPVDMH CDETDDPMSR HIETLAAQTV RFGLQGRVAG
SHLTSMHSMD NYYVSKLIPL MAEAEINVIP NPLINIMLQG RHDSYPKRRG MTRVRELMAA
GLNVSFGHDC VMDPWYSMGS GDMLEVGHMA IHVAQMAGID DKRRIFDAIT VNSAKTMGLE
GYGLDVGCKA DLIVLQAADV TEALRLKPNR LFVIKAGKVV ARTAPRVGEL FLAGRPASID
TGRDYVPPVL QR