Gene Smed_5791 details

Gene Information

Locus tag: Smed_5791
Symbol:
ID: 5320093
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 762592
End bp: 763785
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 63%
IMG OID: 640777496
Product: cytosine deaminase
Protein accession: YP_001314428
Protein GI: 150377833
COG category: [F] Nucleotide transport and metabolism
              [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
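
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither assumption is stated by the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(762592, 763785)
print(length, protein_length(length))  # 1194 397
```

Under these assumptions the result agrees with the reported gene length of 1194 bp and protein length of 397 aa.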


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.159101
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000445304
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACGACC TTTTGCTCCG CAACGTCAGG CCCATGGCAG GCGAAAGCTG CGATATCCTG 
ATTAGGGACG GAAAGATCGC CGGTTTCGGG CGTTTTGAAG CGGAACCAGG CATGGCCGTG
GAAGATGGCG GCAACGCCAT CGCCGCCCCC GGGCTGATCG ATGCGCATAC CCATCTCGAC
AAGACGACCT GGGGCATGCC GTGGCATGTC AACAACCGCG CCGCAGTCCT GCGTGAGCGT
ATCGATTTCG AACGCGAGCA TCGTCTGGAG ATCGGCATCG ATCCGCACCG CCAGTCGATG
CGTCATGCGA TCGGTCTGGC CGCGCATGGC GCAACGCATA TCCGAAGCCA TGTCGATATC
GATCCGGTTC ATCGCCTGTC GCTGGTCGAG GGCGTCTGGG AAACGCGCGA GAAGCTCAGG
GGCATCATCG ACATCGAAAT CGTCGCGTTT CCCCAATCAG GCCTGATGGT CATGCCCGGC
ACGAAGGAGT TGCTCGACGA GGCGCTGCGT CAGGGCTGCG AAGTGCTGGG CGGCATCGAT
CCGTGCGGGA TAGACCGCGA TCCGAAGGGC CAGCTCGACA TTCTGTTTGC ACTCGCCACC
AAGCATGGCG TTCCGATCGA CATTCACCTG CATGAGACGG GCGATCTCGG CGCCTTCACC
ATGGAACTCA TCTTCGAGCG GATCCGCGCC AACGGCATGG AAGGCAAGGT GGCAATCAGC
CACGCCTTTG CGCTCGGCAT GAACGACTAT CTGCGCGTCG GCCAGCTGAT CGAGCAGCTC
GCTATTCTCG ACGTCGCGAT CCTCACCACC GGCGCGCCTT CGGCCACGGT GCCCTCGATC
AAGCGCCTGA AGGAAGCGGC CGTGCGCGTC GGCGGCGGCT GTGACGGTAT CCGTGACACC
TGGGGACCAT GGGGCCAGCC GGACATGCTG GACCGCGCCA AGGTTATCGG CATGAAGAAC
GGCGTGCGCT CGGATCACGA TCTGGAGCAT TTGCTGCACA TCGTCTCGCA AGGCGGTGCG
GATATCATGC GGCTTGAAAA TTACGGCCTT GAAGTCGGCC GCGATGCGGA CTTCACCCTG
TTGACCGGCG AGACGCTGGC GCATGCCGTG GTCGATGTCG CCCCGCGTCC GCTGGTCGTC
AAAGGGGGTC GCGTCACGGC CCGTCAGGGT GTCGCCGTCG TGGAGATGCC GTAA
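
The GC content field (63%) can in principle be recomputed from the gene sequence. The sketch below is one hedged way to do that in Python, assuming the sequence is pasted exactly as printed, in space-separated blocks of ten bases; `clean_sequence` and `gc_content` are illustrative helper names, not part of any database tool. Only the first line of the sequence is used here, so the printed value describes that line and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks of ten."""
    return "".join(text.split()).upper()


def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence; paste all lines to get the whole-gene value.
first_line = "ATGAACGACC TTTTGCTCCG CAACGTCAGG CCCATGGCAG GCGAAAGCTG CGATATCCTG"
print(f"{gc_content(first_line):.0%}")
```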
 
Protein sequence
MNDLLLRNVR PMAGESCDIL IRDGKIAGFG RFEAEPGMAV EDGGNAIAAP GLIDAHTHLD 
KTTWGMPWHV NNRAAVLRER IDFEREHRLE IGIDPHRQSM RHAIGLAAHG ATHIRSHVDI
DPVHRLSLVE GVWETREKLR GIIDIEIVAF PQSGLMVMPG TKELLDEALR QGCEVLGGID
PCGIDRDPKG QLDILFALAT KHGVPIDIHL HETGDLGAFT MELIFERIRA NGMEGKVAIS
HAFALGMNDY LRVGQLIEQL AILDVAILTT GAPSATVPSI KRLKEAAVRV GGGCDGIRDT
WGPWGQPDML DRAKVIGMKN GVRSDHDLEH LLHIVSQGGA DIMRLENYGL EVGRDADFTL
LTGETLAHAV VDVAPRPLVV KGGRVTARQG VAVVEMP
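
The protein sequence corresponds to the gene sequence read in frame with translation table 11. As a rough illustration, the Python sketch below builds the codon table from the standard amino-acid string for that table and translates the first and last stretches of the gene. The `translate` helper is hypothetical, and it ignores the alternative start codons that table 11 allows, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, codons in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and line breaks
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 24 bases of the gene sequence (both in frame).
print(translate("ATGAACGACC TTTTGCTCCG CAACGTCAGG"))  # MNDLLLRNVR
print(translate("GTCGCCGTCG TGGAGATGCC GTAA"))        # VAVVEMP
```

Both fragments match the start and end of the protein sequence listed above.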