Gene Smed_4625 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4625 
Symbol 
ID5318936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1128002 
End bp1129345 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content62% 
IMG OID640776424 
Productguanine deaminase 
Protein accessionYP_001313356 
Protein GI150376760 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02967] guanine deaminase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.430513 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGGATC TCTTGATCCG CGGCCGGGTG CTGACCTTTC TCAAAGAGCC CCAGGGCATC 
GACGATACGG CTTCTTACCG CTTCATCGAG GACGGCGCCG TTTTCGTAAG GAGCGGCAAG
ATTGTACACA TCGGCCCTTA CGTTGACGTG GCGAAGCAGG CAGGACCTGA CACAGAGGTC
GCCGACCACC GGCCGCATCT GGTCCTGCCC GGCTTAATCG ACACGCACCT GCATTTTCCG
CAGACCCAGG CGATCGCCTC CTACGGCGCG CAGCTTCTGG AATGGCTGAA CACCTATATC
TTCGTGGAAG AGCAGAAGTT CAAAGCTCCG GAGCATGCCG CCTTCATCGC GGGCCGCTTC
ATGGACGAGC TCCTGTCCAA CGGCACGACC ACGGCCGCCG CCTATTGCTC GGTGCATCCG
GAAAGCGTCG ACGCCTTCTT CGCGGCGGCC GAAGAGCGCG ACATGCTCAT GATCGGCGGC
AAGGTGATGA TGGACCGCAA CGCCCCGGAC GCGCTGCAAG ACACGTCTCA AAAGGGCTAT
GACGAGACCA AGGCGCTCAT CGCGAGATGG CATGGCCGCG GCCGCGCGCA TTATGCGATC
ACCCCGCGCT TCGCCATCAC CTCGACGCCG GAGCAGATGG AGATGAGCCG CGCCCTCGCC
GCAGAGCATC CGGACTGCTA CGTTCAGACG CATCTGTCCG AAAACCGGGA CGAAATCACC
TTCGCCACTT CGCTCTATCC GGAAGCCAAG GACTATACCG ATATTTATGC TCGCTATGAG
CTTCTCGGCC GCAAGACCCT GCTCGGCCAT TGCATCCACC TGAGCGACCG CGAAATATCG
GCGCTCGCCG AGACGGAGGC GGTGGGCGTC TTCTGCCCAA CGTCCAACCT CTTCCTCGGC
AGCGGCCTCT TCGATCGCGA CCGCTTCGAC AAGCTTGGAG CACGCCATGC GGTCGCCACC
GATGTCGGCG CCGGCACAAG CTTCTCGATG CTCGAAACAA TGGATGAAGC CTACAAGGTA
CTGCACCTGC AGGGGCAGCG GCTCTCCCCT CTCAATTCCT TCTATATGAT GACGCTCGGC
AACGCGCGCG CGCTCGATCT CGAGGATCGC ATCGGCTCGC TGCACGCCGG CGCGGATGCG
GACATCGTTG TTCTCGACAG CCGAGCCAAG CCGGCCATGG AACTCAGGAT GCAGGTCGCC
TCGTCGCTTG CCGAAGAGCT GTTCATCGTG CAAACGATGG GGGACGACCG TTCGGTGGCT
GAAGTCTACG TGGCCGGTAA GCCGATGAAA TGCCGCCGCC GGCAGACGAA AGCTCAGGCG
GACGTCGAAC TGGCAACGGC ATGA
 
Protein sequence
MRDLLIRGRV LTFLKEPQGI DDTASYRFIE DGAVFVRSGK IVHIGPYVDV AKQAGPDTEV 
ADHRPHLVLP GLIDTHLHFP QTQAIASYGA QLLEWLNTYI FVEEQKFKAP EHAAFIAGRF
MDELLSNGTT TAAAYCSVHP ESVDAFFAAA EERDMLMIGG KVMMDRNAPD ALQDTSQKGY
DETKALIARW HGRGRAHYAI TPRFAITSTP EQMEMSRALA AEHPDCYVQT HLSENRDEIT
FATSLYPEAK DYTDIYARYE LLGRKTLLGH CIHLSDREIS ALAETEAVGV FCPTSNLFLG
SGLFDRDRFD KLGARHAVAT DVGAGTSFSM LETMDEAYKV LHLQGQRLSP LNSFYMMTLG
NARALDLEDR IGSLHAGADA DIVVLDSRAK PAMELRMQVA SSLAEELFIV QTMGDDRSVA
EVYVAGKPMK CRRRQTKAQA DVELATA