Gene Smed_5797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5797 
Symbol 
ID5320099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp770950 
End bp772425 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content62% 
IMG OID640777501 
Productamidohydrolase 
Protein accessionYP_001314433 
Protein GI150377838 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00332293 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATGGGC TCCTGCTGCT TCACGGGACG ATCGTGACCG TCGACGACAA ACGGCGGATC 
ATCGAGGACG GAGCGCTGGC GGTCGAGAAC GACAAAATCG TCGACATCGG CACATCGCAA
GCGTTGGCTC CGCGCCATGC CGGAAAGAAG GTGATCGACT GCCGGGGAAA GATTATCATT
CCCGGATTGA TTGATGCTCA TGGACATGCC GGTCACGCTC TGATCCGCAG CATTGGTGCC
GACACCAATG CGCTCTGGAT GCGTATCGTC ACCCCGACCT ATTATCACTA TGTCACCCGC
GATTACTGGT ATGCGGACGG CCTGGTTTCC GGCCTGGAAC GGCTTCGAGC GGGTGTGACC
ACCTCGGCCA GCATCGTCAC GTCGATGCCG CGCAGCGACG ACCCGGTTTT TGCGATTAAT
CACGCACGCG CCTATTCGGA ACTGGGCCTG CGCGAGATCA TCTGCGTCGG CCCTGCGGGC
CTGCCCTGGC CGCAATCGGT CACACGCTGG GAAAGCGGCA AGCCGGAGCG GCGCAGCGTT
TCCTTCGAGG CAATGATGGA GGGCGCGGAA GCCGTCATCG AAAGCCTGAA CGGAACGTCC
GAAGGGCGCA TCAAGGTCTT TTTGACGCCA TTCACCATCG TGCCTTCCGT GGAACCGTCG
AACGCCTCGA CACCGGATTT CGCCGTCAAT CTGACGGAAG ACGACCGGAT GCAGGCGCGC
CGCATCCGCG AAACCGCACG CAAGTGGGGC GTGCGCATTC ATTCCGATGC TTTCGCCGGA
CAGATACGCA TGGCCTGGCA GGACAGGGAG AATGCGCTGC TCGGTCCGGA TGTGCACCTG
CAGCATTGCT GGGGAATTTC CCACGAAGAG ATCGACATCC TGGCCGAAAC CGGCACCCAT
GTCACCCATG CGCCGCCGGG ACGCAGCACC CCGGTCATGC AGATGATGGC CCGGGGTGTG
TCGGTCGCAA TCACGTCGGA CGGCGCGGCT CCGAGTCGCC ATTTCGATAT GTTCCAGATC
GCGCGTACCG CGCAGGCCAC CCAGCACATC CTGCATAATC ATGACCGCTA CATCCTGCCG
CCGGGCAAAA TCTTCGAGAT GATCACCATC GACGCGGCCC GCGCCATCGG CATGGGTCAC
GAGATCGGTT CGCTCGAGGT GGGCAAGAAG GCGGACATCG CCGTCATAGA CATGCGCAAG
CCGCATCTGA CGCCCAACTG GATGCCCGTG CACCGGCTGA TCCATCAGGT GCTCGGAAGC
GACGTCGACA CGGTGATCGT CGACGGCAGG ATCATCATGG AGGAAGGCAA GGTCCTGACG
GCCGACATGT ACGAGGCGCT TGCATTCGGG GAGGCCGAAG CGAAGGCCCT TGTGGAACGG
GCCGGTTTGC AGGCCCACAT GCACGATCCC GGCTGGGGGC AATTACATCG GACCTTTGAA
AGACCTGTTC CGCTCCCGAC ACCGCCGGAT TGTTGA
 
Protein sequence
MDGLLLLHGT IVTVDDKRRI IEDGALAVEN DKIVDIGTSQ ALAPRHAGKK VIDCRGKIII 
PGLIDAHGHA GHALIRSIGA DTNALWMRIV TPTYYHYVTR DYWYADGLVS GLERLRAGVT
TSASIVTSMP RSDDPVFAIN HARAYSELGL REIICVGPAG LPWPQSVTRW ESGKPERRSV
SFEAMMEGAE AVIESLNGTS EGRIKVFLTP FTIVPSVEPS NASTPDFAVN LTEDDRMQAR
RIRETARKWG VRIHSDAFAG QIRMAWQDRE NALLGPDVHL QHCWGISHEE IDILAETGTH
VTHAPPGRST PVMQMMARGV SVAITSDGAA PSRHFDMFQI ARTAQATQHI LHNHDRYILP
PGKIFEMITI DAARAIGMGH EIGSLEVGKK ADIAVIDMRK PHLTPNWMPV HRLIHQVLGS
DVDTVIVDGR IIMEEGKVLT ADMYEALAFG EAEAKALVER AGLQAHMHDP GWGQLHRTFE
RPVPLPTPPD C