Gene Smed_5790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5790 
Symbol 
ID5320092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp761099 
End bp762592 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content63% 
IMG OID640777495 
ProductN-ethylammeline chlorohydrolase 
Protein accessionYP_001314427 
Protein GI150377832 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.574512 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000478395 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGGAG CGTTCGATAC CCTTCCGCTC GGGCGCCGGC CGGAAGGCCG CACACTGATC 
ACCGCAAGCT GGGTCGTCGG GCATAAGGAC GGCAGACACC GGCTTCTCCG CAATGGCGAG
GTGGTTTTCG AGAACGGCGA GATTCTGTTC GTCGGGCACC GTTTCGCCGG TGAAACGGCG
CGCCGGATCG ACTTCGGCGA CGCCTTGATC AGCCCCGGCC TGATCGATCT CGACGCGCTG
TCCGATCTCG ATACGACCAT CCTCGGCATC GACAATCATC CCGGCTGGGC CAAGGGCCGC
GTCTGGCCGC GCTCCTATGT CGAGGCCGGT CCTTATGAAA TGTACACGCA GGAGGAGCTT
GCTTTTCAGA AGCGGTTCGC CTTCGCCCAG CTGTTGTTGA ACGGCATTAC CACGGCCGCT
CCCATCGCCT CGTTGTTTTA TCGAGAATGG GGTGAAACGG TCGCCGAATT CGACGCCGCT
GCGGAGGCCG CGGGGGAACT TGGCTTGCGT GTCTATCTGA GCCCCGCCTA TCGTTCGGGC
GGAATGGTGC TGGAAGCGCC TGGCAAGATC GTGCCGGTCT TCGACGAGGA GCGCGGTATT
CAGGGTCTGA AAGACGCCAT TGCCTACATT GAACGCCAGA ATGGCCGCCA TGGCGACCTG
GTGCGCGGCA TGCTGGCGCC GGACCGAGTG GAGACCTCGA CGATCGGGCT CTTACAGCGC
ACCGATGCCG CCGCTCGCGA TCTAGGCTGC AAATTCCGGC TGCATATGGC GCAGGGTGCG
ATGGAAGTCG ACACTGTGCG CATGCTGCAC GGCTCGACAG CACCGGTCTG GCTTTCAAAC
CACGGCCTGC TGAGCGACCG CCTGATCGCG CCGCATGCCA CCAATGCCAC GGACGAAGAC
CTCGGCCATT ACGCGGCGAA CGGCGTTTCC ATCGTCCATT GTCCGCTCGT CTCGGGCCGT
GGCGGTTCGA TCCTAAACTC CTTCTCCTCC TGCGTGAAAC GCGGGATCAA TATTGCCATG
GGCACCGACA CGACCCCGCC CGACATGCTG ATGAACCTGC TGGCGGGCCT CATCACGGGC
CGCATCGCCG ACGGCGCACC GGACCGCCTG CGTTCAGCCG ACCTGTTCGA TGCGGCGACG
ATCGGCGGCG CTAAGGCGCT GGGCCGTTCC GATCTCGGCC ATCTGTCACC GGGAGCGCGC
GCCGATATTG CGATCTTCCG ACTCGACGAC GTCTTCATGG CTCCGTCAAT CGACCCGATC
ACCACGATCG TCACCGGCGG TTCGGGAAAG ATCACCCATG CGGTCTTCGT CGACGGCCGC
GTCTCCATGC TTGATCGCCG GTTGGCCGGC TTCGACATGC AGGAGGCGCG CATTCGGGCG
CAGATACAAT ATGACGGGCT GGTCGCCCGA TATCCCGAGC GGAGCTGGAA CAATCCTCCG
GTCGCCGAAA TCTTCCCGCC CAGCTATCCG ATAGAGGGGA ACGTCAATGG GTGA
 
Protein sequence
MSGAFDTLPL GRRPEGRTLI TASWVVGHKD GRHRLLRNGE VVFENGEILF VGHRFAGETA 
RRIDFGDALI SPGLIDLDAL SDLDTTILGI DNHPGWAKGR VWPRSYVEAG PYEMYTQEEL
AFQKRFAFAQ LLLNGITTAA PIASLFYREW GETVAEFDAA AEAAGELGLR VYLSPAYRSG
GMVLEAPGKI VPVFDEERGI QGLKDAIAYI ERQNGRHGDL VRGMLAPDRV ETSTIGLLQR
TDAAARDLGC KFRLHMAQGA MEVDTVRMLH GSTAPVWLSN HGLLSDRLIA PHATNATDED
LGHYAANGVS IVHCPLVSGR GGSILNSFSS CVKRGINIAM GTDTTPPDML MNLLAGLITG
RIADGAPDRL RSADLFDAAT IGGAKALGRS DLGHLSPGAR ADIAIFRLDD VFMAPSIDPI
TTIVTGGSGK ITHAVFVDGR VSMLDRRLAG FDMQEARIRA QIQYDGLVAR YPERSWNNPP
VAEIFPPSYP IEGNVNG