Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5790 |
Symbol | |
ID | 5320092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 761099 |
End bp | 762592 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640777495 |
Product | N-ethylammeline chlorohydrolase |
Protein accession | YP_001314427 |
Protein GI | 150377832 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.574512 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.000478395 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGGAG CGTTCGATAC CCTTCCGCTC GGGCGCCGGC CGGAAGGCCG CACACTGATC ACCGCAAGCT GGGTCGTCGG GCATAAGGAC GGCAGACACC GGCTTCTCCG CAATGGCGAG GTGGTTTTCG AGAACGGCGA GATTCTGTTC GTCGGGCACC GTTTCGCCGG TGAAACGGCG CGCCGGATCG ACTTCGGCGA CGCCTTGATC AGCCCCGGCC TGATCGATCT CGACGCGCTG TCCGATCTCG ATACGACCAT CCTCGGCATC GACAATCATC CCGGCTGGGC CAAGGGCCGC GTCTGGCCGC GCTCCTATGT CGAGGCCGGT CCTTATGAAA TGTACACGCA GGAGGAGCTT GCTTTTCAGA AGCGGTTCGC CTTCGCCCAG CTGTTGTTGA ACGGCATTAC CACGGCCGCT CCCATCGCCT CGTTGTTTTA TCGAGAATGG GGTGAAACGG TCGCCGAATT CGACGCCGCT GCGGAGGCCG CGGGGGAACT TGGCTTGCGT GTCTATCTGA GCCCCGCCTA TCGTTCGGGC GGAATGGTGC TGGAAGCGCC TGGCAAGATC GTGCCGGTCT TCGACGAGGA GCGCGGTATT CAGGGTCTGA AAGACGCCAT TGCCTACATT GAACGCCAGA ATGGCCGCCA TGGCGACCTG GTGCGCGGCA TGCTGGCGCC GGACCGAGTG GAGACCTCGA CGATCGGGCT CTTACAGCGC ACCGATGCCG CCGCTCGCGA TCTAGGCTGC AAATTCCGGC TGCATATGGC GCAGGGTGCG ATGGAAGTCG ACACTGTGCG CATGCTGCAC GGCTCGACAG CACCGGTCTG GCTTTCAAAC CACGGCCTGC TGAGCGACCG CCTGATCGCG CCGCATGCCA CCAATGCCAC GGACGAAGAC CTCGGCCATT ACGCGGCGAA CGGCGTTTCC ATCGTCCATT GTCCGCTCGT CTCGGGCCGT GGCGGTTCGA TCCTAAACTC CTTCTCCTCC TGCGTGAAAC GCGGGATCAA TATTGCCATG GGCACCGACA CGACCCCGCC CGACATGCTG ATGAACCTGC TGGCGGGCCT CATCACGGGC CGCATCGCCG ACGGCGCACC GGACCGCCTG CGTTCAGCCG ACCTGTTCGA TGCGGCGACG ATCGGCGGCG CTAAGGCGCT GGGCCGTTCC GATCTCGGCC ATCTGTCACC GGGAGCGCGC GCCGATATTG CGATCTTCCG ACTCGACGAC GTCTTCATGG CTCCGTCAAT CGACCCGATC ACCACGATCG TCACCGGCGG TTCGGGAAAG ATCACCCATG CGGTCTTCGT CGACGGCCGC GTCTCCATGC TTGATCGCCG GTTGGCCGGC TTCGACATGC AGGAGGCGCG CATTCGGGCG CAGATACAAT ATGACGGGCT GGTCGCCCGA TATCCCGAGC GGAGCTGGAA CAATCCTCCG GTCGCCGAAA TCTTCCCGCC CAGCTATCCG ATAGAGGGGA ACGTCAATGG GTGA
|
Protein sequence | MSGAFDTLPL GRRPEGRTLI TASWVVGHKD GRHRLLRNGE VVFENGEILF VGHRFAGETA RRIDFGDALI SPGLIDLDAL SDLDTTILGI DNHPGWAKGR VWPRSYVEAG PYEMYTQEEL AFQKRFAFAQ LLLNGITTAA PIASLFYREW GETVAEFDAA AEAAGELGLR VYLSPAYRSG GMVLEAPGKI VPVFDEERGI QGLKDAIAYI ERQNGRHGDL VRGMLAPDRV ETSTIGLLQR TDAAARDLGC KFRLHMAQGA MEVDTVRMLH GSTAPVWLSN HGLLSDRLIA PHATNATDED LGHYAANGVS IVHCPLVSGR GGSILNSFSS CVKRGINIAM GTDTTPPDML MNLLAGLITG RIADGAPDRL RSADLFDAAT IGGAKALGRS DLGHLSPGAR ADIAIFRLDD VFMAPSIDPI TTIVTGGSGK ITHAVFVDGR VSMLDRRLAG FDMQEARIRA QIQYDGLVAR YPERSWNNPP VAEIFPPSYP IEGNVNG
|
| |