Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5797 |
Symbol | |
ID | 5320099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 770950 |
End bp | 772425 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640777501 |
Product | amidohydrolase |
Protein accession | YP_001314433 |
Protein GI | 150377838 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00332293 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATGGGC TCCTGCTGCT TCACGGGACG ATCGTGACCG TCGACGACAA ACGGCGGATC ATCGAGGACG GAGCGCTGGC GGTCGAGAAC GACAAAATCG TCGACATCGG CACATCGCAA GCGTTGGCTC CGCGCCATGC CGGAAAGAAG GTGATCGACT GCCGGGGAAA GATTATCATT CCCGGATTGA TTGATGCTCA TGGACATGCC GGTCACGCTC TGATCCGCAG CATTGGTGCC GACACCAATG CGCTCTGGAT GCGTATCGTC ACCCCGACCT ATTATCACTA TGTCACCCGC GATTACTGGT ATGCGGACGG CCTGGTTTCC GGCCTGGAAC GGCTTCGAGC GGGTGTGACC ACCTCGGCCA GCATCGTCAC GTCGATGCCG CGCAGCGACG ACCCGGTTTT TGCGATTAAT CACGCACGCG CCTATTCGGA ACTGGGCCTG CGCGAGATCA TCTGCGTCGG CCCTGCGGGC CTGCCCTGGC CGCAATCGGT CACACGCTGG GAAAGCGGCA AGCCGGAGCG GCGCAGCGTT TCCTTCGAGG CAATGATGGA GGGCGCGGAA GCCGTCATCG AAAGCCTGAA CGGAACGTCC GAAGGGCGCA TCAAGGTCTT TTTGACGCCA TTCACCATCG TGCCTTCCGT GGAACCGTCG AACGCCTCGA CACCGGATTT CGCCGTCAAT CTGACGGAAG ACGACCGGAT GCAGGCGCGC CGCATCCGCG AAACCGCACG CAAGTGGGGC GTGCGCATTC ATTCCGATGC TTTCGCCGGA CAGATACGCA TGGCCTGGCA GGACAGGGAG AATGCGCTGC TCGGTCCGGA TGTGCACCTG CAGCATTGCT GGGGAATTTC CCACGAAGAG ATCGACATCC TGGCCGAAAC CGGCACCCAT GTCACCCATG CGCCGCCGGG ACGCAGCACC CCGGTCATGC AGATGATGGC CCGGGGTGTG TCGGTCGCAA TCACGTCGGA CGGCGCGGCT CCGAGTCGCC ATTTCGATAT GTTCCAGATC GCGCGTACCG CGCAGGCCAC CCAGCACATC CTGCATAATC ATGACCGCTA CATCCTGCCG CCGGGCAAAA TCTTCGAGAT GATCACCATC GACGCGGCCC GCGCCATCGG CATGGGTCAC GAGATCGGTT CGCTCGAGGT GGGCAAGAAG GCGGACATCG CCGTCATAGA CATGCGCAAG CCGCATCTGA CGCCCAACTG GATGCCCGTG CACCGGCTGA TCCATCAGGT GCTCGGAAGC GACGTCGACA CGGTGATCGT CGACGGCAGG ATCATCATGG AGGAAGGCAA GGTCCTGACG GCCGACATGT ACGAGGCGCT TGCATTCGGG GAGGCCGAAG CGAAGGCCCT TGTGGAACGG GCCGGTTTGC AGGCCCACAT GCACGATCCC GGCTGGGGGC AATTACATCG GACCTTTGAA AGACCTGTTC CGCTCCCGAC ACCGCCGGAT TGTTGA
|
Protein sequence | MDGLLLLHGT IVTVDDKRRI IEDGALAVEN DKIVDIGTSQ ALAPRHAGKK VIDCRGKIII PGLIDAHGHA GHALIRSIGA DTNALWMRIV TPTYYHYVTR DYWYADGLVS GLERLRAGVT TSASIVTSMP RSDDPVFAIN HARAYSELGL REIICVGPAG LPWPQSVTRW ESGKPERRSV SFEAMMEGAE AVIESLNGTS EGRIKVFLTP FTIVPSVEPS NASTPDFAVN LTEDDRMQAR RIRETARKWG VRIHSDAFAG QIRMAWQDRE NALLGPDVHL QHCWGISHEE IDILAETGTH VTHAPPGRST PVMQMMARGV SVAITSDGAA PSRHFDMFQI ARTAQATQHI LHNHDRYILP PGKIFEMITI DAARAIGMGH EIGSLEVGKK ADIAVIDMRK PHLTPNWMPV HRLIHQVLGS DVDTVIVDGR IIMEEGKVLT ADMYEALAFG EAEAKALVER AGLQAHMHDP GWGQLHRTFE RPVPLPTPPD C
|
| |