Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_2833 |
Symbol | |
ID | 5323703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 2958275 |
End bp | 2959636 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640791778 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_001328498 |
Protein GI | 150398031 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.385298 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGGAGA AGGCGGAAAG GCAGCGGAAG GCGGCCCCGG ATCAGCAGCG CTCCGCCGGC TACATGCCGG GATTTGGCAA CGACTTCGAG ACGGAGAGCC TGCCCGGATC GCTGCCGCAG GGGCAGAACA GCCCGCAGAA GTGCAACTAC GGCCTCTATG CCGAACAGCT TTCCGGATCG CCCTTCACTG CACCGCGCGG CACCAATGAG CGCTCGTGGC TCTACCGCAT CCGTCCAAGC GTCCGGCATA CGGGGCGCTT CACGAAGATC GATTATCCGC ATTGGAAGAC GGCGCCGCAT ACGCCAGAGC ATTCCCTGGC GCTCGGGCAA TTGCGCTGGA GCCCTTTGCC GGCCCCCTCG CAGAGCCTGA CCTTTCTTCA GGGTATACGC ACCATGACGA CCGCGGGCGA CGCGCTGACG CAGGTCGGCA TGGCCGCACA TGCCTATGCC TTCAATGCCG ATATGGTGGA CGACTATTTC TTCAACGCCG ATGGCGAGCT GCTGATCGTT CCGGAAACGG GGGCATTCCA GGTCTTCACC GAACTCGGCA GGATCGACGT GGAGCCGTCG GAAATCTGCC TCGTACCGCG GGGCATGATG TTCAAGGTTA CACGCCTCGG CGATGAGAAG GTCTGGCGCG GCTATATCTG CGAGAATTAC GGAGCGAAGT TCACGCTGCC GGACCGCGGA CCGATCGGCG CCAATTGCCT GGCCAATCCG CGCGACTTCA AGACGCCGGT CGCCGCCTAC GAGGACAAGG AGACGCCCTG CCGCGTACAG GTGAAATGGT GCGGCTCCTT TCATACGGCC GAGATCGCCC ACTCACCGCT CGATGTCGTC GCCTGGCATG GCAATTATGC GCCCTACAAA TACGACCTCA AGACCTTCTC ACCTGTCGGT GCGATCCTGT TCGATCATCC CGACCCGTCG ATCTTCACGG TGCTGACCGC GCCGTCCGGG GAGGAAGGGA CGGCCAATGT CGACTTCGTC ATATTCCCGC CGCGCTGGCT GGTCGCCGAG CATACGTTCC GCCCGCCCTG GTATCACCGC AACATCATGA GCGAGTTCAT GGGCCTTATC CACGGGCGCT ACGATGCGAA GGAGGAGGGT TTCGTGCCGG GTGGCATGAG CCTGCACAAC ATGATGCTGG CGCACGGTCC GGATTTTTCC GGCTTCGAAA AGGCGTCGAA CGGCGAACTG AAGCCGGTAA AGCTCGACAA CACCATGGCC TTCATGTTCG AAACCCGTTT CCCCCAGCAG CTGACGACGT TTGCCGCCGA GCTCGAGACG CTGCAGGACG ACTACATCGA TTGCTGGTCA GGCCTCGAGC GCAAATTCGA CGGCACTCCC GGAATCAAGT GA
|
Protein sequence | MLEKAERQRK AAPDQQRSAG YMPGFGNDFE TESLPGSLPQ GQNSPQKCNY GLYAEQLSGS PFTAPRGTNE RSWLYRIRPS VRHTGRFTKI DYPHWKTAPH TPEHSLALGQ LRWSPLPAPS QSLTFLQGIR TMTTAGDALT QVGMAAHAYA FNADMVDDYF FNADGELLIV PETGAFQVFT ELGRIDVEPS EICLVPRGMM FKVTRLGDEK VWRGYICENY GAKFTLPDRG PIGANCLANP RDFKTPVAAY EDKETPCRVQ VKWCGSFHTA EIAHSPLDVV AWHGNYAPYK YDLKTFSPVG AILFDHPDPS IFTVLTAPSG EEGTANVDFV IFPPRWLVAE HTFRPPWYHR NIMSEFMGLI HGRYDAKEEG FVPGGMSLHN MMLAHGPDFS GFEKASNGEL KPVKLDNTMA FMFETRFPQQ LTTFAAELET LQDDYIDCWS GLERKFDGTP GIK
|
| |