Gene Smed_2833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2833 
Symbol 
ID5323703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2958275 
End bp2959636 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content62% 
IMG OID640791778 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_001328498 
Protein GI150398031 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.385298 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGGAGA AGGCGGAAAG GCAGCGGAAG GCGGCCCCGG ATCAGCAGCG CTCCGCCGGC 
TACATGCCGG GATTTGGCAA CGACTTCGAG ACGGAGAGCC TGCCCGGATC GCTGCCGCAG
GGGCAGAACA GCCCGCAGAA GTGCAACTAC GGCCTCTATG CCGAACAGCT TTCCGGATCG
CCCTTCACTG CACCGCGCGG CACCAATGAG CGCTCGTGGC TCTACCGCAT CCGTCCAAGC
GTCCGGCATA CGGGGCGCTT CACGAAGATC GATTATCCGC ATTGGAAGAC GGCGCCGCAT
ACGCCAGAGC ATTCCCTGGC GCTCGGGCAA TTGCGCTGGA GCCCTTTGCC GGCCCCCTCG
CAGAGCCTGA CCTTTCTTCA GGGTATACGC ACCATGACGA CCGCGGGCGA CGCGCTGACG
CAGGTCGGCA TGGCCGCACA TGCCTATGCC TTCAATGCCG ATATGGTGGA CGACTATTTC
TTCAACGCCG ATGGCGAGCT GCTGATCGTT CCGGAAACGG GGGCATTCCA GGTCTTCACC
GAACTCGGCA GGATCGACGT GGAGCCGTCG GAAATCTGCC TCGTACCGCG GGGCATGATG
TTCAAGGTTA CACGCCTCGG CGATGAGAAG GTCTGGCGCG GCTATATCTG CGAGAATTAC
GGAGCGAAGT TCACGCTGCC GGACCGCGGA CCGATCGGCG CCAATTGCCT GGCCAATCCG
CGCGACTTCA AGACGCCGGT CGCCGCCTAC GAGGACAAGG AGACGCCCTG CCGCGTACAG
GTGAAATGGT GCGGCTCCTT TCATACGGCC GAGATCGCCC ACTCACCGCT CGATGTCGTC
GCCTGGCATG GCAATTATGC GCCCTACAAA TACGACCTCA AGACCTTCTC ACCTGTCGGT
GCGATCCTGT TCGATCATCC CGACCCGTCG ATCTTCACGG TGCTGACCGC GCCGTCCGGG
GAGGAAGGGA CGGCCAATGT CGACTTCGTC ATATTCCCGC CGCGCTGGCT GGTCGCCGAG
CATACGTTCC GCCCGCCCTG GTATCACCGC AACATCATGA GCGAGTTCAT GGGCCTTATC
CACGGGCGCT ACGATGCGAA GGAGGAGGGT TTCGTGCCGG GTGGCATGAG CCTGCACAAC
ATGATGCTGG CGCACGGTCC GGATTTTTCC GGCTTCGAAA AGGCGTCGAA CGGCGAACTG
AAGCCGGTAA AGCTCGACAA CACCATGGCC TTCATGTTCG AAACCCGTTT CCCCCAGCAG
CTGACGACGT TTGCCGCCGA GCTCGAGACG CTGCAGGACG ACTACATCGA TTGCTGGTCA
GGCCTCGAGC GCAAATTCGA CGGCACTCCC GGAATCAAGT GA
 
Protein sequence
MLEKAERQRK AAPDQQRSAG YMPGFGNDFE TESLPGSLPQ GQNSPQKCNY GLYAEQLSGS 
PFTAPRGTNE RSWLYRIRPS VRHTGRFTKI DYPHWKTAPH TPEHSLALGQ LRWSPLPAPS
QSLTFLQGIR TMTTAGDALT QVGMAAHAYA FNADMVDDYF FNADGELLIV PETGAFQVFT
ELGRIDVEPS EICLVPRGMM FKVTRLGDEK VWRGYICENY GAKFTLPDRG PIGANCLANP
RDFKTPVAAY EDKETPCRVQ VKWCGSFHTA EIAHSPLDVV AWHGNYAPYK YDLKTFSPVG
AILFDHPDPS IFTVLTAPSG EEGTANVDFV IFPPRWLVAE HTFRPPWYHR NIMSEFMGLI
HGRYDAKEEG FVPGGMSLHN MMLAHGPDFS GFEKASNGEL KPVKLDNTMA FMFETRFPQQ
LTTFAAELET LQDDYIDCWS GLERKFDGTP GIK