Gene Smed_4123 details

Gene Information

Locus tag: Smed_4123
Symbol:
ID: 5319316
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 590487
End bp: 592604
Gene Length: 2118 bp
Protein Length: 705 aa
Translation table: 11
GC content: 62%
IMG OID: 640775929
Product: catalase
Protein accession: YP_001312862
Protein GI: 150376266
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0753] Catalase
TIGRFAM ID:
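
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state this, but the assumption reproduces the listed 2118 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not come from the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(590487, 592604)
print(length_bp)                  # 2118
print(protein_length(length_bp))  # 705
```

Both values agree with the Gene Length and Protein Length fields of this record.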


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.283917
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAGA AACCGTCTGC GCCGAACAAT ACGAAACCGG CCACCATTCA TGACCAGAAA 
GCGACACGCG GCAATGGTGG AGAGCTTCAC CAGATCGCCG AAGGTGACAC GCCCGTTCTG
ACGACGGCGC AGGGCGGCCC TGTCGCCGAC GATCAGAACA GCCTGCGAGC CGGCGAGCGT
GGCCCCACGC TCATCGAGGA TTTTCATTTT CGCGAGAAGA TCTTCCACTT CGACCATGAA
CGAATTCCCG AGCGCGTCGT GCATGCTCGC GGTTATGGCG TTCACGGCTT TTTCGAGACC
TACGAGTCGC TTGCCGCCTA CACCCGGGCG GACCTGTTCC AGCGCCCGGG CGAGCGAACC
CCCGCCTTCG TGCGGTTCTC GACGGTCGCC GGAAGCAAGG GCTCCTTCGA TCTCGCCCGC
GACGTGCGTG GCTTCGCGGT CAAGATCTAC ACCAAGGAGG GCAATTGGGA CCTGGTCGGC
AACAATATTC CGGTCTTCTT CATCCAGGAT GCGATCAAGT TTCCCGACGT GATACATTCG
GTAAAACCCG AGCCGGACCG GGAGTTTCCG CAGGCGCAGT CCGCCCATGA CAATTTCTGG
GACTTCATCA GCCTGACACC GGAAAGCATG CACATGATCA TGTGGGTCAT GTCCGACCGG
GCGATTCCGC GATCGTTCCG GTTCATGGAA GGGTTCGGCG TGCACACCTT CCGCTTCGTC
AACGCCAAGG ACGAGTCCAC CTTCGTCAAG TTCCACTGGA AGCCGAAGCT CGGGCTGCAG
TCCGTGGTCT GGAACGAGGC CGTGAAGATC AACGGCGCCG ATCCGGACTT CCACCGGCGC
GATATGTGGC AAGCCATCCA GTCCGGCAAC TTTCCGGAAT GGGAACTGCA TGTGCAGCTC
TTCGATCAGG ACTTCGCCGA CAAGTTCGAT TTCGACATCC TCGATCCAAC CAAGATCATC
CCCGAGGAGG TGCTGCCAAC GAAGCCTGTC GGCCGGCTGG TGCTCGATCG CATGCCGGAG
AATTTCTTCG CCGAAACCGA GCAGGTCGCC TTCATGACGC AGAACGTCCC GCCCGGCATC
GACTTCAGCG ACGATCCATT GCTGCAGGGA CGCAACTTCT CCTATCTGGA CACCCAGCTG
AAGCGGCTCG GCAGCCCGAA TTTCACCCAC CTTCCGATCA ACGCGCCGAA ATGTCCCTTC
CATAACTTTC AGCAGGACGG CCACATGGCC ATGCGCAACC CTGTCGGGCG CGCGAACTAC
CAGCCCAATT CCTGGGGCGA GGGACCGCGC GAGTCGCCCG TCAAAGGCTT CCGACACTTT
GCTTCGGAGG AGCAGGGACC GAAGCTCCGC ATCCGCGCTG AAAGCTTCGC CGACCATTAC
AGTCAGGCAC GGCAGTTCTT CATCAGCCAG ACGCCACCCG AGCAGCGGCA CATCGCCGAC
GCCCTGACCT TCGAACTGAG CAAGGTCGAG ACGCCGGTGA TCCGTGAGCG GATGGTCGCG
CATCTCCTCA ACATCGACGA GACGCTGGGA AAAAAGGTCG GCCACGCGCT CGGCATGGAG
ACGATGCCGA AACCCGCCGA CGCGGCCGTT GCCACACGCC AGGACCTCGA TCCGTCGCCG
GCGCTCAGCA TCATCCAGCG CGGGCCCAAG CGTTTCGAAG GACGCAAGCT CGGAATATTG
GCGACCGACG GGACGGATGC CGCCCTTCTT AACGCCTTGC TGCAGGCGGT CGATACGGAG
AAGGCGGCTT TCGAACTGAT CGCACCAAAA GTCGGCGGCT TCACCGCCTC AGACGGCAAA
CGGATAGCGG CCCACCAGAT GCTCGACGGC GGCCCGTCGG TGCTCTACGA CGCCGTGGTC
CTGCTTGCCT CCGCAGAGGC CGTCGCGGAG CTGATCGACG TCGCCACCGC GCGCGATTTC
GTAGCCGACG CCTTCGCCCA TTGCAAATAT ATCGGCTATG TCAGCGCCGC GGTTCCCCTT
CTCGAGAGGG CCGGCATAGC GGGATTGCTC GATGAGGGAA CGATCGAACT CACCGACGCC
GGGAGTGCAG CCGCTTTCCT GAAGGAACTT GGCAAGCTGC GCGTCTGGGC ACGAGAGCCC
TCGGTCAAGC TGAAATAG
 
Protein sequence
MAKKPSAPNN TKPATIHDQK ATRGNGGELH QIAEGDTPVL TTAQGGPVAD DQNSLRAGER 
GPTLIEDFHF REKIFHFDHE RIPERVVHAR GYGVHGFFET YESLAAYTRA DLFQRPGERT
PAFVRFSTVA GSKGSFDLAR DVRGFAVKIY TKEGNWDLVG NNIPVFFIQD AIKFPDVIHS
VKPEPDREFP QAQSAHDNFW DFISLTPESM HMIMWVMSDR AIPRSFRFME GFGVHTFRFV
NAKDESTFVK FHWKPKLGLQ SVVWNEAVKI NGADPDFHRR DMWQAIQSGN FPEWELHVQL
FDQDFADKFD FDILDPTKII PEEVLPTKPV GRLVLDRMPE NFFAETEQVA FMTQNVPPGI
DFSDDPLLQG RNFSYLDTQL KRLGSPNFTH LPINAPKCPF HNFQQDGHMA MRNPVGRANY
QPNSWGEGPR ESPVKGFRHF ASEEQGPKLR IRAESFADHY SQARQFFISQ TPPEQRHIAD
ALTFELSKVE TPVIRERMVA HLLNIDETLG KKVGHALGME TMPKPADAAV ATRQDLDPSP
ALSIIQRGPK RFEGRKLGIL ATDGTDAALL NALLQAVDTE KAAFELIAPK VGGFTASDGK
RIAAHQMLDG GPSVLYDAVV LLASAEAVAE LIDVATARDF VADAFAHCKY IGYVSAAVPL
LERAGIAGLL DEGTIELTDA GSAAAFLKEL GKLRVWAREP SVKLK
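
The sequences above are laid out in whitespace-separated groups of 10 characters, 60 per line. As a minimal sketch, assuming the blocks are pasted into a string exactly as shown, the helpers below strip the layout and compute length and GC fraction. The names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as input here.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from this record.
first_line = "ATGGCCAAGA AACCGTCTGC GCCGAACAAT ACGAAACCGG CCACCATTCA TGACCAGAAA"
seq = join_blocks(first_line)

print(len(seq))                    # number of bases on that line
print(round(gc_fraction(seq), 2))  # GC fraction of that line only
```

Applied to the full gene sequence block, the same two calls give values that can be compared with the Gene Length and GC content fields of the record.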