Gene Smed_3373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3373 
Symbol 
ID5324257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3575288 
End bp3576160 
Gene Length873 bp 
Protein Length290 aa 
Translation table11 
GC content64% 
IMG OID640792324 
Productmethylated-DNA--protein-cysteine methyltransferase 
Protein accessionYP_001329029 
Protein GI150398562 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0350] Methylated DNA-protein cysteine methyltransferase
[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID[TIGR00589] O-6-methylguanine DNA methyltransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTTG CTACATCCAT TCCCACCGAC ATCACTCCGG AAGGCACCGA CTACGACACG 
GTGACTCGTG TCATCGCGAT GCTGACCGAA GACTATCGCG AGCAGCCGTC GCTCGAGTCG
CTCGCCCGTC GCCTCGGGCA GTCGCCGACG CAACTTCAGA AAGTTTTCAC CCGTTGGGCG
GGGCTCTCAC CCAAGGCCTT TCTGCAGGCA ATCACTCTCG ATCACGCCAA ACGGCTCCTG
CGCCAGGAAG ACCTGCCGTT GCTGGAGACC AGCATTGAGA TCGGCCTGTC CGGGCCGAGC
CGGCTGCACG ATCTCTTCGT AACGCATGAG GCGATGTCCC CCGGTGAATG GAAAGCGCGC
GGCGCGGGCC TTACCATCCG CTACGGGTTT CACCCTTCAC CCTTCGGGAC GGCGCTGGTC
ATGGTGACCG AGCGCGGTCT CGCCGGACTG GCCTTCGCCG ATTCAGGCGA GGAGCACGCG
AGCTTCGAGG ACATGGCCTC CCGCTGGCCG AACGCAATCT ACCTTGAAGA CAGCGCTGCA
ACGGCGCGCT ATGCGGCGCG CATTTTCGAC CCCGATCGGT GGTCCGCGGA GGAGCCGCTG
AGGATTTTTC TGATCGGCTC CGATTTTCAG GTCCGCGTAT GGCAGACGCT TCTCAAGATT
CCGCTCGGTA AGGCAACGAC CTATTCGAAA ATCGCGGAGA ATATCGGCCA GCCAACCGCT
TCGCGCGCCG TCGGCGCCGC GGTGGGGCGC AATCCGATCT CCTTCGTCGT GCCCTGCCAC
CGGGCGCTCG GCAAGGCCGG CGATCTCACC GGCTACCATT GGGGGCTGAC GCGCAAGCGC
GCGATCCTCG GCTGGGAGGC GGGGAAGGCC TGA
 
Protein sequence
MNVATSIPTD ITPEGTDYDT VTRVIAMLTE DYREQPSLES LARRLGQSPT QLQKVFTRWA 
GLSPKAFLQA ITLDHAKRLL RQEDLPLLET SIEIGLSGPS RLHDLFVTHE AMSPGEWKAR
GAGLTIRYGF HPSPFGTALV MVTERGLAGL AFADSGEEHA SFEDMASRWP NAIYLEDSAA
TARYAARIFD PDRWSAEEPL RIFLIGSDFQ VRVWQTLLKI PLGKATTYSK IAENIGQPTA
SRAVGAAVGR NPISFVVPCH RALGKAGDLT GYHWGLTRKR AILGWEAGKA