Gene Smed_1205 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1205 
Symbol 
ID5322052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1285540 
End bp1287438 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content62% 
IMG OID640790146 
ProductDNA-cytosine methyltransferase 
Protein accessionYP_001326890 
Protein GI150396423 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.649135 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCGA AGGCAGAGGC ACTATCGGCG GCGAAGCGAA GGATCATCGA GCTGCAGAAG 
CAGATGGCGG CACGGATCCT CGACATGGCT GCGGAAGTCG AAAAGCTCGC AAACGAGACG
ACGGAACTCG AAGCGCGTGA ATTCTTGCGC GTCACCTGCA ACATGCCGTC CTCCGAATTA
TCTACCTATG TGCGCTTCAG CAGCACATTG CGTGGCCGCG AGGAGCTGCT CGAGAGGCAC
CGGGTGTCGT TCCCAGTGTT GAAAGCGCTG ATTGGAGCGG ACGAAGAGAC TAGGTCGGAA
GTGCTTGAGA GAATGGAGAT CGGCGCGAGG ATCGATCTCA GAGGTATTTC GACCATTCGG
AAGCGTTTGC GGGAGGCGAA GCTGACCCCT GAGGCAGTGT TGGCCGACCA GGGGCGGAAG
ATCGCGTCGG CGGCAGCGCG CAAGCGAGTC CAGGATTCCT CGGCCACCTA CCTCGATCGG
CTCCATTACT TCGTATCAAG TATCATCGAT GAGCGAGACG CTGCTGAACT CGCGGCCGAC
GACATCCGTC ACGAGGCCGG TGAGCTGCGG ACCGGGTTCG AAGATCTATT CGGCCCGGAC
CATCGCGCTC CTGAGGAACT TAAACCGCGG TCAGCGGCAT ACGAGCTGTC GGTGGCCTAC
CGCGCCCTGG TGCATCTCGA GGAGGGCACC CTGCCCTTTG CAGGCGGCGT CGGCGAGCTC
GATCCCGACA GGGCGCATCC GTGGCTGCAG TCACTTAATG CTTTGACGGG GCGAGCGTTG
CCCGGGCACA AAGAGGATAG GGCGAACCTT CGAGAACTCC CTGCAGGTGC CGAGCGCCTG
ACGGTCGTCG AGATCTGCGC GGGAGCCGGT GGGATGTCGC TAGGTCTTGA ACGCGCTGGC
TTCGAGCACG TTGCGCTCGT CGAATACGAC AATCATGCTG CCGCCACGCT CCGCCGCAAC
CGCCGTGATT GGACGGTCAT CCGAGAGGAC GTGCGGACAA TGGATTTCAG GCTTTACCGC
CAGCTTGAAA TCGACCTCGT GAGCGGGGGC CCTCCCTGCC AGCCGTATTC GTCCGACGGG
TACGGTCTCG GTAAGGAAGA CCCGCGCGAC CTGCTCCCGG AATGCGTTCG CATCGTTGAT
GAGATCAAAC CGAAGGCGTT TCTGTTCGAG AATGTCGATG GCCTGCTACA GGCGCGCCAC
TCGGACCATG TCGCCGACAT CCTCCGTGGC TTCAAGCGTG CTGGCTACGA GGTCGATATT
CATCGCATTC AGGCAAAAGA CTATGGGCTG GCTCAGGAAC GAAGCCGTGT CCTGTTCATC
GGAATCCGCA AGGATCTTGC TCGGGGATTC CGAATGCCGC CGAAATTCCC ACAGCGGAGT
GCCAACATCG GCGACGTCCT CGTTGATCTG ATGGCGGCAA ACGGTTGGGA AGGAGCCTAC
GAATGGGCCA GAGAGCGGCG GGAGGCCAAC GACGTAGCGT CTACCGTTGT CACGCGCCGC
GGCAAGCCGC GTGCGAAGGA GGCAGCCCGC TGGGGCTCCA AGGGCGTTGA CATTGCAGGT
CTGCCGGAAT CGGCACCTAC CATGGTGCAG GCGTCGAAGC CGGGCTTCAT GCCCGCGTTG
ACGGCGCGGA TGCGTGCCCG GCTGCAGGAC TTCCCGGACG AGTGGGAATT CGTCGGGGGC
AAGCAAGCAA CCGCGGACCA GATCGGCAAT GCGGTGCCGC CGCGCATGGC GCAGGCGATC
GGCCTGGCGA TCCACGGCGC GCTCATGGGC GCGAGCTGGG ACATGGAAGC GATGCTCTGG
CCCGAGCAAC GCCGTGTGGA AGTGTCGGCG CCGCCGCTTG CGACGCAGGC CGCAGATTAC
ATGGCTCCCT TGGAATGCAT CCATGAACAT GACCTTTAA
 
Protein sequence
MNSKAEALSA AKRRIIELQK QMAARILDMA AEVEKLANET TELEAREFLR VTCNMPSSEL 
STYVRFSSTL RGREELLERH RVSFPVLKAL IGADEETRSE VLERMEIGAR IDLRGISTIR
KRLREAKLTP EAVLADQGRK IASAAARKRV QDSSATYLDR LHYFVSSIID ERDAAELAAD
DIRHEAGELR TGFEDLFGPD HRAPEELKPR SAAYELSVAY RALVHLEEGT LPFAGGVGEL
DPDRAHPWLQ SLNALTGRAL PGHKEDRANL RELPAGAERL TVVEICAGAG GMSLGLERAG
FEHVALVEYD NHAAATLRRN RRDWTVIRED VRTMDFRLYR QLEIDLVSGG PPCQPYSSDG
YGLGKEDPRD LLPECVRIVD EIKPKAFLFE NVDGLLQARH SDHVADILRG FKRAGYEVDI
HRIQAKDYGL AQERSRVLFI GIRKDLARGF RMPPKFPQRS ANIGDVLVDL MAANGWEGAY
EWARERREAN DVASTVVTRR GKPRAKEAAR WGSKGVDIAG LPESAPTMVQ ASKPGFMPAL
TARMRARLQD FPDEWEFVGG KQATADQIGN AVPPRMAQAI GLAIHGALMG ASWDMEAMLW
PEQRRVEVSA PPLATQAADY MAPLECIHEH DL