Gene Smed_0541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0541 
Symbol 
ID5321375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp584422 
End bp585618 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content61% 
IMG OID640789475 
ProductDNA methylase N-4/N-6 domain-containing protein 
Protein accessionYP_001326232 
Protein GI150395765 
COG category[L] Replication, recombination and repair 
COG ID[COG0863] DNA modification methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0336098 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.888289 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGTAACCA ACTTCGGTAA CCATACGCCC ATAGACAGTC CGAGTAAGCG TATTTGCGAG 
TTGCCAATGT CCTCAGTTGT TTCGCTTGCC GAAATCTCCC GTGCCGCCCG TCCTTTGAAC
TGGCTCGACA GCATCATCAA GGGGGATTGC GTGGCCGCGC TGAATGCTCT TCCCGACAAT
TCGGTCGATG TCGTCTTCGC CGACCCGCCC TATAATCTCC AACTCGGCGG CACGCTGCAC
CGGCCTGATC AGTCGCTGGT CGATGCGGTG GACGATGACT GGGACCAGTT TGCGTCCTTC
GAAGCCTATG ACGCCTTTAC CCGCGCCTGG CTGCTCGCCT GCCGCCGCGT GCTGAAGCCC
ACCGGCACGC TCTGGGTCAT CGGTTCCTAC CACAATATCT TCCGGGTCGG CGCAATCCTT
CAGGACCTGC ATTTCTGGGT TCTGAACGAT ATCATCTGGC GCAAGACCAA CCCGATGCCG
AACTTCAAGG GGCGCCGCTT CCAGAACGCT CATGAAACGC TGATCTGGGC GACGCCGAAC
GCCAAGGCAA AGGGGTACAC CTTCAATTAT GAGGCGATGA AGGCGGCGAA TGACGACGTT
CAGATGCGTT CCGACTGGCT GTTCCCGATC TGCTCCGGTT CGGAGCGGCT GAAGGGAGAC
GACGGCAAGA AGGTGCACCC GACGCAAAAG CCGGAAGCGC TGCTTGCCCG CATTCTGATG
GCCTCGACCA AACCCGGCGA CGTTGTGCTC GATCCTTTCT TCGGCTCCGG CACCACCGGG
GCAGTCGCAA AGCGCCTCGG CCGGCACTTT GTCGGCATCG AGCGCGAACA GGACTATATC
GATGCAGCCG CCGAACGCAT CGCGGCTGTC GAGCCTCTCG GCAAAGCGAC GCTTTCCGTC
ATGACCGGCA AGAAAGCCGA GCCGCGTGTC GCCTTCAACA CGCTCATCGA AAGCGGGCTC
ATCAAGCCCG GCACGGTTCT GACGGACGCG AAGCGCCGCT ACAGCGCGAT CGTTCGCGCC
GACGGCACGC TGGCGTCCGG TGGCGAGGCT GGCTCCATTC ACCGCCTTGG CGCGAAAGTG
CAAGGCCTCG ACGCCTGCAA TGGCTGGACC TTCTGGCACT TCGAAGACGG AAGCGTGTTG
AAGCCGATCG ACGAGCTCAG ATCCGTCATT CGAAACGACC TGGCAAAACT GAACTGA
 
Protein sequence
MVTNFGNHTP IDSPSKRICE LPMSSVVSLA EISRAARPLN WLDSIIKGDC VAALNALPDN 
SVDVVFADPP YNLQLGGTLH RPDQSLVDAV DDDWDQFASF EAYDAFTRAW LLACRRVLKP
TGTLWVIGSY HNIFRVGAIL QDLHFWVLND IIWRKTNPMP NFKGRRFQNA HETLIWATPN
AKAKGYTFNY EAMKAANDDV QMRSDWLFPI CSGSERLKGD DGKKVHPTQK PEALLARILM
ASTKPGDVVL DPFFGSGTTG AVAKRLGRHF VGIEREQDYI DAAAERIAAV EPLGKATLSV
MTGKKAEPRV AFNTLIESGL IKPGTVLTDA KRRYSAIVRA DGTLASGGEA GSIHRLGAKV
QGLDACNGWT FWHFEDGSVL KPIDELRSVI RNDLAKLN