Gene EcSMS35_1224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1224 
Symboldcm 
ID6147496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1227053 
End bp1228471 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content52% 
IMG OID641616102 
ProductDNA cytosine methylase 
Protein accessionYP_001743285 
Protein GI170681403 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.27257 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.639542 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAAA ATATATCAGT AACCGATTCA TACAGCACCG GGAATGCCGC ACAGGCAATG 
CTGGAGAAAC TGCTGCAAAT TTATGATGTT AAAACGCTGG TGGCGCAGCT TAATGGTGTG
GGTGAGAATC ACTGGAGCGC GGCAATTTTA AAACGTGCGC TGGCGAATGA CTCGGCATGG
CACCGTTTAA GTGAGAAAGA GTTCGCCCAT CTGCAAACGT TGTTACCCAA ACCACCGGCA
CATCATCCGC ATTATGCGTT TCGCTTTATC GATCTATTTG CCGGAATTGG CGGCATCCGT
CGCGGTTTTG AATCGATTGG CGGACAATGC GTGTTTACCA GCGAATGGAA CAAACATGCG
GTACGCACTT ATAAAGCCAA CCATTATTGC GATCCGGCGA CGCATCATTT TAATGAAGAT
ATCCGCGATA TCACCCTCAG CCATAAAGAA GGCGTGAGTG ATGAGGCGGC GGCGGAACAT
ATTCGTCAAC ACATTCCTGA ACACGATGTT TTACTGGCCG GTTTCCCTTG TCAGCCATTT
TCGCTGGCTG GCGTATCGAA AAAGAACTCG CTCGGGCGGG CGCACGGTTT TGCCTGCGAT
ACCCAGGGCA CGCTGTTTTT TGATGTGGTA CGCATTATCG ACGCGCGTCG TCCGGCGATG
TTTGTGCTCG AAAACGTCAA AAACCTGAAA AGTCACGACC AGGGTAAAAC GTTCCGCATC
ATCATGCAGA CGCTGGACGA ACTGGGCTAT GACGTGGCTG ATGCAGAAGA TAACGGGCCA
GACGATCCGA AAATCATCGA CGGCAAACAT TTTCTGCCGC AGCACCGTGA ACGCATCGTG
CTGGTGGGTT TTCGTCGCGA TCTGAATCTG AAAGCCGATT TTACCCTGCG TGATATCAGC
GAATGTTTCC CTGCGCAGCG AGTGACGCTG GCGCAGCTGC TGGACCCGAT GGTCGAGGCG
AAATATATCC TGACGCCGGT GCTGTGGAAG TACCTCTATC GATATGCGAA AAAACATCAG
GCGCGCGGTA ACGGCTTCGG TTATGGAATG GTTTATCCGA ACAATCCGCA AAGCGTCACC
CGTACGCTGT CTGCGCGTTA TTACAAAGAT GGCGCGGAAA TTTTAATCGA TCGTGGCTGG
GATATGGCCA CGGGTGAGAA AGACTTTGAC GATCCGCAGA ATCAGCAACA TCGTCCACGT
CGGTTAACGC CTCGTGAATG CGCGCGCTTA ATGGGTTTTG AAGCGCCGGG AGAAGCGAAA
TTCCGCATTC CGGTTTCGGA CACTCAGGCC TATCGCCAGT TCGGTAACTC GGTGGTCGTG
CCGGTCTTTG CCGCGGTGGC AAAACTGCTT GAGCCAAAAA TCAAACAGGC GGTGGCGTTG
CGTCAGCAAG AGGCACAACA TGGCCGACGT TCACGATAA
 
Protein sequence
MQENISVTDS YSTGNAAQAM LEKLLQIYDV KTLVAQLNGV GENHWSAAIL KRALANDSAW 
HRLSEKEFAH LQTLLPKPPA HHPHYAFRFI DLFAGIGGIR RGFESIGGQC VFTSEWNKHA
VRTYKANHYC DPATHHFNED IRDITLSHKE GVSDEAAAEH IRQHIPEHDV LLAGFPCQPF
SLAGVSKKNS LGRAHGFACD TQGTLFFDVV RIIDARRPAM FVLENVKNLK SHDQGKTFRI
IMQTLDELGY DVADAEDNGP DDPKIIDGKH FLPQHRERIV LVGFRRDLNL KADFTLRDIS
ECFPAQRVTL AQLLDPMVEA KYILTPVLWK YLYRYAKKHQ ARGNGFGYGM VYPNNPQSVT
RTLSARYYKD GAEILIDRGW DMATGEKDFD DPQNQQHRPR RLTPRECARL MGFEAPGEAK
FRIPVSDTQA YRQFGNSVVV PVFAAVAKLL EPKIKQAVAL RQQEAQHGRR SR