Gene EcSMS35_4319 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4319 
Symbol 
ID6143903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4415633 
End bp4416571 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content41% 
IMG OID641619139 
ProductDNA-cytosine methyltransferase family protein 
Protein accessionYP_001746263 
Protein GI170680813 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.00165512 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCAACTG TAGTTTCACT TTTTTCTGGA TGTGGTGGTT CTGATGCGGG AGTTTTGAAC 
GCAGGTTTCA ATGTGCTTAT GGCAAATGAT ATTTTGCCTT ACGCAAGGGA CGTTTACTTA
GAAAACCATC CTGAAACTGA CTACATTTTG GGCGATATCT CAGGGCTCCA GTCGTTCCCT
TCTGCTGAGT TGCTCATCGG ATGCTATCCT TGCCAAGGAT TTAGTCAAGG TGGGGCAAGG
AAGGCAGATA GAAAGATTAA TACACTATAT TTAGAGTTTG CCCGTGCTTT GAGTAAAATT
AAGCCAAAAG CATTCATTGT AGAGAATGTC TCTGGTATGG TAAGGCGTAA CTTTGAGCAT
TTATTAAAGG ATCAATTCAA AGTTTTCGAA GAAGCAGGTT ATACAGTAAG CTCGCAAATT
CTGAATGCGT CCCATTATGG GGTATCCCAA GATAGGAAGC GAATCTTTAT CGTAGGAATA
CGAAAGGACT ACGGTATTAC ATACAAATTT CCAAAACCAA CACATGGTGA TGGTTTGACA
CCATATTCCA CAATTCGTGA TGCTATTGGG CATATGCCTG TTTGGCCTGT TGGCGAGTTT
TATGACGCCG ATTTTCATTG GTATTATCTA TCGCGAAACC GTAGGCAAGA TTGGGATCAG
ATATCTAAAA CAATTGTTGC AAATCCTAGA CATATGCCTC TACATCCAAT AAGTCCAACA
TTAGAAAAAT TGGGACCTGA TAAGTGGCAA TTTACTTCGG ATGCACCAGC TCGTAGGTTT
AGCTATCGAG AGGCTGCTAT TTTACAAGGT TTTGGGGATT TAATTTTCCC AGAAACTGAA
CGGGCTTCTA TTAATATGAA ATATACTGTG GTAGGTAATG CAGTACCGCC TCCATTGTTT
GAGGCGGTTG CTAGAAACCT TCCAGATTTA TGGGATTAA
 
Protein sequence
MPTVVSLFSG CGGSDAGVLN AGFNVLMAND ILPYARDVYL ENHPETDYIL GDISGLQSFP 
SAELLIGCYP CQGFSQGGAR KADRKINTLY LEFARALSKI KPKAFIVENV SGMVRRNFEH
LLKDQFKVFE EAGYTVSSQI LNASHYGVSQ DRKRIFIVGI RKDYGITYKF PKPTHGDGLT
PYSTIRDAIG HMPVWPVGEF YDADFHWYYL SRNRRQDWDQ ISKTIVANPR HMPLHPISPT
LEKLGPDKWQ FTSDAPARRF SYREAAILQG FGDLIFPETE RASINMKYTV VGNAVPPPLF
EAVARNLPDL WD