Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4319 |
Symbol | |
ID | 6143903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4415633 |
End bp | 4416571 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641619139 |
Product | DNA-cytosine methyltransferase family protein |
Protein accession | YP_001746263 |
Protein GI | 170680813 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | [TIGR00675] DNA-methyltransferase (dcm) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.00165512 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCAACTG TAGTTTCACT TTTTTCTGGA TGTGGTGGTT CTGATGCGGG AGTTTTGAAC GCAGGTTTCA ATGTGCTTAT GGCAAATGAT ATTTTGCCTT ACGCAAGGGA CGTTTACTTA GAAAACCATC CTGAAACTGA CTACATTTTG GGCGATATCT CAGGGCTCCA GTCGTTCCCT TCTGCTGAGT TGCTCATCGG ATGCTATCCT TGCCAAGGAT TTAGTCAAGG TGGGGCAAGG AAGGCAGATA GAAAGATTAA TACACTATAT TTAGAGTTTG CCCGTGCTTT GAGTAAAATT AAGCCAAAAG CATTCATTGT AGAGAATGTC TCTGGTATGG TAAGGCGTAA CTTTGAGCAT TTATTAAAGG ATCAATTCAA AGTTTTCGAA GAAGCAGGTT ATACAGTAAG CTCGCAAATT CTGAATGCGT CCCATTATGG GGTATCCCAA GATAGGAAGC GAATCTTTAT CGTAGGAATA CGAAAGGACT ACGGTATTAC ATACAAATTT CCAAAACCAA CACATGGTGA TGGTTTGACA CCATATTCCA CAATTCGTGA TGCTATTGGG CATATGCCTG TTTGGCCTGT TGGCGAGTTT TATGACGCCG ATTTTCATTG GTATTATCTA TCGCGAAACC GTAGGCAAGA TTGGGATCAG ATATCTAAAA CAATTGTTGC AAATCCTAGA CATATGCCTC TACATCCAAT AAGTCCAACA TTAGAAAAAT TGGGACCTGA TAAGTGGCAA TTTACTTCGG ATGCACCAGC TCGTAGGTTT AGCTATCGAG AGGCTGCTAT TTTACAAGGT TTTGGGGATT TAATTTTCCC AGAAACTGAA CGGGCTTCTA TTAATATGAA ATATACTGTG GTAGGTAATG CAGTACCGCC TCCATTGTTT GAGGCGGTTG CTAGAAACCT TCCAGATTTA TGGGATTAA
|
Protein sequence | MPTVVSLFSG CGGSDAGVLN AGFNVLMAND ILPYARDVYL ENHPETDYIL GDISGLQSFP SAELLIGCYP CQGFSQGGAR KADRKINTLY LEFARALSKI KPKAFIVENV SGMVRRNFEH LLKDQFKVFE EAGYTVSSQI LNASHYGVSQ DRKRIFIVGI RKDYGITYKF PKPTHGDGLT PYSTIRDAIG HMPVWPVGEF YDADFHWYYL SRNRRQDWDQ ISKTIVANPR HMPLHPISPT LEKLGPDKWQ FTSDAPARRF SYREAAILQG FGDLIFPETE RASINMKYTV VGNAVPPPLF EAVARNLPDL WD
|
| |