Gene EcolC_1682 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1682 
Symbol 
ID6066519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1884734 
End bp1886152 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content52% 
IMG OID641601096 
ProductDNA cytosine methylase 
Protein accessionYP_001724661 
Protein GI170019707 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0382129 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAAA ATATATCAGT AACCGATTCA TACAGCACCG GGAATGCCGC ACAGGCAATG 
CTGGAGAAAC TGCTGCAAAT TTATGATGTT AAAACGCTGG TGGCGCAGCT TAATGGTGTG
GGTGAGAATC ACTGGAGCGC GGCAATTTTA AAACGTGCGC TGGCGAATGA CTCGGCATGG
CACCGTTTAA GTGAGAAAGA GTTCGCCCAT CTGCAAACGT TGTTACCCAA ACCACCGGCA
CATCATCCGC ATTATGCGTT TCGCTTTATC GATCTATTTG CCGGAATTGG CGGCATCCGT
CGCGGTTTTG AATCGATTGG CGGACAATGC GTGTTTACCA GCGAATGGAA CAAACATGCG
GTACGCACTT ATAAAGCCAA TCATTATTGC GACCCGGCGA CGCATCATTT TAATGAAGAT
ATCCGCGACA TCACCCTCAG CCATAAAGAA GGCGTGAGTG ATGAGGCGGC GGCGGAACAT
ATTCGTAAAC ACATTCCTGA ACACGATGTT TTACTGGCCG GTTTCCCTTG TCAGCCATTT
TCGCTGGCTG GCGTATCGAA AAAGAACTCG CTCGGGCGGG CGCACGGTTT TGCCTGCGAT
ACTCAGGGCA CGCTGTTCTT TGATGTGGTG CGCATTATCG ACGCGCGTCG TCCGGCGATG
TTTGTGCTCG AAAACGTCAA AAACCTGAAA AGCCACGACC AGGGTAAAAC GTTCCGCATC
ATCATGCAGA CGCTGGACGA ACTGGGCTAT GACGTGGCTG ATGCAGAAGA TAACGGGCCG
GACGATCCGA AAATCATCGA TGGTAAACAT TTTCTGCCGC AGCACCGTGA ACGCATCGTG
CTGGTGGGTT TTCGTCGCGA TCTTAATCTG AAAGCCGATT TTACTCTGCG TGATATCAGC
GAATGTTTCC CTGCACAGCG AGTGACGCTG GCGCAGCTGC TGGACCCGAT GGTCGAGGCG
AAATATATCC TGACGCCGGT GCTGTGGAAG TACCTCTATC GTTATGCGAA AAAACATCAG
GCGCGCGGTA ACGGCTTCGG TTATGGAATG GTCTATCCTA ACAATCCGCA AAGCGTCACG
CGTACGCTGT CTGCGCGTTA TTACAAAGAT GGCGCGGAAA TTTTAATCGA TCGCGGCTGG
GATATGGCCA CGGGTGAGAA AGACTTTGAC GATCCGCTGA ATCAGCAACA TCGTCCACGT
CGGTTAACGC CTCGGGAATG CGCGCGCTTA ATGGGTTTTG AAGCGCCGGG AGAAGCGAAA
TTCCGCATTC CGGTTTCGGA CACTCAGGCC TATCGCCAGT TCGGTAACTC GGTGGTCGTG
CCGGTCTTTG CCGCGGTGGC AAAACTGCTT GAGCCAAAAA TCAAACAGGC GGTGGCGTTG
CGTCAGCAAG AGGCACAACA TGGCCGACGT TCACGATAA
 
Protein sequence
MQENISVTDS YSTGNAAQAM LEKLLQIYDV KTLVAQLNGV GENHWSAAIL KRALANDSAW 
HRLSEKEFAH LQTLLPKPPA HHPHYAFRFI DLFAGIGGIR RGFESIGGQC VFTSEWNKHA
VRTYKANHYC DPATHHFNED IRDITLSHKE GVSDEAAAEH IRKHIPEHDV LLAGFPCQPF
SLAGVSKKNS LGRAHGFACD TQGTLFFDVV RIIDARRPAM FVLENVKNLK SHDQGKTFRI
IMQTLDELGY DVADAEDNGP DDPKIIDGKH FLPQHRERIV LVGFRRDLNL KADFTLRDIS
ECFPAQRVTL AQLLDPMVEA KYILTPVLWK YLYRYAKKHQ ARGNGFGYGM VYPNNPQSVT
RTLSARYYKD GAEILIDRGW DMATGEKDFD DPLNQQHRPR RLTPRECARL MGFEAPGEAK
FRIPVSDTQA YRQFGNSVVV PVFAAVAKLL EPKIKQAVAL RQQEAQHGRR SR