Gene EcHS_A2063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2063 
Symboldcm 
ID5594136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2050706 
End bp2052124 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content52% 
IMG OID640921204 
ProductDNA cytosine methylase 
Protein accessionYP_001458748 
Protein GI157161430 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.00482699 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGAAA ATATATCAGT AACCGATTCA TACAGCACCG GGAATGCCGC ACAGGCAATG 
CTGGAGAAAC TGCTGCAAAT TTATGATGTT AAAACGTTGG TGGCGCAGCT TAATGGCGTA
GGTGAGAATC ACTGGAGCGC GGCAATTTTA AAACGTGCGC TGGCGAATGA CTCGGCATGG
CACCGTTTAA GTGAGAAAGA GTTCGCCCAT CTGCAAACGT TATTACCCAA ACCACCGGCA
CATCATCCGC ATTATGCGTT TCGCTTTATC GATCTATTCG CCGGAATTGG CGGCATCCGT
CGCGGTTTTG AATCGATTGG CGGACAGTGC GTGTTTACCA GCGAATGGAA CAAACATGCG
GTACGCACTT ATAAAGCCAA CCATTATTGC GATCCGGCGA CGCATCATTT TAATGAAGAT
ATCCGCGACA TCACCCTCAG CCATAAAGAA GGCGTGAGTG ATGAGGCGGC GGCGGAACAT
ATTCGTCAAC ACATTCCTGA ACACGATGTT TTACTGGCCG GTTTCCCTTG TCAGCCATTT
TCGCTGGCTG GCGTATCGAA AAAGAACTCG CTCGGGCGGG CGCACGGTTT TGCCTGCGAT
ACCCAGGGCA CGCTGTTTTT TGATGTGGTA CGCATTATCG ACGCGCGTCG TCCGGCGATG
TTTGTGCTCG AAAACGTCAA AAACCTGAAA AGTCACGACC AGGGTAAAAC GTTCCGCATC
ATCATGCAGA CGCTGGACGA ACTGGGCTAT GACGTGGCTG ATGCAGAAGA TAATGGGCCA
GACGATCCGA AAATCATCGA CGGCAAACAT TTTCTGCCGC AGCACCGTGA ACGCATCGTG
CTGGTGGGTT TTCGTCGCGA TCTGAATCTG AAAGCCGATT TTACCCTGCG TGATATCAGC
GAATGTTTCC CTGCGCAGCG AGTGACGCTG GCGCAGCTGT TGGACCCGAT GGTCGAGGCG
AAATATATCC TGACGCCGGT GCTGTGGAAG TACCTCTATC GATATGCGAA AAAACATCAG
ACGCGCGGTA ACGGCTTCGG TTATGGAATG GTTTATCCGA ACAATCCGCA AAGCGTCACG
CGTACGCTGT CTGCGCGTTA TTACAAAGAT GGCGCGGAAA TTTTAATCGA TCGCGGCTGG
GATATGGCCA AAGGTGAGAA AGACTTTGAC GATCCGCAGA ATCAGCAACA TCGTCCACGT
CGGTTAACGC CTCGGGAATG CGCGCGCTTA ATGGGTTTTG AAGCGCCGGG AGAAGCGAAA
TTCCGCATTC CGGTTTCGGA CACTCAGGCC TATCGCCAGT TCGGTAACTC GGTGGTCGTG
CCGGTCTTTG CCGCGGTGGC AAAACTGCTT GAGCCAAAAA TCAAACAGGC GGTGGCGTTG
CGTCAGCAAG AGGCACAACA TGGCCGACGT TCACGATAA
 
Protein sequence
MQENISVTDS YSTGNAAQAM LEKLLQIYDV KTLVAQLNGV GENHWSAAIL KRALANDSAW 
HRLSEKEFAH LQTLLPKPPA HHPHYAFRFI DLFAGIGGIR RGFESIGGQC VFTSEWNKHA
VRTYKANHYC DPATHHFNED IRDITLSHKE GVSDEAAAEH IRQHIPEHDV LLAGFPCQPF
SLAGVSKKNS LGRAHGFACD TQGTLFFDVV RIIDARRPAM FVLENVKNLK SHDQGKTFRI
IMQTLDELGY DVADAEDNGP DDPKIIDGKH FLPQHRERIV LVGFRRDLNL KADFTLRDIS
ECFPAQRVTL AQLLDPMVEA KYILTPVLWK YLYRYAKKHQ TRGNGFGYGM VYPNNPQSVT
RTLSARYYKD GAEILIDRGW DMAKGEKDFD DPQNQQHRPR RLTPRECARL MGFEAPGEAK
FRIPVSDTQA YRQFGNSVVV PVFAAVAKLL EPKIKQAVAL RQQEAQHGRR SR