Gene EcDH1_1688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1688 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1845738 
End bp1847156 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content52% 
IMG OID 
ProductDNA-cytosine methyltransferase 
Protein accessionACX39349 
Protein GI260448927 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.105701 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGAAA ATATATCAGT AACCGATTCA TACAGCACCG GGAATGCCGC ACAGGCAATG 
CTGGAGAAAC TGCTGCAAAT TTATGATGTT AAAACGTTGG TGGCGCAGCT TAATGGTGTA
GGTGAGAATC ACTGGAGCGC GGCAATTTTA AAACGTGCGC TGGCGAATGA CTCGGCATGG
CACCGTTTAA GTGAGAAAGA GTTCGCCCAT CTGCAAACGT TATTACCCAA ACCACCGGCA
CATCATCCGC ATTATGCGTT TCGCTTTATC GATCTATTCG CCGGAATTGG CGGCATCCGT
CGCGGTTTTG AATCGATTGG CGGACAGTGC GTGTTTACCA GCGAATGGAA CAAACATGCG
GTACGCACTT ATAAAGCCAA CCATTATTGC GATCCGGCGA CGCATCATTT TAATGAAGAT
ATCCGCGACA TCACCCTCAG CCATAAAGAA GGCGTGAGTG ATGAGGCGGC GGCGGAACAT
ATTCGTCAAC ACATTCCTGA ACACGATGTT TTACTGGCCG GTTTCCCTTG TCAGCCATTT
TCGCTGGCTG GCGTATCGAA AAAGAACTCG CTCGGGCGGG CGCACGGTTT TGCCTGCGAT
ACCCAGGGCA CGCTGTTTTT TGATGTGGTA CGCATTATCG ACGCGCGTCG TCCGGCGATG
TTTGTGCTCG AAAACGTCAA AAACCTGAAA AGTCACGACC AGGGTAAAAC GTTCCGCATC
ATCATGCAGA CGCTGGACGA ACTGGGCTAT GACGTGGCTG ATGCAGAAGA TAATGGGCCA
GACGATCCGA AAATCATCGA CGGCAAACAT TTTCTGCCGC AGCACCGTGA ACGCATCGTG
CTGGTGGGTT TTCGTCGCGA TCTGAATCTG AAAGCCGATT TTACCCTGCG TGATATCAGC
GAATGTTTCC CTGCGCAGCG AGTGACGCTG GCGCAGCTGT TGGACCCGAT GGTCGAGGCG
AAATATATCC TGACGCCGGT GCTGTGGAAG TACCTCTATC GATATGCGAA AAAACATCAG
GCGCGCGGTA ACGGCTTCGG TTATGGAATG GTTTATCCGA ACAATCCGCA AAGCGTCACG
CGTACGCTGT CTGCGCGTTA TTACAAAGAT GGCGCGGAAA TTTTAATCGA TCGCGGCTGG
GATATGGCCA CGGGTGAGAA AGACTTTGAC GATCCGCTGA ATCAGCAACA TCGTCCACGT
CGGTTAACGC CTCGGGAATG CGCGCGCTTA ATGGGTTTTG AAGCGCCGGG AGAAGCGAAA
TTCCGTATTC CGGTTTCGGA CACTCAGGCC TATCGCCAGT TCGGTAACTC GGTGGTCGTG
CCGGTCTTTG CCGCGGTGGC AAAACTGCTT GAGCCAAAAA TCAAACAGGC GGTGGCGTTG
CGTCAGCAAG AGGCACAACA TGGCCGACGT TCACGATAA
 
Protein sequence
MQENISVTDS YSTGNAAQAM LEKLLQIYDV KTLVAQLNGV GENHWSAAIL KRALANDSAW 
HRLSEKEFAH LQTLLPKPPA HHPHYAFRFI DLFAGIGGIR RGFESIGGQC VFTSEWNKHA
VRTYKANHYC DPATHHFNED IRDITLSHKE GVSDEAAAEH IRQHIPEHDV LLAGFPCQPF
SLAGVSKKNS LGRAHGFACD TQGTLFFDVV RIIDARRPAM FVLENVKNLK SHDQGKTFRI
IMQTLDELGY DVADAEDNGP DDPKIIDGKH FLPQHRERIV LVGFRRDLNL KADFTLRDIS
ECFPAQRVTL AQLLDPMVEA KYILTPVLWK YLYRYAKKHQ ARGNGFGYGM VYPNNPQSVT
RTLSARYYKD GAEILIDRGW DMATGEKDFD DPLNQQHRPR RLTPRECARL MGFEAPGEAK
FRIPVSDTQA YRQFGNSVVV PVFAAVAKLL EPKIKQAVAL RQQEAQHGRR SR