Gene EcDH1_3648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3648 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3929981 
End bp3931570 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content55% 
IMG OID 
ProductN-6 DNA methylase 
Protein accessionACX41260 
Protein GI260450838 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAATA ACGATCTGGT CGCGAAGCTG TGGAAGCTGT GCGACAACCT GCGCGATGGC 
GGCGTTTCCT ATCAAAACTA CGTCAATGAA CTCGCCTCGC TGCTGTTTTT GAAAATGTGT
AAAGAGACCG GTCAGGAAGC GGAATACCTG CCGGAAGGTT ACCGCTGGGA TGACCTGAAA
TCCCGCATCG GCCAGGAGCA GTTGCAGTTC TACCGAAAAA TGCTCGTGCA TTTAGGCGAA
GATGACAAAA AGCTGGTACA GGCAGTTTTT CATAATGTTA GTACCACCAT CACCGAGCCG
AAACAAATAA CCGCACTGGT CAGCAATATG GATTCGCTGG ACTGGTACAA CGGCGCGCAC
GGTAAGTCGC GCGATGACTT CGGCGATATG TACGAAGGGC TGTTGCAGAA GAACGCGAAT
GAAACCAAGT CTGGTGCAGG CCAGTACTTC ACCCCGCGTC CGCTGATTAA AACCATTATT
CATCTGCTGA AACCGCAGCC GCGTGAAGTG GTGCAGGACC CGGCGGCAGG TACGGCGGGC
TTTTTGATTG AAGCCGACCG CTATGTTAAG TCGCAAACCA ATGATCTGGA CGACCTTGAT
GGCGACACGC AGGATTTCCA GATCCACCGC GCGTTTATCG GCCTCGAACT GGTGCCCGGC
ACCCGTCGTC TGGCACTGAT GAACTGCCTG CTGCACGATA TTGAAGGCAA CCTCGACCAC
GGCGGCGCAA TCCGTCTGGG CAACACTCTG GGTAGCGACG GTGAAAACCT GCCGAAGGCG
CATATTGTCG CCACTAACCC GCCGTTTGGC AGCGCCGCAG GCACCAACAT TACCCGCACC
TTTGTTCACC CGACCAGCAA CAAACAGTTG TGCTTTATGC AGCATATTAT CGAAACGCTG
CATCCCGGCG GTCGTGCGGC GGTGGTGGTG CCGGATAACG TGCTGTTTGA AGGCGGCAAA
GGCACCGACA TTCGTCGTGA CCTGATGGAT AAGTGTCATC TGCACACCAT TCTGCGTCTG
CCGACCGGTA TTTTTTACGC TCAGGGCGTG AAGACCAACG TGCTGTTCTT TACCAAAGGG
ACGGTGGCGA ACCCGAATCA GGATAAGAAC TGTACCGATG ATGTGTGGGT GTATGACCTG
CGTACCAATA TGCCGAGTTT CGGCAAGCGC ACACCGTTTA CCGACGAGCA TTTGCAGCCG
TTTGAGCGCG TGTATGGCGA AGACCCGCAC GGTTTAAGCC CGCGCACTGA AGGTGAATGG
AGTTTTAACG CCGAAGAGAC GGAAGTTGCC GACAGCGAAG AGAACAAAAA CACCGACCAG
CATCTTGCTA CCAGCCGCTG GCGCAAGTTC AGCCGTGAGT GGATCCGCAC CGCAAAATCC
GATTCGCTGG ATATCTCCTG GCTGAAAGAT AAAGACAGTA TTGATGCCGA CAGCCTGCCG
GAGCCGGATG TATTAGCGGC AGAAGCGATG GGCGAACTGG TACAGGCGCT GTCTGAACTG
GATGCGCTGA TGCGTGAACT GGGGGCGAGC GATGAGGCCG ATTTGCAGCG TCAGTTGCTG
GAAGAAGCGT TTGGTGGGGT GAAGGAATGA
 
Protein sequence
MNNNDLVAKL WKLCDNLRDG GVSYQNYVNE LASLLFLKMC KETGQEAEYL PEGYRWDDLK 
SRIGQEQLQF YRKMLVHLGE DDKKLVQAVF HNVSTTITEP KQITALVSNM DSLDWYNGAH
GKSRDDFGDM YEGLLQKNAN ETKSGAGQYF TPRPLIKTII HLLKPQPREV VQDPAAGTAG
FLIEADRYVK SQTNDLDDLD GDTQDFQIHR AFIGLELVPG TRRLALMNCL LHDIEGNLDH
GGAIRLGNTL GSDGENLPKA HIVATNPPFG SAAGTNITRT FVHPTSNKQL CFMQHIIETL
HPGGRAAVVV PDNVLFEGGK GTDIRRDLMD KCHLHTILRL PTGIFYAQGV KTNVLFFTKG
TVANPNQDKN CTDDVWVYDL RTNMPSFGKR TPFTDEHLQP FERVYGEDPH GLSPRTEGEW
SFNAEETEVA DSEENKNTDQ HLATSRWRKF SREWIRTAKS DSLDISWLKD KDSIDADSLP
EPDVLAAEAM GELVQALSEL DALMRELGAS DEADLQRQLL EEAFGGVKE