Gene EcDH1_1473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1473 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1594467 
End bp1596227 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content53% 
IMG OID 
Producttype III restriction protein res subunit 
Protein accessionACX39143 
Protein GI260448721 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value8.7246e-07 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTTTA CACTTCGCCC ATATCAGCAA GAAGCCGTGG ATGCCACGCT CAACCATTTT 
CGTCGTCATA AAACCCCTGC CGTTATCGTG CTGCCCACCG GCGCAGGTAA AAGCCTGGTG
ATAGCGGAAC TGGCACGGCT GGCTCGTGGT CGCGTGCTGG TGCTGGCACA CGTTAAAGAA
CTGGTGGCGC AAAACCATGC AAAGTATCAG GCGCTGGGGC TGGAAGCCGA TATTTTTGCC
GCCGGGCTAA AGCGCAAAGA GAGCCACGGT AAAGTGGTAT TTGGCAGCGT GCAGTCTGTC
GCCCGTAATC TTGATGCCTT TCAGGGTGAA TTTTCGCTGT TGATTGTCGA TGAATGTCAC
CGTATTGGTG ACGATGAAGA GAGCCAGTAT CAGCAAATCC TCACTCACCT GACAAAAGTG
AATCCCCACT TACGCCTGCT GGGGCTGACT GCCACGCCTT TTCGATTGGG CAAAGGCTGG
ATCTACCAGT TTCATTATCA CGGCATGGTA CGCGGCGATG AGAAAGCCCT TTTCCGTGAC
TGCATTTATG AGCTGCCGCT GCGTTATATG ATTAAACACG GCTATCTGAC GCCGCCAGAA
CGACTGGATA TGCCAGTAGT GCAATACGAT TTCAGCCGCT TGCAGGCACA GAGTAACGGG
CTGTTCAGCG AAGCCGATCT CAACCGTGAG CTGAAAAAAC AACAACGTAT TACCCCGCAC
ATCATCAGCC AGATTATGGA GTTTGCTGCA ACGCGCAAAG GGGTGATGAT TTTTGCCGCG
ACGGTTGAAC ACGCAAAAGA GATTGTGGGA TTACTGCCTG CCGAAGATGC AGCACTGATT
ACTGGCGACA CCCCCGGCGC TGAGCGCGAT GTGTTAATTG AAAATTTTAA AGCCCAGCGT
TTTCGCTATC TGGTCAACGT CGCGGTACTG ACCACCGGAT TTGACGCCCC GCACGTCGAT
CTTATCGCCA TTCTGCGCCC TACCGAATCA GTGAGTCTTT ACCAACAAAT TGTCGGGCGC
GGTCTGCGTC TCGCTCCGGG CAAGACTGAT TGCTTAATTC TTGATTATGC GGGTAATCCT
CACGATCTCT ACGCGCCGGA AGTTGGTACA CCAAAAGGCA AAAGTGACAA CGTTCCGGTA
CAGGTTTTCT GCCCTGCCTG CGGTTTTGCC AACACCTTTT GGGGGAAAAC GACCGCCGAC
GGGACATTGA TTGAACACTT TGGTCGTCGC TGTCAGGGAT GGTTTGAAGA TGACGACGGT
CATCGCGAAC AATGTGACTT CCGTTTCCGT TTTAAAAATT GCCCGCAATG TAACGCGGAA
AACGATATTG CCGCCCGCCG CTGCCGCGAA TGTGACACCG TACTGGTTGA TCCGGACGAT
ATGTTAAAAG CGGCGCTACG ACTGAAAGAC GCGCTGGTAT TACGCTGTAG CGGCATGTCT
TTGCAACATG GGCACGACGA GAAAGGCGAA TGGTTGAAAA TCACCTATTA CGATGAAGAC
GGCGCGGATG TGAGTGAGCG TTTCCGTCTG CAAACACCTG CCCAGCGTAC CGCCTTCGAG
CAGCTTTTTA TCCGCCCGCA TACGCGCACA CCGGGCATCC CGCTGCGCTG GATCACCGCC
GCCGATATCC TCGCCCAGCA AGCCTTATTG CGACACCCGG ATTTTGTCGT CGCCCGCATG
AAAGGCCAGT ACTGGCAGGT GCGTGAAAAA GTGTTCGATT ACGAAGGTCG TTTTCGTCTG
GCGCACGAAT TACGCGGTTA A
 
Protein sequence
MIFTLRPYQQ EAVDATLNHF RRHKTPAVIV LPTGAGKSLV IAELARLARG RVLVLAHVKE 
LVAQNHAKYQ ALGLEADIFA AGLKRKESHG KVVFGSVQSV ARNLDAFQGE FSLLIVDECH
RIGDDEESQY QQILTHLTKV NPHLRLLGLT ATPFRLGKGW IYQFHYHGMV RGDEKALFRD
CIYELPLRYM IKHGYLTPPE RLDMPVVQYD FSRLQAQSNG LFSEADLNRE LKKQQRITPH
IISQIMEFAA TRKGVMIFAA TVEHAKEIVG LLPAEDAALI TGDTPGAERD VLIENFKAQR
FRYLVNVAVL TTGFDAPHVD LIAILRPTES VSLYQQIVGR GLRLAPGKTD CLILDYAGNP
HDLYAPEVGT PKGKSDNVPV QVFCPACGFA NTFWGKTTAD GTLIEHFGRR CQGWFEDDDG
HREQCDFRFR FKNCPQCNAE NDIAARRCRE CDTVLVDPDD MLKAALRLKD ALVLRCSGMS
LQHGHDEKGE WLKITYYDED GADVSERFRL QTPAQRTAFE QLFIRPHTRT PGIPLRWITA
ADILAQQALL RHPDFVVARM KGQYWQVREK VFDYEGRFRL AHELRG