Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_3649 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 3931567 |
End bp | 3932961 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | |
Product | restriction modification system DNA specificity domain protein |
Protein accession | ACX41261 |
Protein GI | 260450839 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCGG GGAAATTGCC GGAGGGGTGG GTTATCGCCC CAGTATCTAC GGTCACAACT CTAATCCGAG GAGTAACGTA TAAAAAAGAG CAGGCAATAA ATTATCTAAA AGATGATTAT TTGCCTCTTA TCCGTGCGAA CAATATTCAG AATGGCAAGT TTGATACTAC GGACTTGGTT TTTGTTCCTA AAAATCTTGT TAAAGAAAGT CAAAAAATAT CTCCTGAAGA TATTGTTATT GCAATGTCAT CAGGGAGCAA ATCCGTAGTT GGTAAATCCG CACATCAGCA TCTACCATTT GAATGTAGTT TCGGCGCATT TTGCGGTGTA TTACGTCCTG AAAAACTTAT ATTTTCTGGT TTTATTGCTC ATTTCACAAA ATCTTCTCTT TATCGAAACA AAATTTCATC ACTTTCTGCT GGTGCAAATA TTAATAATAT TAAGCCGGCA AGCTTTGATT TGATAAATAT ACCAATCCCA CCACTTGCCG AACAAAAAAT CATCGCTGAA AAACTCGATA CGCTGCTGGC GCAGGTAGAC AGCACCAAAG CACGTTTTGA GCAAATCCCA CAAATCCTGA AACGTTTTCG TCAAGCGGTA TTGGGGGGCG CAGTTAATGG AAAATTGACA GAAAAATGGC GTAATTTTGA GCCGCAACAT TCTGTATTTA AGAAGTTAAA TTTTGAATCT ATCTTAACTG AATTACGTAA TGGTCTTTCA TCAAAGCCAA ATGAAAGTGG TGTTGGTCAT CCAATACTAC GCATTAGTTC TGTACGTGCT GGCCATGTAG ATCAAAACGA TATTCGGTTT CTAGAATGTT CAGAAAGTGA ACTAAACCGC CACAAATTAC AAGATGGAGA TCTTTTATTT ACTCGCTATA ACGGAAGTTT AGAATTTGTT GGTGTTTGTG GGTTATTGAA AAAATTACAA CATCAAAATT TGCTATATCC TGATAAACTT ATTCGAGCTC GATTAACCAA AGATGCTTTA CCAGAATATA TCGAAATATT TTTTTCATCC CCCTCAGCAC GAAATGCAAT GATGAACTGC GTGAAAACAA CTTCTGGTCA AAAAGGTATT TCAGGAAAAG ATATCAAATC CCAAGTTGTT TTATTACCTC CAGTAAAAGA ACAAGCCGAA ATCGTTCGCC GCGTCGAGCA ACTCTTCGCC TACGCCGACA CCATAGAAAA ACAGGTCAAC AACGCCTTAG CCCGCGTCAA CAACCTGACG CAATCCATCC TGGCAAAAGC GTTCCGTGGT GAACTTACCG CCCAGTGGCG GGCCGAAAAC CCGGATTTGA TCAGCGGAGA AAACAGCGCC GCCGCGTTGC TGGAAAAAAT CAAAGCTGAA CGCGCAGCTA GCGGGGGTAA AAAAGCCTCA CGTAAAAAAT CCTGA
|
Protein sequence | MSAGKLPEGW VIAPVSTVTT LIRGVTYKKE QAINYLKDDY LPLIRANNIQ NGKFDTTDLV FVPKNLVKES QKISPEDIVI AMSSGSKSVV GKSAHQHLPF ECSFGAFCGV LRPEKLIFSG FIAHFTKSSL YRNKISSLSA GANINNIKPA SFDLINIPIP PLAEQKIIAE KLDTLLAQVD STKARFEQIP QILKRFRQAV LGGAVNGKLT EKWRNFEPQH SVFKKLNFES ILTELRNGLS SKPNESGVGH PILRISSVRA GHVDQNDIRF LECSESELNR HKLQDGDLLF TRYNGSLEFV GVCGLLKKLQ HQNLLYPDKL IRARLTKDAL PEYIEIFFSS PSARNAMMNC VKTTSGQKGI SGKDIKSQVV LLPPVKEQAE IVRRVEQLFA YADTIEKQVN NALARVNNLT QSILAKAFRG ELTAQWRAEN PDLISGENSA AALLEKIKAE RAASGGKKAS RKKS
|
| |