Gene Information |
Locus tag | EcDH1_2022 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 2179298 |
End bp | 2180326 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | transcriptional regulator, LacI family |
Protein accession | ACX39679 |
Protein GI | 260449257 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.287863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTACCG CCAAAAAAAT AACCATTCAT GATGTTGCGC TGGCTGCGGG CGTGTCGGTA AGTACCGTTT CGCTGGTGCT TAGTGGCAAA GGGCGAATCT CTACCGCCAC AGGAGAACGC GTTAACGCCG CCATTGAAGA GCTGGGATTT GTGCGCAATC GCCAGGCGTC GGCGCTGCGC GGCGGGCAAA GCGGCGTCAT TGGTTTGATC GTCCGTGATT TATCTGCGCC GTTTTACGCC GAATTGACGG CCGGATTGAC GGAAGCTCTG GAAGCGCAGG GACGGATGGT TTTTTTGCTT CACGGCGGTA AAGACGGTGA GCAGCTGGCA CAGCGGTTTT CACTGTTACT GAATCAGGGT GTCGATGGTG TGGTAATTGC CGGAGCTGCA GGAAGTAGCG ATGACCTGCG ACGGATGGCA GAAGAAAAAG CTATCCCGGT GATTTTCGCT TCCCGTGCCA GTTATCTTGA TGATGTTGAT ACGGTTCGCC CGGACAACAT GCAGGCTGCA CAGTTGTTGA CGGAGCATCT CATTCGCAAT GGGCATCAGC GGATCGCCTG GCTGGGAGGG CAAAGTTCCT CATTAACCCG TGCAGAACGG GTTGGGGGCT ATTGTGCAAC TCTACTAAAA TTTGGCCTGC CGTTTCATAG CGATTGGGTG TTGGAGTGCA CTTCCAGCCA GAAGCAAGCC GCGGAAGCTA TCACGGCGCT TTTACGTCAT AACCCGACCA TCAGTGCCGT GGTTTGCTAT AACGAAACTA TTGCGATGGG GGCATGGTTT GGTTTGCTCA AAGCAGGGCG GCAAAGCGGG GAAAGCGGAG TCGATCGTTA CTTTGAGCAA CAGGTTTCGC TGGCGGCATT TACCGATGCG ACACCAACCA CACTTGATGA TATACCTGTT ACCTGGGCCA GCACGCCAGC GCGGGAACTT GGTATCACAC TTGCGGATCG CATGATGCAA AAAATCACCC ATGAAGAGAC GCATTCACGC AATCTTATTA TTCCCGCCCG GCTCATTGCA GCGAAATAA
|
Protein sequence | MATAKKITIH DVALAAGVSV STVSLVLSGK GRISTATGER VNAAIEELGF VRNRQASALR GGQSGVIGLI VRDLSAPFYA ELTAGLTEAL EAQGRMVFLL HGGKDGEQLA QRFSLLLNQG VDGVVIAGAA GSSDDLRRMA EEKAIPVIFA SRASYLDDVD TVRPDNMQAA QLLTEHLIRN GHQRIAWLGG QSSSLTRAER VGGYCATLLK FGLPFHSDWV LECTSSQKQA AEAITALLRH NPTISAVVCY NETIAMGAWF GLLKAGRQSG ESGVDRYFEQ QVSLAAFTDA TPTTLDDIPV TWASTPAREL GITLADRMMQ KITHEETHSR NLIIPARLIA AK
|
| |
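The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the database: the helper names (`gene_length`, `protein_length`, `gc_fraction`) are hypothetical. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the terminal stop codon (the gene sequence above ends in TAA).

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Amino-acid count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_fraction(seq):
    """Fraction of G and C among the A/C/G/T characters of seq; spaces are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# Values copied from the Gene Information fields of this record.
length_bp = gene_length(2179298, 2180326)
print(length_bp)                  # compare with "Gene Length"
print(protein_length(length_bp))  # compare with "Protein Length"
```

Passing the space-separated string from the Gene sequence field to `gc_fraction` gives a value to compare against the reported GC content, which the record rounds to a whole percent.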