Gene EcDH1_2688 details


Gene Information

Locus tag: EcDH1_2688
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 2862820
End bp: 2864580
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: putative ATP-dependent protease
Protein accession: ACX40321
Protein GI: 260449899
COG category:
COG ID:
TIGRFAM ID:
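
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the gene ends with a single stop codon; the record states neither explicitly. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(2862820, 2864580)
print(length_bp)                  # 1761
print(protein_length(length_bp))  # 586
```

Both results agree with the Gene Length (1761 bp) and Protein Length (586 aa) fields of this record.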


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000000256121
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGACCATTA CGAAACTTGC ATGGCGTGAC CTGGTTCCTG ATACCGATAG CTATCAGGAA 
ATATTTGCTC AGCCACATTT GATTGACGAA AACGATCCTT TATTCAGTGA TACTCAACCG
CGGCTGCAAT TTGCGCTGGA GCAGTTGCTG CATACGCGAG CATCCTCCTC TTTTATGCTG
GCGAAGGCCC CGGAAGAGTC TGAGTATCTG AATCTTATTG CCAATGCCGC GCGTACGCTA
CAAAGCGATG CAGGCCAACT GGTGGGCGGT CACTATGAGG TTTCCGGCCA CTCCATCCGC
TTACGTCACG CAGTGAGTGC AGATGATAAT TTTGCGACTT TAACGCAAGT TGTCGCTGCC
GACTGGGTAG AAGCGGAGCA ACTCTTTGGC TGCCTGCGCC AGTTTAATGG CGACATTACC
CTGCAGCCTG GTCTGGTGCA TCAGGCAAAT GGCGGTATTC TCATTATCTC TTTGCGTACA
CTGCTGGCGC AACCTCTGCT GTGGATGCGG CTGAAAAATA TCGTTAACCG CGAGCGTTTT
GACTGGGTTG CGTTTGATGA GTCGCGCCCT CTCCCCGTCT CTGTGCCTTC GATGCCATTG
AAGCTGAAAG TCATTCTGGT AGGCGAACGC GAATCATTGG CTGATTTCCA GGAGATGGAG
CCAGAGCTTT CAGAGCAGGC TATTTATAGC GAATTTGAAG ATACTCTGCA GATTGTCGAT
GCGGAGTCAG TAACCCAGTG GTGTCGCTGG GTGACATTTA CCGCCAGACA TAATCACTTA
CCTGCACCGG GAGCGGATGC CTGGCCGATA CTTATCCGCG AAGCAGCACG CTACACCGGT
GAACAAGAAA CACTTCCGCT TAGCCCGCAG TGGATCCTCC GCCAGTGTAA AGAGGTCGCC
TCCCTGTGTG ATGGCGACAC CTTCTCCGGC GAGCAGCTAA ACTTAATGCT GCAGCAGCGT
GAATGGCGCG AAGGTTTCCT CGCTGAACGT ATGCAGGATG AGATCCTTCA GGAGCAAATC
CTGATTGAAA CCGAAGGCGA ACGCATCGGG CAAATTAACG CCCTTTCGGT CATTGAATTT
CCGGGTCATC CACGCGCTTT TGGCGAACCT TCTCGCATTA GCTGCGTTGT GCATATTGGC
GATGGTGAAT TCACCGACAT CGAACGCAAA GCGGAGCTTG GCGGCAATAT CCATGCGAAA
GGGATGATGA TCATGCAAGC GTTCCTGATG TCGGAACTAC AGCTTGAGCA ACAGATCCCC
TTCTCAGCAT CGCTGACATT TGAGCAGTCA TACAGTGAAG TTGATGGAGA TAGTGCCTCG
ATGGCTGAAC TCTGCGCCCT GATAAGCGCC CTCGCCGATG TGCCGGTGAA TCAGAGTATC
GCTATCACAG GTTCAGTCGA TCAGTTCGGT CGCGCCCAGC CGGTCGGTGG TTTAAATGAG
AAAATCGAAG GCTTCTTTGC TATTTGCCAG CAACGTGAGT TAACCGGGAA ACAAGGTGTC
ATTATCCCCA CAGCTAACGT TCGCCATTTA AGTCTTCACA GTGAACTGGT GAAAGCGGTA
GAAGAAGGCA AATTCACCAT CTGGGCAGTA GACGATGTGA CTGACGCACT GCCGTTATTA
TTAAATCTGG TGTGGGATGG CGAAGGCCAA ACGACGCTGA TGCAAACCAT CCAGGAACGT
ATCGCGCAAG CATCGCAACA GGAAGGACGT CACCGTTTTC CATGGCCATT ACGTTGGCTG
AACTGGTTTA TTCCGAACTG A
 
Protein sequence
MTITKLAWRD LVPDTDSYQE IFAQPHLIDE NDPLFSDTQP RLQFALEQLL HTRASSSFML 
AKAPEESEYL NLIANAARTL QSDAGQLVGG HYEVSGHSIR LRHAVSADDN FATLTQVVAA
DWVEAEQLFG CLRQFNGDIT LQPGLVHQAN GGILIISLRT LLAQPLLWMR LKNIVNRERF
DWVAFDESRP LPVSVPSMPL KLKVILVGER ESLADFQEME PELSEQAIYS EFEDTLQIVD
AESVTQWCRW VTFTARHNHL PAPGADAWPI LIREAARYTG EQETLPLSPQ WILRQCKEVA
SLCDGDTFSG EQLNLMLQQR EWREGFLAER MQDEILQEQI LIETEGERIG QINALSVIEF
PGHPRAFGEP SRISCVVHIG DGEFTDIERK AELGGNIHAK GMMIMQAFLM SELQLEQQIP
FSASLTFEQS YSEVDGDSAS MAELCALISA LADVPVNQSI AITGSVDQFG RAQPVGGLNE
KIEGFFAICQ QRELTGKQGV IIPTANVRHL SLHSELVKAV EEGKFTIWAV DDVTDALPLL
LNLVWDGEGQ TTLMQTIQER IAQASQQEGR HRFPWPLRWL NWFIPN
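
The gene sequence begins with TTG while the protein sequence begins with M. That is consistent with translation table 11, where TTG can serve as an initiation codon and is then read as methionine. The following is a minimal sketch of such a translation, not the tool that produced this record. The start-codon set is taken from the NCBI definition of table 11 as an assumption, and `translate_cds` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments in TCAG codon order; table 11 shares these
# with the standard genetic code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed initiation codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
fragment = "TTGACCATTA CGAAACTTGC ATGGCGTGAC CTGGTTCCTG ATACCGATAG CTATCAGGAA"
print(translate_cds(fragment))  # MTITKLAWRDLVPDTDSYQE
```

The output matches the first 20 residues of the protein sequence listed above.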