Gene EcDH1_2760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2760 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2955579 
End bp2957855 
Gene Length2277 bp 
Protein Length758 aa 
Translation table11 
GC content53% 
IMG OID 
ProductATP-dependent Clp protease, ATP-binding subunit clpA 
Protein accessionACX40393 
Protein GI260449971 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00164003 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCAATC AAGAACTGGA ACTCAGTTTA AATATGGCTT TCGCCAGAGC GCGCGAGCAC 
CGTCATGAGT TTATGACCGT CGAGCACTTG TTACTGGCGC TGCTCAGTAA CCCATCTGCC
CGGGAGGCGC TGGAAGCGTG TTCTGTGGAT TTGGTTGCGC TCCGTCAGGA ACTGGAAGCC
TTTATTGAAC AAACCACACC CGTTCTGCCT GCCAGTGAAG AGGAGCGCGA CACACAGCCG
ACGCTGAGTT TTCAGCGTGT ACTGCAACGT GCGGTCTTCC ATGTCCAGTC CTCCGGTCGC
AATGAGGTTA CCGGTGCAAA CGTTCTGGTC GCTATCTTTA GCGAACAGGA GTCGCAGGCG
GCATATCTGT TGCGTAAACA TGAAGTCAGC CGTCTCGATG TGGTGAACTT TATCTCTCAT
GGCACGCGTA AAGACGAGCC GACACAGTCT TCTGATCCTG GCAGCCAGCC AAACAGCGAA
GAACAAGCTG GTGGGGAGGA ACGTATGGAG AATTTCACGA CGAACCTGAA TCAGCTTGCG
CGCGTGGGCG GAATCGACCC ACTGATTGGT CGTGAGAAGG AGCTGGAGCG TGCTATTCAG
GTTCTCTGCC GTCGCCGTAA AAACAACCCG CTGCTGGTGG GGGAATCTGG TGTCGGTAAA
ACCGCGATTG CGGAAGGTCT TGCCTGGCGA ATTGTTCAGG GCGATGTGCC GGAAGTGATG
GCTGACTGTA CGATTTACTC TCTCGATATC GGTTCTCTGT TAGCGGGCAC AAAATATCGC
GGCGACTTTG AAAAACGTTT TAAAGCGTTG CTCAAGCAGC TGGAGCAGGA CACTAACAGC
ATCCTGTTTA TTGATGAGAT CCACACCATT ATCGGTGCGG GTGCAGCGTC TGGTGGTCAG
GTCGATGCGG CTAACCTAAT CAAACCGTTG CTCTCCAGCG GTAAAATTCG TGTAATTGGT
TCGACAACCT ATCAGGAGTT CAGCAACATT TTCGAGAAAG ACCGTGCTCT GGCGCGTCGC
TTCCAGAAAA TTGATATTAC TGAACCGTCG ATCGAAGAAA CTGTTCAAAT CATCAATGGC
CTGAAACCGA AGTATGAAGC GCACCACGAC GTGCGTTATA CCGCAAAAGC GGTGCGTGCG
GCGGTAGAGC TGGCGGTGAA ATACATTAAC GATCGTCATC TGCCGGATAA AGCCATTGAT
GTTATCGACG AAGCGGGCGC TCGCGCACGC CTGATGCCGG TAAGCAAACG CAAGAAAACC
GTTAATGTGG CGGATATTGA GTCCGTGGTG GCCCGTATTG CACGCATTCC AGAGAAGAGT
GTTTCTCAGA GTGATCGTGA TACCCTGAAA AACCTCGGCG ATCGCTTGAA AATGCTGGTC
TTCGGTCAGG ATAAAGCCAT TGAGGCGCTG ACTGAAGCCA TTAAGATGGC GCGTGCAGGT
TTAGGTCACG AACATAAACC GGTTGGTTCG TTCCTGTTTG CCGGCCCTAC CGGGGTCGGG
AAAACAGAGG TGACGGTACA GCTTTCGAAA GCTTTGGGCA TTGAGCTTCT GCGCTTTGAT
ATGTCCGAGT ATATGGAACG CCATACCGTC AGCCGTCTTA TTGGTGCGCC TCCGGGATAC
GTTGGTTTTG ATCAGGGCGG TTTGCTGACT GATGCGGTCA TCAAGCATCC ACATGCGGTG
CTGCTGCTGG ACGAAATCGA GAAAGCGCAC CCGGACGTGT TCAATATTCT GTTGCAGGTG
ATGGATAACG GTACGCTGAC CGATAACAAC GGACGCAAAG CAGACTTCCG TAACGTGGTG
CTGGTGATGA CCACCAACGC CGGGGTACGG GAAACTGAGC GCAAATCCAT TGGTCTTATC
CACCAGGATA ACAGCACCGA TGCGATGGAA GAGATCAAGA AGATCTTTAC ACCGGAATTC
CGTAACCGTC TCGACAACAT TATCTGGTTT GATCATCTGT CAACCGACGT GATCCATCAG
GTGGTGGATA AATTCATCGT CGAGTTGCAG GTTCAGCTGG ATCAGAAAGG TGTTTCTCTG
GAAGTGAGCC AGGAAGCGCG TAACTGGCTG GCCGAGAAAG GTTACGACCG GGCAATGGGC
GCTCGTCCGA TGGCGCGTGT CATCCAGGAC AACCTGAAAA AACCGCTCGC CAACGAACTG
CTGTTTGGTT CGCTGGTGGA CGGCGGTCAG GTCACCGTCG CGCTGGATAA AGAGAAAAAT
GAGCTGACTT ACGGATTCCA GAGTGCACAA AAGCACAAGG CGGAAGCAGC GCATTAA
 
Protein sequence
MLNQELELSL NMAFARAREH RHEFMTVEHL LLALLSNPSA REALEACSVD LVALRQELEA 
FIEQTTPVLP ASEEERDTQP TLSFQRVLQR AVFHVQSSGR NEVTGANVLV AIFSEQESQA
AYLLRKHEVS RLDVVNFISH GTRKDEPTQS SDPGSQPNSE EQAGGEERME NFTTNLNQLA
RVGGIDPLIG REKELERAIQ VLCRRRKNNP LLVGESGVGK TAIAEGLAWR IVQGDVPEVM
ADCTIYSLDI GSLLAGTKYR GDFEKRFKAL LKQLEQDTNS ILFIDEIHTI IGAGAASGGQ
VDAANLIKPL LSSGKIRVIG STTYQEFSNI FEKDRALARR FQKIDITEPS IEETVQIING
LKPKYEAHHD VRYTAKAVRA AVELAVKYIN DRHLPDKAID VIDEAGARAR LMPVSKRKKT
VNVADIESVV ARIARIPEKS VSQSDRDTLK NLGDRLKMLV FGQDKAIEAL TEAIKMARAG
LGHEHKPVGS FLFAGPTGVG KTEVTVQLSK ALGIELLRFD MSEYMERHTV SRLIGAPPGY
VGFDQGGLLT DAVIKHPHAV LLLDEIEKAH PDVFNILLQV MDNGTLTDNN GRKADFRNVV
LVMTTNAGVR ETERKSIGLI HQDNSTDAME EIKKIFTPEF RNRLDNIIWF DHLSTDVIHQ
VVDKFIVELQ VQLDQKGVSL EVSQEARNWL AEKGYDRAMG ARPMARVIQD NLKKPLANEL
LFGSLVDGGQ VTVALDKEKN ELTYGFQSAQ KHKAEAAH