Gene EcDH1_4134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_4134 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4481978 
End bp4483309 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content54% 
IMG OID 
Productpeptidase M24 
Protein accessionACX41734 
Protein GI260451312 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.689702 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCAC TGGCCTCGCT CTATAAAAAT CATATAGCTA CCTTACAAGA ACGGACTCGC 
GATGCGCTGG CGCGCTTCAA GCTGGATGCG TTACTTATTC ACTCCGGCGA GCTGTTCAAT
GTTTTTCTCG ACGATCATCC CTATCCGTTT AAAGTGAACC CGCAATTCAA AGCGTGGGTG
CCGGTAACTC AGGTGCCAAA CTGCTGGTTG CTGGTGGATG GCGTGAACAA GCCGAAACTG
TGGTTCTATC TGCCGGTTGA TTACTGGCAC AACGTCGAAC CGCTGCCGAC CTCTTTCTGG
ACTGAAGATG TGGAAGTGAT CGCGCTGCCG AAAGCCGATG GCATTGGTAG CCTGCTGCCT
GCTGCGCGCG GCAATATCGG TTATATCGGT CCGGTGCCGG AACGTGCGCT GCAACTGGGT
ATTGAGGCCA GCAACATCAA CCCGAAAGGG GTTATCGACT ACCTGCATTA CTATCGCTCC
TTCAAAACTG AGTACGAACT GGCCTGTATG CGTGAAGCGC AGAAAATGGC GGTCAACGGT
CACCGCGCGG CAGAAGAAGC GTTCCGTTCT GGCATGAGCG AGTTCGATAT CAACATTGCC
TATCTGACCG CGACCGGTCA TCGTGATACC GACGTACCTT ACAGCAACAT TGTGGCGCTT
AACGAACACG CTGCGGTACT GCATTACACC AAACTGGACC ATCAGGCACC GGAAGAGATG
CGCAGCTTCC TGCTGGATGC CGGGGCAGAA TATAACGGCT ATGCGGCTGA CCTGACCCGT
ACCTGGTCGG CAAAAAGTGA CAACGACTAC GCGCAGCTGG TGAAGGACGT TAATGATGAA
CAACTGGCGC TGATCGCCAC CATGAAAGCA GGCGTCAGCT ATGTGGATTA CCACATCCAG
TTCCATCAGC GCATCGCCAA ACTGCTGCGT AAACATCAAA TCATCACCGA TATGAGTGAA
GAAGCGATGG TCGAAAACGA TCTTACCGGG CCGTTTATGC CGCATGGTAT CGGCCATCCG
CTGGGCCTGC AGGTGCATGA CGTCGCTGGT TTTATGCAGG ATGATAGCGG TACGCACCTC
GCGGCACCGG CAAAATATCC GTACCTGCGC TGCACCCGTA TTCTCCAGCC GGGCATGGTG
TTAACCATCG AACCGGGTAT CTACTTCATC GAATCGCTAC TGGCACCGTG GCGTGAAGGG
CAGTTCAGCA AGCACTTCAA CTGGCAGAAA ATTGAAGCAC TGAAACCGTT CGGCGGCATT
CGTATCGAAG ACAACGTGGT GATCCACGAA AACAACGTGG AAAACATGAC CCGGGATCTG
AAACTGGCGT GA
 
Protein sequence
MESLASLYKN HIATLQERTR DALARFKLDA LLIHSGELFN VFLDDHPYPF KVNPQFKAWV 
PVTQVPNCWL LVDGVNKPKL WFYLPVDYWH NVEPLPTSFW TEDVEVIALP KADGIGSLLP
AARGNIGYIG PVPERALQLG IEASNINPKG VIDYLHYYRS FKTEYELACM REAQKMAVNG
HRAAEEAFRS GMSEFDINIA YLTATGHRDT DVPYSNIVAL NEHAAVLHYT KLDHQAPEEM
RSFLLDAGAE YNGYAADLTR TWSAKSDNDY AQLVKDVNDE QLALIATMKA GVSYVDYHIQ
FHQRIAKLLR KHQIITDMSE EAMVENDLTG PFMPHGIGHP LGLQVHDVAG FMQDDSGTHL
AAPAKYPYLR CTRILQPGMV LTIEPGIYFI ESLLAPWREG QFSKHFNWQK IEALKPFGGI
RIEDNVVIHE NNVENMTRDL KLA