Gene EcDH1_1940 details


Gene Information

Locus tag: EcDH1_1940
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 2091366
End bp: 2093744
Gene Length: 2379 bp
Protein Length: 792 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: phosphoenolpyruvate synthase
Protein accession: ACX39597
Protein GI: 260449175
COG category:
COG ID:
TIGRFAM ID:
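
The coordinate and length fields above agree with each other: 2093744 - 2091366 + 1 = 2379 bp, and 2379 / 3 = 793 codons, one of which is the stop codon, leaving 792 aa. A minimal sketch of that check follows. It assumes the start and end positions are 1-based and inclusive, which is what the reported gene length implies. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(2091366, 2093744)
print(length, protein_length(length))  # prints "2379 792"
```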


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.887116
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGTCCAACA ATGGCTCGTC ACCGCTGGTG CTTTGGTATA ACCAACTCGG CATGAATGAT 
GTAGACAGGG TTGGGGGCAA AAATGCCTCC CTGGGTGAAA TGATTACTAA TCTTTCCGGA
ATGGGTGTTT CCGTTCCGAA TGGTTTCGCC ACAACCGCCG ACGCGTTTAA CCAGTTTCTG
GACCAAAGCG GCGTAAACCA GCGCATTTAT GAACTGCTGG ATAAAACGGA TATTGACGAT
GTTACTCAGC TTGCGAAAGC GGGCGCGCAA ATCCGCCAGT GGATTATCGA CACTCCCTTC
CAGCCTGAGC TGGAAAACGC CATCCGCGAA GCCTATGCAC AGCTTTCCGC CGATGACGAA
AACGCCTCTT TTGCGGTGCG CTCCTCCGCC ACCGCAGAAG ATATGCCGGA CGCTTCTTTT
GCCGGTCAGC AGGAAACCTT CCTCAACGTT CAGGGTTTTG ACGCCGTTCT CGTGGCAGTG
AAACATGTAT TTGCTTCTCT GTTTAACGAT CGCGCCATCT CTTATCGTGT GCACCAGGGT
TACGATCACC GTGGTGTGGC GCTCTCCGCC GGTGTTCAAC GGATGGTGCG CTCTGACCTC
GCATCATCTG GCGTGATGTT CTCCATTGAT ACCGAATCCG GCTTTGACCA GGTGGTGTTT
ATCACTTCCG CATGGGGCCT TGGTGAGATG GTCGTGCAGG GTGCGGTTAA CCCGGATGAG
TTTTACGTGC ATAAACCGAC ACTGGCGGCG AATCGCCCGG CTATCGTGCG CCGCACCATG
GGGTCGAAAA AAATCCGCAT GGTTTACGCG CCGACCCAGG AGCACGGCAA GCAGGTTAAA
ATCGAAGACG TACCGCAGGA ACAGCGTGAC ATCTTCTCGC TGACCAACGA AGAAGTGCAG
GAACTGGCAA AACAGGCCGT ACAAATTGAG AAACACTACG GTCGCCCGAT GGATATTGAG
TGGGCGAAAG ATGGCCACAC CGGTAAACTG TTCATTGTGC AGGCGCGTCC GGAAACCGTG
CGCTCACGCG GTCAGGTCAT GGAGCGTTAT ACGCTGCATT CACAGGGTAA GATTATCGCC
GAAGGCCGTG CTATCGGTCA TCGCATCGGT GCGGGTCCGG TGAAAGTCAT CCATGACATC
AGCGAAATGA ACCGCATCGA ACCTGGCGAC GTGCTGGTTA CTGACATGAC CGACCCGGAC
TGGGAACCGA TCATGAAGAA AGCATCTGCC ATCGTCACCA ACCGTGGCGG TCGTACCTGT
CACGCGGCGA TCATCGCTCG TGAACTGGGC ATTCCGGCGG TAGTGGGCTG TGGAGATGCA
ACAGAACGGA TGAAAGACGG TGAGAACGTC ACTGTTTCTT GTGCCGAAGG TGATACCGGT
TACGTCTATG CGGAGTTGCT GGAATTTAGC GTGAAAAGCT CCAGCGTAGA AACGATGCCG
GATCTGCCGT TGAAAGTGAT GATGAACGTC GGTAACCCGG ACCGTGCTTT CGACTTCGCC
TGCCTACCGA ACGAAGGCGT GGGCCTTGCG CGTCTGGAAT TTATCATCAA CCGTATGATT
GGCGTCCACC CACGCGCACT GCTTGAGTTT GACGATCAGG AACCGCAGTT GCAAAACGAA
ATCCGCGAGA TGATGAAAGG TTTTGATTCT CCGCGTGAAT TTTACGTTGG TCGTCTGACT
GAAGGGATCG CGACGCTGGG TGCCGCGTTT TATCCGAAGC GCGTCATTGT CCGTCTCTCT
GATTTTAAAT CGAACGAATA TGCCAACCTG GTCGGTGGTG AGCGTTACGA GCCAGATGAA
GAGAACCCGA TGCTCGGCTT CCGTGGCGCG GGCCGCTATG TTTCCGACAG CTTCCGCGAC
TGTTTCGCGC TGGAGTGTGA AGCAGTGAAA CGTGTGCGCA ACGACATGGG ACTGACCAAC
GTTGAGATCA TGATCCCGTT CGTGCGTACC GTAGATCAGG CGAAAGCGGT GGTTGAAGAA
CTGGCGCGTC AGGGGCTGAA ACGTGGCGAG AACGGGCTGA AAATCATCAT GATGTGTGAA
ATCCCGTCCA ACGCCTTGCT GGCCGAGCAG TTCCTCGAAT ATTTCGACGG CTTCTCAATT
GGCTCAAACG ATATGACGCA GCTGGCGCTC GGTCTGGACC GTGACTCCGG CGTGGTGTCT
GAATTGTTCG ATGAGCGCAA CGATGCGGTG AAAGCACTGC TGTCGATGGC TATCCGTGCC
GCGAAGAAAC AGGGCAAATA TGTCGGGATT TGCGGTCAGG GTCCGTCCGA CCACGAAGAC
TTTGCCGCAT GGTTGATGGA AGAGGGGATC GATAGCCTGT CTCTGAACCC GGACACCGTG
GTGCAAACCT GGTTAAGCCT GGCTGAACTG AAGAAATAA
 
Protein sequence
MSNNGSSPLV LWYNQLGMND VDRVGGKNAS LGEMITNLSG MGVSVPNGFA TTADAFNQFL 
DQSGVNQRIY ELLDKTDIDD VTQLAKAGAQ IRQWIIDTPF QPELENAIRE AYAQLSADDE
NASFAVRSSA TAEDMPDASF AGQQETFLNV QGFDAVLVAV KHVFASLFND RAISYRVHQG
YDHRGVALSA GVQRMVRSDL ASSGVMFSID TESGFDQVVF ITSAWGLGEM VVQGAVNPDE
FYVHKPTLAA NRPAIVRRTM GSKKIRMVYA PTQEHGKQVK IEDVPQEQRD IFSLTNEEVQ
ELAKQAVQIE KHYGRPMDIE WAKDGHTGKL FIVQARPETV RSRGQVMERY TLHSQGKIIA
EGRAIGHRIG AGPVKVIHDI SEMNRIEPGD VLVTDMTDPD WEPIMKKASA IVTNRGGRTC
HAAIIARELG IPAVVGCGDA TERMKDGENV TVSCAEGDTG YVYAELLEFS VKSSSVETMP
DLPLKVMMNV GNPDRAFDFA CLPNEGVGLA RLEFIINRMI GVHPRALLEF DDQEPQLQNE
IREMMKGFDS PREFYVGRLT EGIATLGAAF YPKRVIVRLS DFKSNEYANL VGGERYEPDE
ENPMLGFRGA GRYVSDSFRD CFALECEAVK RVRNDMGLTN VEIMIPFVRT VDQAKAVVEE
LARQGLKRGE NGLKIIMMCE IPSNALLAEQ FLEYFDGFSI GSNDMTQLAL GLDRDSGVVS
ELFDERNDAV KALLSMAIRA AKKQGKYVGI CGQGPSDHED FAAWLMEEGI DSLSLNPDTV
VQTWLSLAEL KK
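
The protein sequence follows from the gene sequence by translating it codon by codon with translation table 11. A minimal sketch of that translation, plus a GC-content helper, is given below under a few assumptions. The codon table is built from the standard amino acid assignments, which table 11 shares with table 1; the two tables differ only in which codons may act as start codons. The sketch does not handle alternative start codons, which is harmless here because this gene starts with ATG. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: these assignments apply to table 11, which differs from table 1
# only in its set of permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks and last nine bases of the gene sequence above.
print(translate("ATGTCCAACA ATGGCTCGTC"))  # prints "MSNNGS"
print(translate("AAGAAATAA"))              # prints "KK"
```

Passing the full gene sequence to `translate` and `gc_percent` is one way to compare the record's protein sequence and GC content fields against the nucleotide sequence.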