Gene EcDH1_2032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2032 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2191890 
End bp2193293 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content54% 
IMG OID 
Productfumarate hydratase, class II 
Protein accessionACX39689 
Protein GI260449267 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.000761073 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACAG TACGCAGCGA AAAAGATTCG ATGGGGGCGA TTGATGTCCC GGCAGATAAG 
CTGTGGGGCG CACAAACTCA ACGCTCGCTG GAGCATTTCC GCATTTCGAC GGAGAAAATG
CCCACCTCAC TGATTCATGC GCTGGCGCTA ACCAAGCGTG CAGCGGCAAA AGTTAATGAA
GATTTAGGCT TGTTGTCTGA AGAGAAAGCG AGCGCCATTC GTCAGGCGGC GGATGAAGTA
CTGGCAGGAC AGCATGACGA CGAATTCCCG CTGGCTATCT GGCAGACCGG CTCCGGCACG
CAAAGTAACA TGAACATGAA CGAAGTGCTG GCTAACCGGG CCAGTGAATT ACTCGGCGGT
GTGCGCGGGA TGGAACGTAA AGTTCACCCT AACGACGACG TGAACAAAAG CCAAAGTTCC
AACGATGTCT TTCCGACGGC GATGCACGTT GCGGCGCTGC TGGCGCTGCG CAAGCAACTC
ATTCCTCAGC TTAAAACCCT GACACAGACA CTGAATGAGA AATCCCGTGC TTTTGCCGAT
ATCGTCAAAA TTGGTCGTAC TCACTTGCAG GATGCCACGC CGTTAACGCT GGGGCAGGAG
ATTTCCGGCT GGGTAGCGAT GCTCGAGCAT AATCTCAAAC ATATCGAATA CAGCCTGCCT
CACGTAGCGG AACTGGCTCT TGGCGGTACA GCGGTGGGTA CTGGACTAAA TACCCATCCG
GAGTATGCGC GTCGCGTAGC AGATGAACTG GCAGTCATTA CCTGTGCACC GTTTGTTACC
GCGCCGAACA AATTTGAAGC GCTGGCGACC TGTGATGCCC TGGTTCAGGC GCACGGCGCG
TTGAAAGGGT TGGCTGCGTC ACTGATGAAA ATCGCCAATG ATGTCCGCTG GCTGGCCTCT
GGCCCGCGCT GCGGAATTGG TGAAATCTCA ATCCCGGAAA ATGAGCCGGG CAGCTCAATC
ATGCCGGGGA AAGTGAACCC AACACAGTGT GAGGCATTAA CCATGCTCTG CTGTCAGGTG
ATGGGGAACG ACGTGGCGAT CAACATGGGG GGCGCTTCCG GTAACTTTGA ACTGAACGTC
TTCCGTCCAA TGGTGATCCA CAATTTCCTG CAATCGGTGC GCTTGCTGGC AGATGGCATG
GAAAGTTTTA ACAAACACTG CGCAGTGGGT ATTGAACCGA ATCGTGAGCG AATCAATCAA
TTACTCAATG AATCGCTGAT GCTGGTGACT GCGCTTAACA CCCACATTGG TTATGACAAA
GCCGCCGAGA TCGCCAAAAA AGCGCATAAA GAAGGGCTGA CCTTAAAAGC TGCGGCCCTT
GCGCTGGGGT ATCTTAGCGA AGCCGAGTTT GACAGCTGGG TACGGCCAGA ACAGATGGTC
GGCAGTATGA AAGCCGGGCG TTAA
 
Protein sequence
MNTVRSEKDS MGAIDVPADK LWGAQTQRSL EHFRISTEKM PTSLIHALAL TKRAAAKVNE 
DLGLLSEEKA SAIRQAADEV LAGQHDDEFP LAIWQTGSGT QSNMNMNEVL ANRASELLGG
VRGMERKVHP NDDVNKSQSS NDVFPTAMHV AALLALRKQL IPQLKTLTQT LNEKSRAFAD
IVKIGRTHLQ DATPLTLGQE ISGWVAMLEH NLKHIEYSLP HVAELALGGT AVGTGLNTHP
EYARRVADEL AVITCAPFVT APNKFEALAT CDALVQAHGA LKGLAASLMK IANDVRWLAS
GPRCGIGEIS IPENEPGSSI MPGKVNPTQC EALTMLCCQV MGNDVAINMG GASGNFELNV
FRPMVIHNFL QSVRLLADGM ESFNKHCAVG IEPNRERINQ LLNESLMLVT ALNTHIGYDK
AAEIAKKAHK EGLTLKAAAL ALGYLSEAEF DSWVRPEQMV GSMKAGR