Gene ECH74115_2320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2320 
SymbolfumC 
ID6970722 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2191281 
End bp2192684 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content54% 
IMG OID643386199 
Productfumarate hydratase 
Protein accessionYP_002270683 
Protein GI209396648 
COG category[C] Energy production and conversion 
COG ID[COG0114] Fumarase 
TIGRFAM ID[TIGR00979] fumarate hydratase, class II 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.109369 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACAG TACGCAGCGA AAAAGATTCG ATGGGGGCGA TTGATGTCCC GGCAGATAAG 
CTGTGGGGCG CACAAACTCA ACGCTCGCTG GAGCATTTCC GCATTTCGAC GGAGAAAATG
CCCACCTCAC TGATTCATGC GCTGGCGCTA ACCAAGCGCG CAGCGGCAAA AGTTAATGAA
GATTTAGGCT TGTTGTCTGA AGAGAAAGCG AGCGCCATTC GGCAGGCGGC GGATGAAGTA
CTGGCAGGAC AGCATGACGA CGAATTCCCG CTGGCTATCT GGCAGACCGG CTCCGGCACG
CAAAGTAACA TGAACATGAA TGAAGTGCTG GCTAACCGGG CCAGTGAATT ACTCGGCGGC
GTGCGCGGGA TGGAACGTAA AGTTCACCCT AACGACGACG TGAACAAAAG CCAAAGTTCC
AACGATGTCT TTCCGACGGC GATGCACGTT GCGGCGCTGC TGGCGCTGCG CAAGCAACTC
ATTCCGCAGC TTAAAACCCT GACACAGACG CTGAGTGAAA AATCGCGTGC ATTTGCCGAT
ATCGTAAAAA TCGGTCGAAC CCACTTGCAG GACGCCACCC CGCTAACACT GGGGCAGGAG
ATTTCCGGCT GGGTAGCGAT GCTCGAGCAT AATCTCAAAC ATATCGAATA CAGCCTGCCT
CACGTAGCGG AACTGGCTCT TGGCGGTACA GCGGTGGGTA CTGGACTAAA TACCCATCCG
GAGTATGCGC GTCGCGTAGC AGATGAACTG GCAGTCATTA CCTGTGCACC GTTTGTTACC
GCGCCGAACA AATTTGAAGC GCTGGCGACC TGTGATGCCC TGGTTCAGGC GCACGGCGCG
TTGAAAGGGT TGGCTGCGTC ACTGATGAAA ATCGCCAATG ATGTCCGCTG GCTGGCCTCT
GGCCCGCGCT GCGGAATTGG TGAAATCTCA ATCCCGGAAA ATGAGCCGGG CAGCTCAATC
ATGCCGGGGA AAGTGAACCC AACACAGTGT GAGGCATTAA CCATGCTCTG CTGTCAGGTG
ATGGGGAACG ACGTGGCGAT CAACATGGGG GGCGCTTCCG GTAACTTTGA ACTGAACGTC
TTCCGTCCAA TGGTGATCCA CAATTTCCTG CAATCGGTGC GCTTGCTGGC AGATGGCATG
GAAAGTTTTA ACAAACACTG CGCAGTGGGT ATTGAACCGA ATCGTGAGCG AATCAATCAA
TTACTCAATG AATCGCTGAT GCTGGTGACT GCGCTTAACA CCCACATTGG TTATGACAAA
GCCGCGGAGA TCGCCAAAAA AGCGCATAAA GAAGGGCTGA CCTTAAAAGC TGCGGCCCTT
GCGCTGGGGT ATCTTAGCGA AGCCGAGTTT GACAGCTGGG TACGGCCAGA ACAGATGGTC
GGCAGTATGA AAGCCGGGCG TTAA
 
Protein sequence
MNTVRSEKDS MGAIDVPADK LWGAQTQRSL EHFRISTEKM PTSLIHALAL TKRAAAKVNE 
DLGLLSEEKA SAIRQAADEV LAGQHDDEFP LAIWQTGSGT QSNMNMNEVL ANRASELLGG
VRGMERKVHP NDDVNKSQSS NDVFPTAMHV AALLALRKQL IPQLKTLTQT LSEKSRAFAD
IVKIGRTHLQ DATPLTLGQE ISGWVAMLEH NLKHIEYSLP HVAELALGGT AVGTGLNTHP
EYARRVADEL AVITCAPFVT APNKFEALAT CDALVQAHGA LKGLAASLMK IANDVRWLAS
GPRCGIGEIS IPENEPGSSI MPGKVNPTQC EALTMLCCQV MGNDVAINMG GASGNFELNV
FRPMVIHNFL QSVRLLADGM ESFNKHCAVG IEPNRERINQ LLNESLMLVT ALNTHIGYDK
AAEIAKKAHK EGLTLKAAAL ALGYLSEAEF DSWVRPEQMV GSMKAGR