Gene EcHS_A1686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1686 
SymbolfumC 
ID5591273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1707319 
End bp1708722 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content54% 
IMG OID640920834 
Productfumarate hydratase 
Protein accessionYP_001458390 
Protein GI157161072 
COG category[C] Energy production and conversion 
COG ID[COG0114] Fumarase 
TIGRFAM ID[TIGR00979] fumarate hydratase, class II 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value0.569967 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACAG TACGCAGCGA AAAAGATTCG ATGGGGGCGA TTGATGTCCC GGCAGATAAG 
CTGTGGGGCG CACAAACTCA ACGCTCGCTG GAGCATTTCC GCATTTCGAC GGAGAAAATG
CCCACCTCAC TGATTCATGC GCTGGCGCTA ACCAAGCGCG CAGCGGCAAA AGTTAATGAA
GATTTAGGCT TGTTGTCTGA AGAGAAAGCG AGCGCCATTC GGCAGGCGGC GGATGAAGTA
CTGGCAGGAC AGCATGACGA CGAATTCCCG CTGGCTATCT GGCAGACCGG CTCCGGCACG
CAAAGTAACA TGAACATGAA CGAAGTGCTG GCTAACCGGG CCAGTGAATT ACTCGGCGGC
GTGCGCGGGA TGGAACGTAA AGTTCACCCT AACGACGACG TGAACAAAAG CCAAAGTTCC
AACGATGTCT TTCCGACGGC GATGCACGTT GCGGCGCTGC TGGCGCTGCG CAAGCAACTC
ATTCCGCAGC TTAAAACCCT GACACAGACG CTGAGTGAAA AATCGCGTGC ATTTGCCGAT
ATCGTCAAAA TCGGTCGAAC CCACTTGCAG GACGCCACGC CGCTAACACT GGGGCAGGAG
ATTTCCGGCT GGGTAGCGAT GCTCGAGTAT AATCTCAAAC ATATCGAATA CAGCCTGCCT
CACGTAGCGG AACTGGCTCT TGGCGGTACA GCGGTGGGTA CTGGACTAAA TACCCATCCG
GAGTATGCGC GTCGCGTAGC AGATGAACTG GCAGTCATTA CCTGTGCACC GTTTGTTACC
GCGCCGAACA AATTTGAAGC GCTGGCGACC TGTGATGCTC TGGTTCAGGC GCACGGCGCG
TTGAAAGGGT TGGCTGCGTC ACTGATGAAA ATCGCCAATG ATGTCCGCTG GCTGGCCTCT
GGCCCGCGCT GCGGAATTGG TGAAATCTCA ATCCCGGAAA ATGAGCCGGG CAGCTCAATC
ATGCCGGGGA AAGTGAATCC AACACAGTGT GAGGCATTAA CCATGCTCTG CTGTCAGGTG
ATGGGGAACG ACGTGGCGAT CAACATGGGG GGCGCTTCCG GTAACTTTGA ACTGAACGTC
TTCCGTCCAA TGGTGATCCA CAATTTCCTG CAATCGGTGC GCTTGCTGGC AGATGGCATG
GAAAGTTTTA ACAAACACTG CGCAGTGGGT ATTGAACCGA ATCGTGAGCG AATCAATCAA
TTACTCAATG AATCGCTGAT GCTGGTGACT GCGCTTAACA CCCACATTGG TTATGACAAA
GCCGCGGAGA TCGCCAAAAA AGCGCATAAA GAAGGGCTGA CCTTAAAAGC TGCGGCCCTT
GGGCTGGGGT ATCTTAGCGA AGCCGAGTTT GACAGCTGGG TACGGCCAGA ACAGATGGTC
GGCAGTATGA AAGCCGGGCG TTAA
 
Protein sequence
MNTVRSEKDS MGAIDVPADK LWGAQTQRSL EHFRISTEKM PTSLIHALAL TKRAAAKVNE 
DLGLLSEEKA SAIRQAADEV LAGQHDDEFP LAIWQTGSGT QSNMNMNEVL ANRASELLGG
VRGMERKVHP NDDVNKSQSS NDVFPTAMHV AALLALRKQL IPQLKTLTQT LSEKSRAFAD
IVKIGRTHLQ DATPLTLGQE ISGWVAMLEY NLKHIEYSLP HVAELALGGT AVGTGLNTHP
EYARRVADEL AVITCAPFVT APNKFEALAT CDALVQAHGA LKGLAASLMK IANDVRWLAS
GPRCGIGEIS IPENEPGSSI MPGKVNPTQC EALTMLCCQV MGNDVAINMG GASGNFELNV
FRPMVIHNFL QSVRLLADGM ESFNKHCAVG IEPNRERINQ LLNESLMLVT ALNTHIGYDK
AAEIAKKAHK EGLTLKAAAL GLGYLSEAEF DSWVRPEQMV GSMKAGR