Gene EcSMS35_1589 details


Gene Information

Locus tag: EcSMS35_1589
Symbol: fumC
ID: 6146349
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1575901
End bp: 1577304
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 54%
IMG OID: 641616465
Product: fumarate hydratase
Protein accession: YP_001743643
Protein GI: 170681319
COG category: [C] Energy production and conversion
COG ID: [COG0114] Fumarase
TIGRFAM ID: [TIGR00979] fumarate hydratase, class II
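
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1575901
END_BP = 1577304
GENE_LENGTH_BP = 1404
PROTEIN_LENGTH_AA = 467


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1577304 - 1575901 + 1 gives 1404 bp, and 1404 / 3 - 1 gives 467 residues, which agrees with the listed gene and protein lengths.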


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.194502
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATACAG TACGCAGCGA AAAAGATTCG ATGGGGGCGA TTGATGTCCC GGCAGATAAG 
CTGTGGGGCG CACAAACTCA ACGCTCGCTG GAGCATTTCC GCATTTCGAC GGAGAAAATG
CCCACCTCAC TGATTCATGC GCTGGCACTA ACCAAGCGCG CAGCGGCAAA AGTTAATGAA
GATTTAGGCT TGTTGTCTGA AGAGAAAGCG AGCGCCATTC GTCAGGCGGC GGATGAAGTA
CTGGCAGGAC AGCATGACGA CGAATTCCCG CTGGCTATCT GGCAGACCGG CTCCGGCACG
CAAAGTAATA TGAACATGAA CGAAGTGCTG GCTAATCGGG CCAGTGAATT ACTTGGCGGC
GTGCGCGGGA TGGAGCGTAA AGTTCACCCT AACGACGACG TGAACAAAAG CCAAAGTTCC
AATGATGTCT TTCCGACGGC GATGCACGTT GCGGCGCTAC TGGCGCTGCG CAAGCAACTC
ATTCCGCAGC TTAAAACCCT GACACAGACA CTGAATGAGA AATCCCGTGC ATTTGCCGAT
ATCGTCAAAA TCGGTCGAAC CCACTTGCAG GATGCCACGC CGCTAACACT GGGGCAGGAG
ATTTCCGGCT GGGTAGCGAT GCTCGAGCAT AATCTCAAAC ATATCGAATA CAGCCTGCCG
CATGTAGCGG AACTGGCTCT TGGCGGTACA GCGGTGGGTA CTGGACTAAA TACCCATCCG
GAGTATGCGC GTCGAGTAGC AGATGAACTG GCAGTCATTA CCTGCGCTCC GTTTGTTACC
GCGCCGAACA AATTTGAAGC GCTGGCGACC TGTGATGCCC TGGTTCAGGC GCACGGCGCG
TTGAAAGGGT TGGCTGCGTC ACTGATGAAA ATTGCCAATG ATGTCCGCTG GCTGGCCTCT
GGCCCGCGCT GTGGAATTGG TGAAATCGCA ATCCCGGAAA ATGAGCCGGG CAGCTCAATC
ATGTCGGGGA AAGTGAACCC AACCCAGTGT GAGGCATTAA CCATGCTTTG CTGTCAGGTG
ATGGGGAACG ACGTGGCGAT CAACATGGGT GGCGCTTCCG GTAACTTTGA ACTGAACGTC
TTCCGTCCGA TGGTGATTCA TAATTTCCTG CAATCGGTGC GCTTGCTGGC AGATGGCATG
GAAAGTTTCA ACAAACACTG TGCAGTGGGC ATTGAACCGA ATCGTGAGCG AATCAATCAA
TTACTCAATG AATCGCTGAT GCTGGTGACT GCGCTTAACA CCCACATTGG TTATGACAAA
GCCGCCGAGA TCGCCAAAAA AGCGCATAAA GAAGGGCTGA CCTTAAAAGC TGCGGCCCTT
GCGCTGGGGT ATCTTAGCGA AGCCGAGTTT GACAGCTGGG TACGGCCAGA ACAGATGGTC
GGCAGTATGA AAGCCGGGGG TTAA
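
The gene sequence is laid out in blocks of ten bases, so the whitespace has to be removed before any computation. As an illustration, here is a small sketch of how a GC percentage such as the one listed under Gene Information could be computed from this layout. The function name `gc_percent` is hypothetical, and the rounding used by the database is not stated in the record.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, pasted with its original spacing.
    first_line = ("ATGAATACAG TACGCAGCGA AAAAGATTCG "
                  "ATGGGGGCGA TTGATGTCCC GGCAGATAAG")
    print(f"{gc_percent(first_line):.1f}%")
```

Pasting all lines of the gene sequence into one string gives the value for the whole gene.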
 
Protein sequence
MNTVRSEKDS MGAIDVPADK LWGAQTQRSL EHFRISTEKM PTSLIHALAL TKRAAAKVNE 
DLGLLSEEKA SAIRQAADEV LAGQHDDEFP LAIWQTGSGT QSNMNMNEVL ANRASELLGG
VRGMERKVHP NDDVNKSQSS NDVFPTAMHV AALLALRKQL IPQLKTLTQT LNEKSRAFAD
IVKIGRTHLQ DATPLTLGQE ISGWVAMLEH NLKHIEYSLP HVAELALGGT AVGTGLNTHP
EYARRVADEL AVITCAPFVT APNKFEALAT CDALVQAHGA LKGLAASLMK IANDVRWLAS
GPRCGIGEIA IPENEPGSSI MSGKVNPTQC EALTMLCCQV MGNDVAINMG GASGNFELNV
FRPMVIHNFL QSVRLLADGM ESFNKHCAVG IEPNRERINQ LLNESLMLVT ALNTHIGYDK
AAEIAKKAHK EGLTLKAAAL ALGYLSEAEF DSWVRPEQMV GSMKAGG