Gene EcSMS35_1587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1587 
SymbolfumA 
ID6146518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1574112 
End bp1575758 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content51% 
IMG OID641616464 
Productfumarate hydratase 
Protein accessionYP_001743642 
Protein GI170682150 
COG category[C] Energy production and conversion 
COG ID[COG1838] Tartrate dehydratase beta subunit/Fumarate hydratase class I, C-terminal domain
[COG1951] Tartrate dehydratase alpha subunit/Fumarate hydratase class I, N-terminal domain 
TIGRFAM ID[TIGR00722] hydro-lyases, Fe-S type, tartrate/fumarate subfamily, alpha region
[TIGR00723] hydro-lyases, Fe-S type, tartrate/fumarate subfamily, beta region 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.596697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAACA AACCCTTTCA TTATCAGGCT CCTTTTCCAC TCAAAAAAGA TGATACTGAG 
TATTACCTGC TAACCAGCGA ACACGTTAGC GTATCTGAAT TTGAAGGGCA GGAGATTTTG
AAAGTCGCAC CCGAAGCGTT AACTCTGTTG GCGCGTCAAG CGTTTCATGA TGCGTCATTT
ATGCTGCGTC CGGCTCACCA ACAACAGGTG GCCGACATTC TGCGTGACCC GGAAGCCAGC
GAAAATGATA AATATGTGGC GCTGCAATTC CTGCGTAACT CCGACATCGC GGCGAAAGGC
ATTCTGCCGA CCTGTCAGGA TACCGGCACC GCGATTATTG TTGGTAAAAA AGGGCAGCGT
GTATGGACCG GTGGCGGTGA TGAAGCGGCG CTGGCACGCG GTGTCTATAA CACTTATATC
GAAGATAATC TGCGCTACTC GCAAAACGCG CCGCTGGATA TGTATAAAGA GGTGAATACC
GGCACCAATC TACCAGCGCA GATCGATCTT TATGCCGTTG ATGGCGACGA GTACAAATTC
CTCTGTATCG CCAAAGGTGG CGGTTCGGCA AACAAGACGT ATCTGTATCA GGAAACCAAA
GCGTTACTAA CGCCGGGGAA ACTGAAAAAT TACCTGGTTG AGAAGATGCG CACGCTGGGT
ACGGCGGCCT GTCCTCCGTA TCATATTGCG TTCGTTATTG GTGGAACTTC TGCAGAAACG
AACCTTAAAA CGGTGAAACT GGCTTCCGCT AAATACTATG ATGAACTGCC AACGGAAGGG
AATGAGCACG GACAGGCGTT CCGCGATGTG GAACTGGAAA AAGAATTGCT GATCGAAGCG
CAAAATCTTG GTCTGGGTGC GCAGTTTGGT GGTAAATACT TCGCTCACGA CATCCGCGTG
ATTCGCCTGC CACGTCACGG CGCATCCTGC CCGGTCGGTA TGGGCGTCTC CTGTTCTGCT
GACCGTAATA TCAAAGCGAA GATCAACCGT CAGGGGATCT GGATCGAAAA ACTGGAACAT
AATCCAGGAA AATATATCCC GGAAGAGCTG CGCAAAGCGG GAGAAGGCGA AGCGGTGCGC
GTTGACCTTA ACCGTCCGAT GAAAGAGATC CTCGCACAGT TGTCGCAGTA TCCCGTTTCT
ACACGCTTAT CGCTGAATGG CACGATTATC GTCGGTCGTG ATATTGCTCA CGCCAAACTG
AAAGAGCGGA TGGATAACGG TGAAGGGCTG CCGCAGTACA TCAAAGATCA TCCGATTTAC
TATGCGGGTC CGGCCAAAAC GCCGGAAGGT TATGCCTCCG GTTCTCTTGG CCCAACGACC
GCCGGACGGA TGGATTCTTA TGTCGATCAA CTGCAAGCGC AGGGAGGAAG TATGATCATG
CTGGCGAAAG GCAACCGCAG CCAGCAGGTG ACGGATGCCT GTAAAAAACA CGGCGGTTTC
TACCTTGGCA GTATCGGTGG TCCGGCCGCT GTATTGGCGC AGGGTAGTAT TAAGAGCCTG
GAATGTGTTG AATATCCGGA ACTGGGAATG GAAGCCATCT GGAAAATTGA AGTGGAAGAT
TTCCCGGCAT TTATCCTTGT GGATGATAAA GGAAATGACT TCTTCCAGCA GATACAACTC
ACACAGTGCA CCCGCTGTGT GAAATAA
 
Protein sequence
MSNKPFHYQA PFPLKKDDTE YYLLTSEHVS VSEFEGQEIL KVAPEALTLL ARQAFHDASF 
MLRPAHQQQV ADILRDPEAS ENDKYVALQF LRNSDIAAKG ILPTCQDTGT AIIVGKKGQR
VWTGGGDEAA LARGVYNTYI EDNLRYSQNA PLDMYKEVNT GTNLPAQIDL YAVDGDEYKF
LCIAKGGGSA NKTYLYQETK ALLTPGKLKN YLVEKMRTLG TAACPPYHIA FVIGGTSAET
NLKTVKLASA KYYDELPTEG NEHGQAFRDV ELEKELLIEA QNLGLGAQFG GKYFAHDIRV
IRLPRHGASC PVGMGVSCSA DRNIKAKINR QGIWIEKLEH NPGKYIPEEL RKAGEGEAVR
VDLNRPMKEI LAQLSQYPVS TRLSLNGTII VGRDIAHAKL KERMDNGEGL PQYIKDHPIY
YAGPAKTPEG YASGSLGPTT AGRMDSYVDQ LQAQGGSMIM LAKGNRSQQV TDACKKHGGF
YLGSIGGPAA VLAQGSIKSL ECVEYPELGM EAIWKIEVED FPAFILVDDK GNDFFQQIQL
TQCTRCVK