Gene EcHS_A1687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1687 
SymbolfumA 
ID5591274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1708865 
End bp1710511 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content51% 
IMG OID640920835 
Productfumarate hydratase 
Protein accessionYP_001458391 
Protein GI157161073 
COG category[C] Energy production and conversion 
COG ID[COG1838] Tartrate dehydratase beta subunit/Fumarate hydratase class I, C-terminal domain
[COG1951] Tartrate dehydratase alpha subunit/Fumarate hydratase class I, N-terminal domain 
TIGRFAM ID[TIGR00722] hydro-lyases, Fe-S type, tartrate/fumarate subfamily, alpha region
[TIGR00723] hydro-lyases, Fe-S type, tartrate/fumarate subfamily, beta region 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value0.713112 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAACA AACCCTTTCA TTATCAGGCT CCTTTTCCAC TCAAAAAAGA TGATACTGAG 
TATTACCTGC TAACCAGCGA ACACGTTAGC GTATCTGAAT TTGAAGGGCA GGAGATTTTG
AAAGTCGCAC CCGAAGCGTT AACTCTGTTG GCGCGTCAGG CGTTTCATGA TGCGTCGTTT
ATGCTGCGTC CGGCTCACCA ACAACAGGTG GCCGACATTC TGCGTGACCC GGAGGCCAGC
GAAAATGATA AATATGTGGC GCTGCAATTC CTGCGTAACT CCGACATCGC GGCGAAAGGC
GTTCTGCCAA CCTGTCAGGA TACCGGCACT GCGATTATTG TTGGTAAAAA AGGGCAGCGT
GTATGGACCG GTGGCTGTGA TGAAGCGGCG CTGGCGCGCG GTGTCTATAA CACTTATATC
GAAGATAATT TGCGCTACTC GCAAAACGCG CCGCTGGATA TGTATAAAGA GGTGAATACC
GGCACCAATC TGCCAGCGCA GATCGATCTT TATGCCGTTG ATGGCGACGA GTACAAATTC
CTCTGTATCG CCAAAGGTGG TGGTTCGGCA AACAAGACGT ATCTCTATCA GGAAACCAAA
GCGTTACTAA CGCCGGGGAA ACTGAAAAAT TACCTGGTTG AGAAGATGCG CACGCTGGGT
ACGGCGGCCT GTCCTCCGTA TCATATTGCG TTCGTAATTG GTGGAACTTC TGCAGAAACT
AACCTTAAAA CAGTGAAACT GGCTTCCGCT AAATACTATG ACGAACTGCC AACGGAAGGG
AATGAGCACG GTCAGGCATT CCGCGATGTG GAACTGGAAA AAGAATTGCT GATCGAAGCG
CAAAATCTCG GTCTGGGTGC GCAGTTTGGT GGTAAATACT TCGCTCACGA CATACGCGTG
ATTCGCCTGC CACGTCACGG CGCATCCTGT CCGGTTGGTA TGGGTGTTTC CTGCTCTGCT
GACCGTAATA TCAAAGCGAA GATCAACCGT CAGGGGATCT GGATCGAAAA ACTGGAACAT
AATCCAGGTA AATATATCCC TGAAGAGCTG CGTAAAGCGG GAGAAGGCGA AGCGGTGCGC
GTTGACCTTA ACCGTCCGAT GAAAGAGATC CTCGCACAGT TGTCGCAGTA TCCCGTTTCT
ACACGCTTAT CGCTTAACGG CACGATTATC GTCGGTCGTG ATATTGCTCA CGCCAAACTG
AAAGAGCGGA TGGATAACGG TGAAGGGCTG CCGCAGTACA TTAAAGATCA TCCGATTTAC
TACGCGGGTC CGGCCAAAAC GCCGGAAGGT TATGCCTCCG GTTCTCTTGG CCCAACGACC
GCCGGACGGA TGGATTCTTA TGTCGATCAA CTGCAAGCGC AGGGCGGAAG TATGATCATG
CTGGCGAAAG GCAACCGCAG CCAGCAGGTG ACGGATGCCT GTAAAAAACA CGGCGGCTTC
TACCTTGGCA GTATCGGTGG TCCGGCCGCT GTATTGGCGC AGGGAAGTAT TAAGAGCCTG
GAATGTGTTG AATATCCGGA ACTGGGAATG GAAGCCATCT GGAAAATTGA AGTGGAAGAT
TTCCCGGCGT TTATCCTTGT GGATGATAAA GGAAATGACT TCTTCCAGCA GATACAACTC
ACACAGTGCA CTCGCTGTGT GAAATAA
 
Protein sequence
MSNKPFHYQA PFPLKKDDTE YYLLTSEHVS VSEFEGQEIL KVAPEALTLL ARQAFHDASF 
MLRPAHQQQV ADILRDPEAS ENDKYVALQF LRNSDIAAKG VLPTCQDTGT AIIVGKKGQR
VWTGGCDEAA LARGVYNTYI EDNLRYSQNA PLDMYKEVNT GTNLPAQIDL YAVDGDEYKF
LCIAKGGGSA NKTYLYQETK ALLTPGKLKN YLVEKMRTLG TAACPPYHIA FVIGGTSAET
NLKTVKLASA KYYDELPTEG NEHGQAFRDV ELEKELLIEA QNLGLGAQFG GKYFAHDIRV
IRLPRHGASC PVGMGVSCSA DRNIKAKINR QGIWIEKLEH NPGKYIPEEL RKAGEGEAVR
VDLNRPMKEI LAQLSQYPVS TRLSLNGTII VGRDIAHAKL KERMDNGEGL PQYIKDHPIY
YAGPAKTPEG YASGSLGPTT AGRMDSYVDQ LQAQGGSMIM LAKGNRSQQV TDACKKHGGF
YLGSIGGPAA VLAQGSIKSL ECVEYPELGM EAIWKIEVED FPAFILVDDK GNDFFQQIQL
TQCTRCVK