Gene EcolC_2019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2019 
SymbolfumC 
ID6067990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2229904 
End bp2231307 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content54% 
IMG OID641601431 
Productfumarate hydratase 
Protein accessionYP_001724990 
Protein GI170020036 
COG category[C] Energy production and conversion 
COG ID[COG0114] Fumarase 
TIGRFAM ID[TIGR00979] fumarate hydratase, class II 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACAG TACGCAGCGA AAAAGATTCG ATGGGGGCGA TTGATGTCCC GGCAGATAAG 
CTGTGGGGCG CACAAACTCA ACGCTCGCTG GAGCATTTCC GCATTTCGAC GGAGAAAATG
CCCACCTCAC TGATTCATGC GCTGGCGCTA ACCAAGCGCG CAGCGGCAAA AGTTAATGAA
GATTTAGGCT TGTTGTCTGA AGAGAAAGCG AGCGCCATTC GGCAGGCGGC GGATGAAGTA
CTGGCAGGAC AGCATGACGA CGAATTCCCG CTGGCTATCT GGCAGACCGG CTCCGGCACG
CAAAGTAACA TGAACATGAA CGAAGTGCTG GCTAACCGGG CCAGTGAATT ACTCGGCGGC
GTGCGCGGGA TGGAACGTAA AGTTCACCCT AACGACGACG TGAACAAAAG CCAAAGTTCC
AACGATGTCT TTCCGACGGC GATGCACGTT GCGGCGCTGC TGGCGCTGCG CAAGCAACTC
ATTCCGCAGC TTAAAACCCT GACACAGACG CTGAGTGAAA AATCGCGTGC ATTTGCCGAT
ATCGTCAAAA TCGGTCGAAC CCACTTGCAG GACGCCACGC CGCTAACACT GGGGCAGGAG
ATTTCCGGCT GGGTAGCGAT GCTCGAGTAT AATCTCAAAC ATATCGAATA CAGCCTGCCT
CACGTAGCGG AACTGGCTCT TGGCGGTACA GCGGTGGGTA CTGGACTAAA TACCCATCCG
GAGTATGCGC GTCGCGTAGC AGATGAACTG GCAGTCATTA CCTGTGCACC GTTTGTTACC
GCGCCGAACA AATTTGAAGC GCTGGCGACC TGTGATGCTC TGGTTCAGGC GCACGGCGCG
TTGAAAGGGT TGGCTGCGTC ACTGATGAAA ATCGCCAATG ATGTCCGCTG GCTGGCCTCT
GGCCCGCGCT GCGGAATTGG TGAAATCTCA ATCCCGGAAA ATGAGCCGGG CAGCTCAATC
ATGCCGGGGA AAGTGAATCC AACACAGTGT GAGGCATTAA CCATGCTCTG CTGTCAGGTG
ATGGGGAACG ACGTGGCGAT CAACATGGGG GGCGCTTCCG GTAACTTTGA ACTGAACGTC
TTCCGTCCAA TGGTGATCCA CAATTTCCTG CAATCGGTGC GCTTGCTGGC AGATGGCATG
GAAAGTTTTA ACAAACACTG CGCAGTGGGT ATTGAACCGA ATCGTGAGCG AATCAATCAA
TTACTCAATG AATCGCTGAT GCTGGTGACT GCGCTTAACA CCCACATTGG TTATGACAAA
GCCGCGGAGA TCGCCAAAAA AGCGCATAAA GAAGGGCTGA CCTTAAAAGC TGCGGCCCTT
GCGCTGGGGT ATCTTAGCGA AGCCGAGTTT GACAGCTGGG TACGGCCAGA ACAGATGGTC
GGCAGTATGA AAGCCGGGCG TTAA
 
Protein sequence
MNTVRSEKDS MGAIDVPADK LWGAQTQRSL EHFRISTEKM PTSLIHALAL TKRAAAKVNE 
DLGLLSEEKA SAIRQAADEV LAGQHDDEFP LAIWQTGSGT QSNMNMNEVL ANRASELLGG
VRGMERKVHP NDDVNKSQSS NDVFPTAMHV AALLALRKQL IPQLKTLTQT LSEKSRAFAD
IVKIGRTHLQ DATPLTLGQE ISGWVAMLEY NLKHIEYSLP HVAELALGGT AVGTGLNTHP
EYARRVADEL AVITCAPFVT APNKFEALAT CDALVQAHGA LKGLAASLMK IANDVRWLAS
GPRCGIGEIS IPENEPGSSI MPGKVNPTQC EALTMLCCQV MGNDVAINMG GASGNFELNV
FRPMVIHNFL QSVRLLADGM ESFNKHCAVG IEPNRERINQ LLNESLMLVT ALNTHIGYDK
AAEIAKKAHK EGLTLKAAAL ALGYLSEAEF DSWVRPEQMV GSMKAGR