Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4588 |
Symbol | fumB |
ID | 6145226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4690644 |
End bp | 4692290 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641619404 |
Product | fumarate hydratase class I, anaerobic |
Protein accession | YP_001746516 |
Protein GI | 170681722 |
COG category | [C] Energy production and conversion |
COG ID | [COG1838] Tartrate dehydratase beta subunit/Fumarate hydratase class I, C-terminal domain [COG1951] Tartrate dehydratase alpha subunit/Fumarate hydratase class I, N-terminal domain |
TIGRFAM ID | [TIGR00722] hydro-lyases, Fe-S type, tartrate/fumarate subfamily, alpha region [TIGR00723] hydro-lyases, Fe-S type, tartrate/fumarate subfamily, beta region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAACA AACCCTTTAT CTACCAGGCA CCTTTCCCGA TGGGGAAAGA CAATACCGAA TACTATCTAC TCACTTCCGA TTACGTTAGC GTTGCCGACT TCGACGGTGA AACCATCCTG AAAGTGGAAC CAGAAGCCCT GACTCTGCTG GCGCAGCAAG CCTTTCACGA TGCTTCTTTC ATGCTCCGCC CGGCACACCA GAAACAGGTT GCGGCTATTC TTCACGATCC AGAAGCCAGC GAAAACGACA AGTACGTGGC GCTGCAATTC TTAAGAAACT CCGAAATCGC CGCCAAAGGC GTGCTGCCGA CCTGCCAGGA TACCGGCACC GCGATCATCG TCGGTAAAAA AGGTCAGCGC GTGTGGACCG GTGGCGGTGA TGAAGAAGCG CTGTCGAAAG GCGTCTATAA CACCTATATC GAAGATAACC TGCGCTATTC ACAGAACGCG CCGCTGGACA TGTATAAAGA GGTCAACACC GGCACGAACC TGCCTGCGCA AATCGACCTG TACGCGGTAG ATGGCGATGA GTACAAATTC CTCTGTGTCT CTAAAGGCGG CGGCTCTGCC AACAAAACGT ATCTCTACCA GGAAACCAAA GCCCTGCTGA CGCCCGGCAA ACTGAAAAAC TTCCTCGTCG AGAAAATGCG TACCCTCGGT ACTGCAGCCT GCCCGCCGTA CCATATCGCG TTTGTGATTG GCGGTACGTC TGCGGAAACC AACCTGAAAA CCGTCAAGTT AGCAAGCGCT CACTATTACG ATGAACTGCC GACGGAAGGG AACGAACATG GTCAGGCGTT CCGCGATATC CAGCTGGAAC AGGAACTGCT GGAAGAGGCC CAGAAACTCG GTCTTGGCGC GCAGTTTGGC GGTAAATACT TCGCGCACGA CATCCGCGTT ATCCGTCTGC CACGTCACGG CGCATCCTGC CCGGTAGGCA TGGGCGTCTC CTGCTCCGCT GACCGTAACA TTAAAGCGAA AATCAACCGC GAAGGTATCT GGATCGAAAA ACTGGAACAC AATCCAGGCC AGTACATTCC GGAAGAACTG CGCAAGGCCG GTGAAGGCGA AGCGGTGAAA GTTGACCTTA ACCGCCCGAT GAAAGAGATC CTCGCCCAGC TTTCGCAATA CCCGGTATCC ACCCGCTTGT CGCTCACCGG CACCATTATC GTGGGTCGCG ATATTGCACA CGCCAAGCTG AAAGAGCTGA TTGACGCCGG TAAAGAACTG CCGCAGTACA TCAAAGATCA CCCGATCTAC TACGCGGGTC CGGCGAAAAC CCCTGCCGGT TATCCATCAG GTTCACTTGG CCCAACCACC GCAGGTCGTA TGGACTCCTA CGTGGATCTG CTGCAATCCC ACGGCGGCAG CATGATCATG CTGGCGAAAG GTAACCGCAG TCAGCAGGTT ACTGACGCGT GCCATAAACA CGGCGGCTTC TACCTCGGTA GCATCGGCGG TCCGGCGGCG GTACTGGCGC AGCAGAGCAT CAAGCATTTG GAATGCGTCG CTTATCCGGA GCTGGGTATG GAAGCTATCT GGAAAATCGA AGTAGAAGAT TTCCCGGCGT TTATCCTGGT CGATGACAAA GGTAACGACT TCTTCCAGCA AATCGTCAAC AAACAGTGCG CGAACTGCAC TAAGTAA
|
Protein sequence | MSNKPFIYQA PFPMGKDNTE YYLLTSDYVS VADFDGETIL KVEPEALTLL AQQAFHDASF MLRPAHQKQV AAILHDPEAS ENDKYVALQF LRNSEIAAKG VLPTCQDTGT AIIVGKKGQR VWTGGGDEEA LSKGVYNTYI EDNLRYSQNA PLDMYKEVNT GTNLPAQIDL YAVDGDEYKF LCVSKGGGSA NKTYLYQETK ALLTPGKLKN FLVEKMRTLG TAACPPYHIA FVIGGTSAET NLKTVKLASA HYYDELPTEG NEHGQAFRDI QLEQELLEEA QKLGLGAQFG GKYFAHDIRV IRLPRHGASC PVGMGVSCSA DRNIKAKINR EGIWIEKLEH NPGQYIPEEL RKAGEGEAVK VDLNRPMKEI LAQLSQYPVS TRLSLTGTII VGRDIAHAKL KELIDAGKEL PQYIKDHPIY YAGPAKTPAG YPSGSLGPTT AGRMDSYVDL LQSHGGSMIM LAKGNRSQQV TDACHKHGGF YLGSIGGPAA VLAQQSIKHL ECVAYPELGM EAIWKIEVED FPAFILVDDK GNDFFQQIVN KQCANCTK
|
| |