Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4468 |
Symbol | aceB |
ID | 6145921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4560847 |
End bp | 4562448 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641619284 |
Product | malate synthase |
Protein accession | YP_001746396 |
Protein GI | 170683352 |
COG category | [C] Energy production and conversion |
COG ID | [COG2225] Malate synthase |
TIGRFAM ID | [TIGR01344] malate synthase A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.554471 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.0868855 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGAAC AGGCAACAAC AACCGATGAA CTGGCTTTCA TAAGGCCGTA TGGCGAGCAG GAGAAGCAAA TTCTTACTGC CGAAGCGGTA GAATTTCTGA CTGAGCTGGT GACGCATTTT ACGCCACAAC GCAATAAACT TCTGGCAGCG CGCATTCAGC AGCAGCAGGA TATCGATAAC GGAACGTTGC CTGATTTTAT TTCGGAAACA GCTTCCATTC GTGATGCTGA CTGGAAAATT CGTGGTATTC CCGCGGACTT ACAAGATCGT CGAGTCGAGA TAACTGGCCC GGTTGAGCGC AAGATGGTGA TCAACGCGCT CAACGCCAAT GTGAAAGTCT TTATGGCCGA TTTCGAAGAT TCACTGGCCC CGGACTGGAA CAAAGTGATC GACGGGCAAA TTAACCTGCG CGATGCGGTT AACGGCACCA TCAGCTATAC CAATGAAGCA GGCAAAATTT ATCAGCTCAA GCCCAATCCA GCGGTGTTGA TTTGTCGGGT TCGCGGTCTG CACTTGCCGG AAAAACATGT CACCTGGCGC GGCGAAGCAA TTCCTGGCAG CCTGTTTGAT TTTGCGCTCT ATTTCTTCCA CAACTACCAG GCTCTGTTAG CAAAAGGCAG CGGTCCCTAT TTCTATCTGC CGAAAACCCA GTCCTGGCAG GAAGCGGCCT GGTGGAGCGA AGTCTTCAGC TATGCAGAAG ATCGCTTTAA TCTGCCGCGC GGCACCATCA AGGCGACGTT GCTGATTGAA ACGCTGCCCG CCGTGTTCCA GATGGACGAA ATCCTTCACG CGCTGCGTGA CCATATTGTT GGTCTGAACT GCGGTCGTTG GGATTACATC TTCAGTTATA TCAAAACGTT GAAAAACTAT CCCGATCGCG TCCTGCCAGA TAGACAGGCA GTGACGATGG ATAAACCCTT CCTGAATGCT TACTCACGCC TGTTGATTAA AACCTGCCAT AAACGCGGCG CATTTGCGAT GGGCGGCATG GCGGCGTTTA TTCCGAGCAA AGATGAAGAG CGCAATAACC AGGTGCTCAA CAAAGTAAAA GCGGATAAAT CGCTGGAAGC CAATAACGGT CACGATGGCA CATGGATCGC TCACCCAGGT CTTGCGGATA CGGCAATGGC GGTATTCAAC GACATTCTCG GCTCCCGTAA AAATCAGCTT GAAGTGATGC GCGAACAAGA CGCGCCGATT ACTGCCGATC AGCTGCTGGC ACCTTGTGAT GGTGAACGCA CCGAAGAAGG TATGCGCGCC AACATTCGCG TGGCTGTGCA GTACATCGAA GCATGGATCT CCGGCAACGG CTGCGTGCCG ATTTATGGCC TGATGGAAGA TGCGGCGACG GCTGAAATTT CCCGTACCTC AATCTGGCAG TGGATCCATC ATCAAAAAAC GTTGAGCAAT GGCAAACCGG TGACCAAAGC CTTGTTCCGC CAGATGCTGG GCGAAGAGAT GAAAGTCATT GCCAGCGAAC TGGGCGAAGA ACGTTTCTCC CAGGGGCGTT TTGACGATGC CGCACGTTTG ATGGAACAGA TCACCACTTC CGATGAGTTA ATTGATTTCC TGACCCTGCC AGGCTACCGC CTGTTAGCGT AA
|
Protein sequence | MTEQATTTDE LAFIRPYGEQ EKQILTAEAV EFLTELVTHF TPQRNKLLAA RIQQQQDIDN GTLPDFISET ASIRDADWKI RGIPADLQDR RVEITGPVER KMVINALNAN VKVFMADFED SLAPDWNKVI DGQINLRDAV NGTISYTNEA GKIYQLKPNP AVLICRVRGL HLPEKHVTWR GEAIPGSLFD FALYFFHNYQ ALLAKGSGPY FYLPKTQSWQ EAAWWSEVFS YAEDRFNLPR GTIKATLLIE TLPAVFQMDE ILHALRDHIV GLNCGRWDYI FSYIKTLKNY PDRVLPDRQA VTMDKPFLNA YSRLLIKTCH KRGAFAMGGM AAFIPSKDEE RNNQVLNKVK ADKSLEANNG HDGTWIAHPG LADTAMAVFN DILGSRKNQL EVMREQDAPI TADQLLAPCD GERTEEGMRA NIRVAVQYIE AWISGNGCVP IYGLMEDAAT AEISRTSIWQ WIHHQKTLSN GKPVTKALFR QMLGEEMKVI ASELGEERFS QGRFDDAARL MEQITTSDEL IDFLTLPGYR LLA
|
| |