Gene Information |
Locus tag | EcolC_4016 |
Symbol | |
ID | 6064573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 4416330 |
End bp | 4417931 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641603427 |
Product | malate synthase |
Protein accession | YP_001726942 |
Protein GI | 170021988 |
COG category | [C] Energy production and conversion |
COG ID | [COG2225] Malate synthase |
TIGRFAM ID | [TIGR01344] malate synthase A |
|
|
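The coordinate and length fields above can be cross-checked against one another. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4416330
END_BP = 4417931


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1602 533
```

Under these assumptions the results agree with the Gene Length (1602 bp) and Protein Length (533 aa) fields.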
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.931285 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0529189 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGAAC AGGCAACAAC AACCGATGAA CTGGCTTTCA CAAGGCCGTA TGGCGAGCAG GAGAAGCAAA TTCTTACTGC CGAAGCGGTA GAATTTCTGA CTGAGCTGGT GACGCATTTT ACGCCACAAC GCAATAAACT TCTGGCAGCG CGCATTCAGC AGCAGCAAGA TATTGATAAC GGAACGTTGC CTGATTTTAT TTCGGAAACA GCTTCCATTC GCGATGCTGA TTGGAAAATT CGCGGGATTC CTGCGGACTT AGAAGACCGC CGCGTAGAGA TAACTGGCCC GGTAGAGCGC AAGATGGTGA TCAACGCGCT CAACGCCAAT GTGAAAGTCT TTATGGCCGA TTTCGAAGAT TCACTGGCAC CAGACTGGAA CAAAGTGATC GACGGGCAAA TTAACCTGCG TGATGCGGTT AACGGCACCA TCAGTTACAC TAATGAAGCA GGCAAAATTT ACCAGCTCAA GCCCAATCCA GCGGTTTTGA TTTGTCGGGT ACGCGGTCTG CACTTGCCGG AAAAACATGT CACCTGGCGT GGTGAGGCAA TCCCCGGCAG CCTGTTTGAT TTTGCGCTCT ATTTCTTCCA CAACTATCAG GCACTGTTGG CAAAGGGCAG TGGTCCCTAT TTCTATCTGC CGAAAACCCA GTCCTGGCAG GAAGCGGCCT GGTGGAGCGA AGTCTTCAGC TATGCAGAAG ATCGCTTTAA TCTGCCGCGC GGCACCATCA AGGCGACGTT GCTGATTGAA ACGCTGCCCG CCGTGTTCCA GATGGATGAA ATCCTTCACG CGCTGCGTGA CCATATTGTT GGTCTGAACT GCGGTCGTTG GGATTACATC TTCAGCTATA TCAAAACGTT GAAAAACTAT CCCGATCGCG TCCTGCCAGA CAGACAGGCA GTGACGATGG ATAAACCATT CCTGAATGCT TACTCACGCC TGTTGATTAA AACCTGCCAT AAACGCGGTG CTTTTGCGAT GGGCGGCATG GCGGCGTTTA TTCCGAGCAA AGATGAAGAG CACAATAACC AGGTGCTCAA CAAAGTAAAA GCGGATAAAT CGCTGGAAGC CAATAACGGT CACGATGGCA CATGGATCGC TCACCCAGGC CTTGCGGACA CGGCAATGGC GGTATTCAAC GACATTCTCG GCTCCCGTAA AAATCAGCTT GAAGTGATGC GCGAACAAGA CGCGCCGATT ACTGCCGATC AGCTGCTGGC ACCTTGTGAT GGTGAACGCA CCGAAGAAGG TATGCGCGCC AACATTCGCG TGGCTGTGCA GTACATCGAA GCGTGGATCT CTGGCAACGG CTGTGTGCCG ATTTATGGCC TGATGGAAGA TGCGGCGACG GCTGAAATTT CCCGTACCTC GATCTGGCAG TGGATCCATC ATCAAAAAAC GTTGAGCAAT GGCAAACCGG TGACCAAAGC CTTGTTCCGC CAGATGCTGG GCGAAGAGAT GAAAGTCATT GCCAGCGAAC TGGGCGAAGA ACGTTTCTCC CAGGGGCGTT TTGACGATGC CGCACGCTTG ATGGAACAGA TCACCACTTC CGATGAGTTA ATTGATTTCC TGACCCTGCC AGGCTACCGC CTGTTAGCGT AA
|
Protein sequence | MTEQATTTDE LAFTRPYGEQ EKQILTAEAV EFLTELVTHF TPQRNKLLAA RIQQQQDIDN GTLPDFISET ASIRDADWKI RGIPADLEDR RVEITGPVER KMVINALNAN VKVFMADFED SLAPDWNKVI DGQINLRDAV NGTISYTNEA GKIYQLKPNP AVLICRVRGL HLPEKHVTWR GEAIPGSLFD FALYFFHNYQ ALLAKGSGPY FYLPKTQSWQ EAAWWSEVFS YAEDRFNLPR GTIKATLLIE TLPAVFQMDE ILHALRDHIV GLNCGRWDYI FSYIKTLKNY PDRVLPDRQA VTMDKPFLNA YSRLLIKTCH KRGAFAMGGM AAFIPSKDEE HNNQVLNKVK ADKSLEANNG HDGTWIAHPG LADTAMAVFN DILGSRKNQL EVMREQDAPI TADQLLAPCD GERTEEGMRA NIRVAVQYIE AWISGNGCVP IYGLMEDAAT AEISRTSIWQ WIHHQKTLSN GKPVTKALFR QMLGEEMKVI ASELGEERFS QGRFDDAARL MEQITTSDEL IDFLTLPGYR LLA
|
| |
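The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the method used to produce this record: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code (the two differ in which start codons are permitted). It also assumes the gene sequence is shown 5' to 3' on the coding strand, which the leading ATG suggests.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering. Translation table 11
# uses the same assignments as the standard code; only start codons differ.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGACTGAAC AGGCAACAAC AACCGATGAA"))  # MTEQATTTDE
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) checks the whole record in the same way; `gc_content` on the full sequence can be compared with the GC content field.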