Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1564 |
Symbol | sfcA |
ID | 5591799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1570784 |
End bp | 1572481 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640920717 |
Product | malate dehydrogenase |
Protein accession | YP_001458273 |
Protein GI | 157160955 |
COG category | [C] Energy production and conversion |
COG ID | [COG0281] Malic enzyme |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 54 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACCAA AAACAAAAAA ACAGCGTTCG CTTTATATCC CTTACGCTGG CCCTGTACTG CTGGAATTTC CGTTGTTGAA TAAAGGCAGT GCCTTCAGCA TGGAAGAACG CCGTAACTTC AACCTGCTGG GGTTACTGCC GGAAGTGGTC GAAACCATCG AAGAACAAGC GGAACGAGCA TGGATCCAGT ATCAGGGATT CAAAACCGAA ATCGACAAAC ACATCTACCT GCGTAACATC CAGGACACTA ACGAAACCCT CTTCTACCGT CTGGTAAACA ATCATCTTGA TGAGATGATG CCTGTTATTT ATACCCCAAC CGTCGGCGCA GCCTGTGAGC GTTTTTCTGA GATCTACCGC CGTTCACGCG GCGTGTTTAT CTCTTACCAG AACCGGCACA ATATGGACGA TATTCTGCAA AACGTGCCGA ACCATAATAT TAAAGTGATT GTGGTGACTG ACGGTGAACG TATTCTGGGG CTTGGTGACC AGGGCATCGG CGGGATGGGC ATTCCGATCG GTAAACTGTC GCTCTATACC GCCTGTGGCG GCATCAGCCC GGCGTATACC CTTCCGGTGG TGCTGGATGT CGGAACGAAC AACCAACAGC TGCTTAACGA TCCGCTGTAT ATGGGCTGGC GTAATCCGCG TATCACTGAC GACGAATACT ATGAATTCGT TGATGAATTT ATCCAGGCTG TGAAACAACG CTGGCCAGAC GTGCTGTTGC AGTTTGAAGA CTTTGCTCAA AAAAATGCGA TGCCGTTACT TAACCGCTAT CGCAATGAAA TTTGTTCTTT TAACGATGAC ATTCAGGGCA CTGCGGCGGT AACAGTCGGC ACACTGATCG CAGCAAGCCG AGCGGCAGGT GGTCAGTTAA GCGAGAAAAA AATCGTCTTC CTTGGTGCAG GTTCAGCGGG ATGCGGCATT GCCGAAATGA TCATCGCCCA GACCCAGCGC GAAGGATTAA GCGAGGAAGC GGCGCGGCAG AAAGTCTTTA TGGTCGATCG CTTTGGCTTG CTGACTGACA AGATGCCGAA CCTGCTGCCT TTCCAGACCA AACTGGTGCA GAAGCGCGAA AACCTCAGTG ACTGGGATAC CGACAGCGAT GTGCTGTCAC TGCTGGATGT GGTGCGCAAT GTAAAACCAG ATATTCTGAT TGGCGTCTCA GGACAGACCG GGCTGTTTAC GGAAGAGATC ATCCGTGAGA TGCATAAACA CTGTCCGCGT CCGATCGTGA TGCCGCTGTC TAACCCGACG TCACGCGTGG AAGCCACACC GCAGGACATT ATCGCCTGGA CCGAAGGTAA CGCGCTGGTC GCCACGGGCA GCCCGTTTAA TCCAGTGGTA TGGAAAGATA AAATCTACCC TATCGCCCAG TGTAACAACG CCTTTATTTT CCCGGGCATC GGCCTGGGTG TTATTGCTTC CGGCGCGTCA CGTATCACCG ATGAGATGCT GATGTCGGCA AGTGAAACGC TGGCGCAGTA TTCACCATTG GTGCTGAACG GCGAAGGTAT GGTACTGCCG GAACTGAAAG ATATTCAGAA AGTCTCCCGC GCAATTGCGT TTGCGGTTGG CAAAATGGCG CAGCAGCAAG GCGTGGCGGT GAAAACCTCT GCCGAAGCCC TGCAACAGGC CATTGACGAT AATTTCTGGC AAGCCGAATA CCGCGACTAC CGCCGTACCT CCATCTAA
|
Protein sequence | MEPKTKKQRS LYIPYAGPVL LEFPLLNKGS AFSMEERRNF NLLGLLPEVV ETIEEQAERA WIQYQGFKTE IDKHIYLRNI QDTNETLFYR LVNNHLDEMM PVIYTPTVGA ACERFSEIYR RSRGVFISYQ NRHNMDDILQ NVPNHNIKVI VVTDGERILG LGDQGIGGMG IPIGKLSLYT ACGGISPAYT LPVVLDVGTN NQQLLNDPLY MGWRNPRITD DEYYEFVDEF IQAVKQRWPD VLLQFEDFAQ KNAMPLLNRY RNEICSFNDD IQGTAAVTVG TLIAASRAAG GQLSEKKIVF LGAGSAGCGI AEMIIAQTQR EGLSEEAARQ KVFMVDRFGL LTDKMPNLLP FQTKLVQKRE NLSDWDTDSD VLSLLDVVRN VKPDILIGVS GQTGLFTEEI IREMHKHCPR PIVMPLSNPT SRVEATPQDI IAWTEGNALV ATGSPFNPVV WKDKIYPIAQ CNNAFIFPGI GLGVIASGAS RITDEMLMSA SETLAQYSPL VLNGEGMVLP ELKDIQKVSR AIAFAVGKMA QQQGVAVKTS AEALQQAIDD NFWQAEYRDY RRTSI
|
| |