Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1695 |
Symbol | sfcA |
ID | 6146449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1696479 |
End bp | 1698176 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616571 |
Product | malate dehydrogenase |
Protein accession | YP_001743749 |
Protein GI | 170682113 |
COG category | [C] Energy production and conversion |
COG ID | [COG0281] Malic enzyme |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.488975 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACCAA AAACAAAAAA ACAGCGTTCG CTTTATATCC CTTACGCTGG CCCTGTATTG CTGGAATTTC CGTTGTTGAA TAAAGGCAGC GCCTTCAGCA TGGAAGAACG CCGTAACTTC AACCTGCTGG GGTTACTGCC GGAAGTGGTC GAAACCATCG AAGAACAAGC GGAACGAGCA TGGATCCAGT ATCAGGGATT CAAAACCGAA ATCGACAAAC ACATCTACCT GCGTAACATC CAGGACACTA ACGAAACCCT CTTCTACCGT CTGGTAAACA ATCATCTTGA TGAGATGATG CCAGTCATTT ACACCCCAAC CGTCGGCGCA GCCTGTGAGC GTTTTTCTGA GATCTACCGC CGTTCACGCG GTGTGTTTAT CTCTTACCAG AACCGCCACA ATATGGACGA TATTCTGCAA AACGTACCAA ACCATAATAT CAAAGTGATT GTGGTGACTG ACGGCGAACG TATTCTGGGG CTTGGTGACC AGGGCATCGG AGGGATGGGC ATTCCGATCG GTAAACTGTC ACTCTATACC GCCTGTGGCG GCATCAGCCC GGCGTATACC CTTCCGGTGG TGCTGGATGT CGGGACGAAC AACCAACAGC TGCTTAACGA TCCGCTGTAT ATGGGCTGGC GTAATCCGCG TATCACTGAC GACGAATACT ATGAATTCGT TGATGAATTT ATCCAGGCTG TGAAACAACG CTGGCCGGAC GTACTGTTGC AGTTTGAAGA CTTCGCACAA AAAAATGCGA TGCCGTTACT TAACCGCTAT CGCAATGAAA TTTGTTCTTT TAACGATGAC ATTCAGGGCA CCGCGGCGGT AACGGTCGGC ACACTGATCG CAGCCAGCCG CGCGGCGGGT GGTCAACTGA GCGAGAAAAA GATTGTCTTC CTCGGCGCAG GCTCTGCGGG GTGCGGGATT GCCGAAATGA TCATTGCCCA GACTCAGCGT GAAGGATTAA GCGAGGAAAC GGCGCGGCAG AAAGTCTTTA TGGTCGATCG CTTTGGCCTG CTGACCGACA AGATGCCGAA CCTGCTGCCT TTCCAGACCA AACTGGTGCA GAAACGCGAA AACCTCAGTG ACTGGGATAC CGACAGCGAT GTGCTGTCAC TGCTGGATGT GGTGCGCAAT GTAAAACCAG ATATTCTGAT TGGCGTCTCA GGACAGACCG GGCTGTTTAC GGAAGAGATC ATCCGTGAGA TGCATAAACA CTGTCCGCGT CCGATCGTGA TGCCGCTGTC TAACCCGACT TCACGCGTCG AAGCAACACC GCAGGACATT ATCGCCTGGA CTGAAGGTAA CGCGCTGGTC GCCACTGGCA GTCCATTTAA TCCAGTGGTA TGGAAAGATA AAATCTACCC TATCGCCCAG TGTAATAACG CCTTTATTTT CCCGGGCATC GGGCTGGGTG TTATTGCTTC CGGCGCGTCA CGTATCACCG ATGAAATGCT GATGTCGGCA AGTGAAACGC TTGCTCAGTA TTCGCCGCTG GTCCTGAACG GCGAAGGTCT GGTACTACCG GAACTGAAAG ATATACAGAA AGTCTCCCGC GCAATTGCGT TTGCGGTTGG CAAAATGGCG CAGCAGCAAG GCGTGGCGGT GAAAACCTCC GCCGAAGCTC TGCAACAGGC CATTGATGAT AATTTCTGGC AAGCCGAATA CCGCGACTAC CGCCGTACCT CCATCTAA
|
Protein sequence | MEPKTKKQRS LYIPYAGPVL LEFPLLNKGS AFSMEERRNF NLLGLLPEVV ETIEEQAERA WIQYQGFKTE IDKHIYLRNI QDTNETLFYR LVNNHLDEMM PVIYTPTVGA ACERFSEIYR RSRGVFISYQ NRHNMDDILQ NVPNHNIKVI VVTDGERILG LGDQGIGGMG IPIGKLSLYT ACGGISPAYT LPVVLDVGTN NQQLLNDPLY MGWRNPRITD DEYYEFVDEF IQAVKQRWPD VLLQFEDFAQ KNAMPLLNRY RNEICSFNDD IQGTAAVTVG TLIAASRAAG GQLSEKKIVF LGAGSAGCGI AEMIIAQTQR EGLSEETARQ KVFMVDRFGL LTDKMPNLLP FQTKLVQKRE NLSDWDTDSD VLSLLDVVRN VKPDILIGVS GQTGLFTEEI IREMHKHCPR PIVMPLSNPT SRVEATPQDI IAWTEGNALV ATGSPFNPVV WKDKIYPIAQ CNNAFIFPGI GLGVIASGAS RITDEMLMSA SETLAQYSPL VLNGEGLVLP ELKDIQKVSR AIAFAVGKMA QQQGVAVKTS AEALQQAIDD NFWQAEYRDY RRTSI
|
| |