Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0383 |
Symbol | mhpE |
ID | 6146682 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 398086 |
End bp | 399099 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641615279 |
Product | 4-hydroxy-2-ketovalerate aldolase |
Protein accession | YP_001742486 |
Protein GI | 170682868 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR03217] 4-hydroxy-2-oxovalerate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGATA AAAAACTTTA TATCTCGGAC GTCACGTTGC GTGATGGTAT GCACGCCATT CGTCATCAGT ATTCATTGGA AAACGTTCGC CAGGTTGCCA AAGCACTGGA CGATGCCCGC GTGGATTCGA TTGAAGTGGC CCACGGCGAC GGTTTGCAAG GTTCCAGCTT TAACTATGGT TTCGGCGCAC ATAGCGACCT TGAATGGATT GAAGCGGCGG CGGATGTGGT GAAGCACGCC AAAATCGCGA CGTTGTTGCT GCCTGGAATC GGCACTATTC ACGATCTGAA AAATGCCTGG CAGGCTGGCG CGCGGGTGGT TCGTGTGGCT ACGCACTGTA CCGAAGCTGA TGTTTCCGCC CAGCATATTC AGTATGCCCG CGAGCTCGGA ATGGACACCG TTGGCTTTCT GATGATGAGC CATATGACCA CGCCGGAGAA TCTCGCAAAG CAGGCAAAGC TGATGGAAGG CTACGGTGCG ACCTGTATTT ATGTGGTGGA TTCTGGCGGT GCGATGAACA TGAGCGATAT CCGTGACCGT TTCCGCGCCC TGAAAGCAGT GCTGAAACCA GAAACGCAAA CCGGCATACA CGCTCACCAT AACCTGAGTC TTGGCGTGGC GAACTCTATC GCGGCGGTGG AAGAGGGCTG CGACCGAATC GATGCCAGTC TCGCGGGAAT GGGCGCGGGC GCAGGTAACG CGCCGCTGGA AGTGTTTATT GCCGCCGCGG ATAAACTGGG TTGGCAGCAT GGGACCGATC TCTATGCGTT AATGGATGCC GCTGACGACC TGGTGCGTCC GTTGCAGGAT CGTCCGGTAC GAGTCGATCG CGAAACGCTG GCGCTGGGAT ACGCCGGTGT TTACTCCAGC TTCCTGCGTC ACTGTGAAAC GGCGGCGGCG CGTTATGGCT TAAGCGCGGT GGATATTCTC GTTGAGTTGG GCAAACGCCG GATGGTTGGC GGCCAGGAGG ATATGATCGT CGACGTGGCG CTGGATCTGC GCAACAACAA ATAA
|
Protein sequence | MNDKKLYISD VTLRDGMHAI RHQYSLENVR QVAKALDDAR VDSIEVAHGD GLQGSSFNYG FGAHSDLEWI EAAADVVKHA KIATLLLPGI GTIHDLKNAW QAGARVVRVA THCTEADVSA QHIQYARELG MDTVGFLMMS HMTTPENLAK QAKLMEGYGA TCIYVVDSGG AMNMSDIRDR FRALKAVLKP ETQTGIHAHH NLSLGVANSI AAVEEGCDRI DASLAGMGAG AGNAPLEVFI AAADKLGWQH GTDLYALMDA ADDLVRPLQD RPVRVDRETL ALGYAGVYSS FLRHCETAAA RYGLSAVDIL VELGKRRMVG GQEDMIVDVA LDLRNNK
|
| |