Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_1743 |
Symbol | |
ID | 5112482 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 1892335 |
End bp | 1893381 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640491932 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001176473 |
Protein GI | 162286712 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0203503 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0341148 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAA CCGACGAACT TCGCACAGCG CGTATTGAAA GTCTGGTGAC GCCCGCAGAA CTGGCACAGC GCCACCCCGT TTCCGCAAGC GTTGCGGAAC ATGTTATTGC GTCGCGCCGA CGCATCGAAA AAATATTGAA CGGTGAAGAT CGCCGTTTGC TGGTGGTTAT TGGCCCTTGC TCGATTCACG ATCTTGATGC GGCACTGGAT TATGCCAAGC GCCTGAAAGT GCTGCGCGAT AAGCACCAGG ATCGCCTTGA AATCGTGATG CGTACCTATT TCGAGAAACC ACGTACCGTG GTGGGCTGGA AAGGGTTGAT TTCCGATCCG GATTTGAACG GCAGCTATCG CGTCAATCAC GGTATCGAGC TGGCGCGTAA ATTACTGCTC CAGGTGAATG AACTCGGCGT GCCGACAGCC ACCGAATTTC TCGATATGGT GATCGGACAG TTTATCGCCG ACTTGATTAG CTGGGGCGCG ATTGGCGCGC GTACCACCGA AAGCCAAATC CATCGCGAGA TGGCCTCCGC GCTTTCTTGC CCGGTGGGCT TTAAAAACGG TACGGATGGG AATACGCACA TCGCGATCGA TGCGATCCGC GCCTCGCGTG CCAGCCATAT GTTCCTCTCG CCGGACAAAA ACGGTCAGAT GACCATTTAC CAGACCAGCG GTAACCCGTA CGGCCACATC ATTATGCGTG GGGGCAAAAA GCCCAACTAC CATGCAGAAG ATATCGCCGC GGCGTGCGAC ACGCTGCACG AGTTTGATCT CCCGGAACAT CTGGTGGTCG ATTTCAGCCA TGGCAACTGC CAGAAGCAAC ATCGTCGTCA GTTGGACGTG TGCGATGAAA TTTGTCAGCA AATCCGCAGC GGTTCTACCG CCATCGCCGG CATCATGGCG GAAAGTTTCC TGAAGGAAGG CACGCAAAAG GTCGTGGCAG GACAACCGAT TACCTATGGT CAGTCGATCA CCGATCCGTG TCTGGGCTGG GAAGACAGCG AACTGTTGCT GGAAAAATTA GCCTCCGCCG TCGATAGCCG TTTTTAA
|
Protein sequence | MNKTDELRTA RIESLVTPAE LAQRHPVSAS VAEHVIASRR RIEKILNGED RRLLVVIGPC SIHDLDAALD YAKRLKVLRD KHQDRLEIVM RTYFEKPRTV VGWKGLISDP DLNGSYRVNH GIELARKLLL QVNELGVPTA TEFLDMVIGQ FIADLISWGA IGARTTESQI HREMASALSC PVGFKNGTDG NTHIAIDAIR ASRASHMFLS PDKNGQMTIY QTSGNPYGHI IMRGGKKPNY HAEDIAAACD TLHEFDLPEH LVVDFSHGNC QKQHRRQLDV CDEICQQIRS GSTAIAGIMA ESFLKEGTQK VVAGQPITYG QSITDPCLGW EDSELLLEKL ASAVDSRF
|
| |