Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1082 |
Symbol | |
ID | 6065529 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 1171754 |
End bp | 1172824 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641600494 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001724076 |
Protein GI | 170019122 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0673858 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00039 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAAAAAG ACGCGCTGAA TAACGTACAT ATTACCGACG AACAGGTTTT AATGACTCCG GAACAACTGA AGGCCGCTTT TCCATTGAGC CTGCAACAAG AAGCCCAGAT TGCTGACTCG CGTAAAACCA TTTCAGATAT TATCGCCGGG CGCGATCCTC GTCTGCTGGT AGTATGTGGT CCTTGTTCCA TTCATGATCC GGAAACTGCT CTGGAATATG CTCGTCGATT TAAAGCCCTT GCCGCAGAGG TCAGCGATAG CCTCTATCTG GTAATGCGCG TCTATTTTGA AAAACCCCGT ACCACTGTCG GCTGGAAAGG GTTAATTAAC GATCCCCATA TGGATGGCTC TTTTGATGTA GAAGCCGGGC TGCAGATCGC GCGTAAATTG CTGCTTGAGC TGGTGAATAT GGGACTGCCA CTGGCGACGG AAGCGTTAGA TCCGAATAGC CCGCAATACC TGGGCGATCT GTTTAGCTGG TCAGCAATTG GTGCTCGTAC AACGGAATCG CAAACTCACC GTGAAATGGC CTCCGGGCTT TCCATGCCGG TTGGTTTTAA AAACGGCACC GACGGCAGTC TGGCAACAGC AATTAACGCT ATGCGCGCCG CCGCCCAGCC GCACCGTTTT GTTGGCATTA ACCAGGCAGG GCAGGTTGCG TTGCTACAAA CTCAGGGGAA TCCGGACGGC CATGTGATCC TGCGCGGTGG TAAAGCGCCG AACTATAGCC CTGCGGATGT TGCGCAATGT GAAAAAGAGA TGGAACAGGC GGGACTGCGC CCGTCTCTGA TGGTAGATTG CAGCCACGGT AATTCCAATA AAGATTATCG CCGTCAGCCT GCGGTGGCAG AATCCGTGGT TGCTCAAATC AAAGATGGCA ATCGCTCAAT TATTGGTCTG ATGATCGAAA GTAATATCCA CGAGGGCAAT CAGTCTTCCG AGCAACCGCG CAGTGAAATG AAATACGGTG TATCCGTAAC CGATGCCTGC ATTAGCTGGG AAATGACCGA TGCCTTGCTG CGTGAAATTC ATCAGGATCT GAACGGGCAG CTGACGGCTC GCGTGGCTTA A
|
Protein sequence | MQKDALNNVH ITDEQVLMTP EQLKAAFPLS LQQEAQIADS RKTISDIIAG RDPRLLVVCG PCSIHDPETA LEYARRFKAL AAEVSDSLYL VMRVYFEKPR TTVGWKGLIN DPHMDGSFDV EAGLQIARKL LLELVNMGLP LATEALDPNS PQYLGDLFSW SAIGARTTES QTHREMASGL SMPVGFKNGT DGSLATAINA MRAAAQPHRF VGINQAGQVA LLQTQGNPDG HVILRGGKAP NYSPADVAQC EKEMEQAGLR PSLMVDCSHG NSNKDYRRQP AVAESVVAQI KDGNRSIIGL MIESNIHEGN QSSEQPRSEM KYGVSVTDAC ISWEMTDALL REIHQDLNGQ LTARVA
|
| |