Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_3080 |
Symbol | |
ID | 5112619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 3357619 |
End bp | 3358689 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640493278 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001177795 |
Protein GI | 146312721 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.674249 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0670847 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAAAG ACGCGCTGAA CAACGTACAT ATTACCGACG AACAAGTTTT GATCACTCCG GATCAGTTAA AGCTGGAATT CCCACTGAGT TCTGAGCAAG AAGCGCAAAT CGAGCAGTCG CGCCAGACCA TCTCTGACAT CATTGCCGGT CGCGATCCGC GCCTGCTGGT GGTGTGTGGT CCATGCTCGA TTCACGATCC TGAAACCGCG ATTGAATATG CTCGTCGATT TAAAGTGTTA GCCGAAGAGG TCAGCGATAG CCTCTACCTG GTCATGCGCG TCTATTTTGA AAAACCTCGT ACCACCGTGG GCTGGAAGGG GTTGATCAAC GACCCGCACA TGGATGGTTC GTTTGATGTT GAATCCGGCC TGAAGATTGC GCGTCATCTG CTGGTTGAAC TGGTCAGCAT GGGTCTGCCA CTGGCGACTG AAGCGCTCGA TCCTAATAGC CCGCAATACC TCGGCGATCT GTTTAGCTGG TCTGCGATTG GTGCACGTAC AACCGAATCA CAAACTCACC GCGAGATGGC TTCTGGCCTG TCGATGCCGG TCGGTTTTAA AAATGGCACC GATGGCAGTC TGGCAACCGC CATCAACGCC ATGCGTGCCG CTGCTATGCC GCACCGTTTT GTCGGGATTA ACCAGGCCGG CCAGGTTTGC CTGCTGCAAA CTCAGGGTAA CCCTGATGGA CATGTGATTT TGCGCGGCGG TAAAGCACCG AACTACAGCC CTGCGGATGT GGCGCAGTGT GAAAAAGAGA TGGAGCAGGC GGGACTGCGT CCGGCTCTGA TGGTAGATTG CAGCCATGGT AATTCGAACA AAGATTACCG TCGCCAGCCT GCGGTTGCGG AATCCGTGGT CGCCCAAATT AAAGATGGTA ACCGTTCTAT TATTGGACTG ATGATTGAGA GCAATATCCA TGAAGGTAAT CAGTCGTCTG AACAGCCGCG CAGTGCCATG AAACACGGTG TATCCGTTAC GGACGCCTGT ATCAGTTGGG AAGCGACAGA CGCTTTGCTG CATGAGATCC ACAAAGATTT GAACGGTCAA CTGGCGACGC GTCTGGCTTA A
|
Protein sequence | MQKDALNNVH ITDEQVLITP DQLKLEFPLS SEQEAQIEQS RQTISDIIAG RDPRLLVVCG PCSIHDPETA IEYARRFKVL AEEVSDSLYL VMRVYFEKPR TTVGWKGLIN DPHMDGSFDV ESGLKIARHL LVELVSMGLP LATEALDPNS PQYLGDLFSW SAIGARTTES QTHREMASGL SMPVGFKNGT DGSLATAINA MRAAAMPHRF VGINQAGQVC LLQTQGNPDG HVILRGGKAP NYSPADVAQC EKEMEQAGLR PALMVDCSHG NSNKDYRRQP AVAESVVAQI KDGNRSIIGL MIESNIHEGN QSSEQPRSAM KHGVSVTDAC ISWEATDALL HEIHKDLNGQ LATRLA
|
| |