Gene Information |
Locus tag | Ent638_1245 |
Symbol | |
ID | 5114207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 1370555 |
End bp | 1371607 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640491432 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001175977 |
Protein GI | 146310903 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
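
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the terminal stop codon; both assumptions fit the values listed but are not stated in the record. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Gene Information fields of this record.
length = gene_length(1370555, 1371607)
print(length, protein_length(length))  # prints: 1053 350
```

Both results match the Gene Length (1053 bp) and Protein Length (350 aa) fields.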
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.020469 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.430126 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTATC AGAATGACGA TTTACGCATC AAAGAGATCA ATGAGTTATT ACCTCCTGTA GCGCTCCTTG AAAAATTCCC CGCCACTGAA AACGCTGCGA ACACCGTTTC TCATGCCCGT AAAGCGATCC ACAAAATCCT TAACGGCAAT GACGATCGTT TGCTGGTCGT CATCGGTCCC TGTTCCATTC ACGATCCCGC TGCAGCGAAA GAGTATGCCG CGCGTCTGCT CACGCTTCGC GAGGAATTAA AAGGTGAGCT GGAAGTGGTC ATGCGCGTCT ATTTTGAAAA GCCGCGTACC ACGGTGGGCT GGAAAGGGCT GATCAACGAT CCGCACATGG ACAACAGCTT CCAGATCAAC GACGGACTGC GTATTGCGCG CAAACTGCTT CTGGAAATTA ACGATGCTGG CCTGCCTGCG GCAGGTGAGT TCCTGGATAT GATCACCCCG CAATACCTTG CCGATTTGAT GAGCTGGGGT GCAATTGGTG CCCGTACCAC CGAATCCCAG GTGCACCGCG AATTGGCGTC AGGCCTGTCT TGTCCGGTTG GTTTTAAAAA CGGAACGGAC GGCACGATTA AAGTGGCTAT CGATGCTATC AACGCAGCGG GTGCGCCGCA CTGCTTCCTG TCCGTGACCA AATGGGGTCA CTCCGCGATT GTGAATACCA GCGGTAACGG CGACTGCCAC ATTATTCTGC GTGGCGGCAA AGAGCCAAAC TACAGCGCTA AGCATGTCGA AGAAGTTAAA GCGGGGCTGG AAAAAGCAGG CCTTTCAGCG AAAGTGATGA TTGATTTCAG TCATGCCAAC TCCAGCAAAC AGTTCAAAAA GCAGATGGAA GTCGGCGCAG ACGTTTGTCG GCAACTGATT AGCGGTGAGA ATGCGGTGAT TGGCGTGATG ATTGAGAGCC ATCTGGTAGA AGGTAATCAG AATCTGGAGA GCGGCGAACC GCTGGTCTAC GGCAAGAGCG TGACGGATGC TTGTATTGGC TGGGATGATA CCGATACGAT CCTGCGTCAG TTGGCAGATG CGGTAATAGC GCGTCGCGGA TAA
|
Protein sequence | MNYQNDDLRI KEINELLPPV ALLEKFPATE NAANTVSHAR KAIHKILNGN DDRLLVVIGP CSIHDPAAAK EYAARLLTLR EELKGELEVV MRVYFEKPRT TVGWKGLIND PHMDNSFQIN DGLRIARKLL LEINDAGLPA AGEFLDMITP QYLADLMSWG AIGARTTESQ VHRELASGLS CPVGFKNGTD GTIKVAIDAI NAAGAPHCFL SVTKWGHSAI VNTSGNGDCH IILRGGKEPN YSAKHVEEVK AGLEKAGLSA KVMIDFSHAN SSKQFKKQME VGADVCRQLI SGENAVIGVM IESHLVEGNQ NLESGEPLVY GKSVTDACIG WDDTDTILRQ LADAVIARRG
|
| |