Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A0855 |
Symbol | aroG |
ID | 6871338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 847955 |
End bp | 849007 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642784050 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002214725 |
Protein GI | 198245276 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.340404 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTATC AGAACGACGA TTTACGCATT AAAGAAATCA ACGAGTTATT ACCTCCGGTC GCGCTGCTGG AAAAGTTTCC CGCCACGGAA AATGCAGCAA ATACCGTTGC TCACGCGCGC AAAGCCATCC ATAAAATTCT CAAAGGCAAT GACGATCGTC TGCTGGTGGT GATCGGTCCT TGTTCAATTC ATGATCCGGC AGCGGCGAAA GAGTATGCCG CCCGTTTGCT GGCGCTACGT GATGAGCTTC AAGGCGAGCT TGAAATTGTC ATGCGCGTCT ATTTTGAGAA ACCGCGTACC ACCGTCGGCT GGAAAGGGCT GATTAACGAT CCGCACATGG ATAACAGCTT CCAGATTAAC GACGGTCTGC GTATTGCGCG CAAACTGCTG CTGGATATTA ACGACAGCGG CCTGCCTGCC GCCGGCGAAT TCCTCGATAT GATCACGCCG CAATATCTGG CCGATCTGAT GAGCTGGGGC GCCATTGGCG CGCGGACTAC TGAATCCCAG GTTCATCGCG AACTGGCGTC TGGCCTCTCT TGTCCGGTCG GTTTTAAAAA TGGTACTGAT GGCACGATTA AAGTTGCCAT TGACGCCATC AACGCCGCCG GCGCGCCGCA TTGCTTCCTC TCCGTCACTA AATGGGGTCA TTCGGCGATT GTGAATACCA GCGGCAACGG CGACTGCCAT ATCATTCTGC GCGGAGGTAA AGCGCCAAAC TATAGCGCGC AACATGTTGC TGAGGTGAAA GAAGGCCTCA TCAAAGCGGG ACTGACGCCG CAGGTCATGA TCGATTTCAG CCATGCCAAC TCCTGTAAGC AATTTCAAAA GCAGATGGAG GTTTGCGCCG ATGTCTGTCA GCAGATAGCG GGCGGTGAAA AAGCGATTAT TGGCGTGATG GTAGAGAGTC ATCTGGTAGA AGGAAACCAG AGTCTGGAAA GCGGTCAGCC GCTGACCTAC GGTAAAAGCA TTACTGACGC CTGTATTGGC TGGGAAGATA CCGATGCGCT GCTTCGTCAG TTGTCGGCAG CGGTAAAAGC CCGTCGCGGC TAA
|
Protein sequence | MNYQNDDLRI KEINELLPPV ALLEKFPATE NAANTVAHAR KAIHKILKGN DDRLLVVIGP CSIHDPAAAK EYAARLLALR DELQGELEIV MRVYFEKPRT TVGWKGLIND PHMDNSFQIN DGLRIARKLL LDINDSGLPA AGEFLDMITP QYLADLMSWG AIGARTTESQ VHRELASGLS CPVGFKNGTD GTIKVAIDAI NAAGAPHCFL SVTKWGHSAI VNTSGNGDCH IILRGGKAPN YSAQHVAEVK EGLIKAGLTP QVMIDFSHAN SCKQFQKQME VCADVCQQIA GGEKAIIGVM VESHLVEGNQ SLESGQPLTY GKSITDACIG WEDTDALLRQ LSAAVKARRG
|
| |