Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A0919 |
Symbol | aroG |
ID | 6519830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | + |
Start bp | 889341 |
End bp | 890393 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642746051 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002113862 |
Protein GI | 194734297 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.593552 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTATC AGAACGACGA TTTACGCATT AAAGAAATCA ACGAGTTATT ACCTCCGGTC GCGCTGCTGG AAAAGTTTCC CGCTACGGAA AATGCAGCGA ATACCGTTGC TCACGCGCGC AAAGCCATCC ATAAAATTCT CAAAGGCAAT GACGATCGTC TGCTGGTGGT GATCGGTCCT TGTTCAATTC ATGATCCGGC AGCGGCGAAA GAGTATGCCG CCCGTTTGCT GGCGCTACGC GATGAGCTTC AAGGCGAGCT TGAAATTGTC ATGCGCGTCT ATTTTGAGAA ACCGCGTACT ACCGTCGGCT GGAAAGGGCT GATTAACGAT CCGCACATGG ATAACAGCTT CCAGATTAAC GACGGTCTGC GTATTGCGCG CAAACTGCTG CTGGATATTA ACGACAGCGG CCTGCCTGCC GCCGGCGAAT TCCTCGATAT GATCACGCCG CAATATCTGG CCGATCTGAT GAGCTGGGGT GCCATTGGCG CGCGGACTAC TGAATCCCAG GTTCATCGCG AACTGGCGTC TGGCCTCTCT TGTCCGGTCG GTTTTAAAAA TGGTACTGAT GGCACGATTA AAGTCGCCAT TGACGCCATC AACGCCGCCG GCGCGCCGCA TTGCTTCCTC TCCGTCACTA AATGGGGTCA TTCGGCGATT GTGAATACCA GCGGCAACGG CGACTGCCAT ATCATTCTGC GCGGCGGTAA AGCGCCAAAC TATAGTGCGC AACATGTTGC TGAGGTGAAA GAAGGCCTCA CCAAAGCGGG ACTGACGCCG CAGGTCATGA TCGATTTCAG CCATGCCAAC TCCTGTAAGC AATTTCAAAA GCAGATGGAG GTTTGCGCCG ATGTCTGTCA GCAGATAGCG GGCGGTGAAA AAGCGATTAT TGGCGTGATG GTAGAGAGTC ATCTGGTAGA AGGAAACCAG AGTCTGGAAA GCGGTCAGCC GCTGACCTAC GGTAAAAGCA TTACTGACGC CTGTATTGGC TGGGAAGATA CCGATGCGCT GCTTCGTCAG TTGTCGGCAG CGGTAAAAGC CCGTCGTGGT TAA
|
Protein sequence | MNYQNDDLRI KEINELLPPV ALLEKFPATE NAANTVAHAR KAIHKILKGN DDRLLVVIGP CSIHDPAAAK EYAARLLALR DELQGELEIV MRVYFEKPRT TVGWKGLIND PHMDNSFQIN DGLRIARKLL LDINDSGLPA AGEFLDMITP QYLADLMSWG AIGARTTESQ VHRELASGLS CPVGFKNGTD GTIKVAIDAI NAAGAPHCFL SVTKWGHSAI VNTSGNGDCH IILRGGKAPN YSAQHVAEVK EGLTKAGLTP QVMIDFSHAN SCKQFQKQME VCADVCQQIA GGEKAIIGVM VESHLVEGNQ SLESGQPLTY GKSITDACIG WEDTDALLRQ LSAAVKARRG
|
| |