Gene Information |
Locus tag | SeHA_C2885 |
Symbol | aroF |
ID | 6487853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 2819672 |
End bp | 2820742 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642743050 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002046674 |
Protein GI | 194449242 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
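
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(2819672, 2820742)
print(length)                     # 1071
print(protein_length_aa(length))  # 356
```

Under these assumptions the computed values agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields.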
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 0.412469 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAAAG ACGCGCTGAA TAACGTACGT ATCACCGATG AACAGGTATT AATGACGCCG GAGCAGCTTA AAGCGGCCTT TCCCTTGAGC CTGGCGCAGG AAGCGCAGAT AGCGCAGTCC CGGGGAATCA TTTCTGACAT TATTGCCGGG CGCGATCCGC GTCTGTTGGT GGTATGCGGT CCTTGTTCTA TTCACGATCC TGAAACCGCT CTGGAATATG CCCGTCGATT TAAAGCCCTT GCCGCAGAGG TCAGCGATAG CCTCTATCTG GTAATGCGCG TCTATTTTGA AAAGCCGCGA ACTACCGTCG GCTGGAAAGG GCTGATTAAC GATCCTCACA TGGATGGCTC ATTTGATGTG GAAGCCGGGT TGAAAATAGC GCGTCAGCTA CTGGTGGAAC TGGTGAATAT GGGGTTGCCA TTGGCGACCG AAGCGTTGGA TCCGAACAGC CCGCAATACC TGGGCGATCT GTTTAGCTGG TCGGCGATAG GGGCGCGCAC AACCGAATCG CAAACCCACC GCGAAATGGC GTCTGGTCTT TCTATGCCGG TCGGCTTTAA AAACGGCACG GATGGCAGCC TGGCGACAGC GATTAACGCC ATGCGCGCCG CTGCGCAACC TCATCGTTTT GTTGGCATTA ACCAGGCCGG TCAGGTTGCG TTATTGCAAA CCCAGGGAAA TCCGCATGGC CATGTGATTC TGCGTGGCGG CAAAGCGCCA AACTATAGCC CGGCAGATGT CGCTCAGTGT GAAAAAGAGA TGGAACAGGC GGGACTACGT CCTTCGCTGA TGGTAGATTG CAGTCATGGT AACTCCAATA AAGATTATCG CCGCCAGCCA GCCGTTGCCG AATCTGTGGT TGCGCAGATT AAAGATGGCA ATCGTTCAAT CATTGGCTTA ATGATTGAAA GTAATATTCA TGAGGGTAAC CAGTCTTCCG AACAGCCGCG CAGCGAAATG AAGTATGGCG TTTCCGTCAC CGATGCTTGT ATTAGCTGGG AGATGACCGA TGCCCTGTTA CGTGAAATTC ATAAAGATTT GAGCGGCCAG CTGGCGGTGC GCGTCGCATA A
|
Protein sequence | MQKDALNNVR ITDEQVLMTP EQLKAAFPLS LAQEAQIAQS RGIISDIIAG RDPRLLVVCG PCSIHDPETA LEYARRFKAL AAEVSDSLYL VMRVYFEKPR TTVGWKGLIN DPHMDGSFDV EAGLKIARQL LVELVNMGLP LATEALDPNS PQYLGDLFSW SAIGARTTES QTHREMASGL SMPVGFKNGT DGSLATAINA MRAAAQPHRF VGINQAGQVA LLQTQGNPHG HVILRGGKAP NYSPADVAQC EKEMEQAGLR PSLMVDCSHG NSNKDYRRQP AVAESVVAQI KDGNRSIIGL MIESNIHEGN QSSEQPRSEM KYGVSVTDAC ISWEMTDALL REIHKDLSGQ LAVRVA
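
The sequences above are displayed in blocks of ten characters separated by spaces. A minimal sketch of how such a field could be cleaned and translated follows. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAA, although the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons). The helper names are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces between ten-character blocks and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = clean_sequence("ATGCAAAAAG ACGCGCTGAA TAACGTACGT")
print(translate(prefix))  # MQKDALNNVR
```

The translated prefix matches the first block of the protein sequence listed in the record. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare with the GC content field.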