Gene Information |
Locus tag | SeSA_A1442 |
Symbol | aroH |
ID | 6519349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | - |
Start bp | 1389367 |
End bp | 1390413 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642746559 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002114364 |
Protein GI | 194735243 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
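The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1389367
end_bp = 1390413

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1047 348
```

Both results agree with the Gene Length (1047 bp) and Protein Length (348 aa) rows.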
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0985484 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGAA CCGACGAACT CCGTACTGCG CGTATCGACA GCCTGGTAAC ACCGACCGAA CTCGCGCAGC GGTATCCTGT ATCGTCCTCC GTCGCCAGTC ACGTTACCGA CTCCCGACGC CGGATAGAAA AGATTTTAAA TGGTGAAGAT CCACGGCTAC TGGTCGTCAT TGGCCCCTGT TCGATTCACG ATCTGAATGC TGCTATGGAA TACGCGACGC AGCTCCAGAC ACAACGCCAA AAGCATCAGG CGCGTCTGGA AATCGTCATG CGCACCTATT TTGAAAAACC GCGCACCGTC GTGGGATGGA AAGGCCTGAT TTCCGATCCC GACTTGAATG GAAGTTACCG CGTCAATTAT GGGCTTGAAC TGGCGCGTCG CTTGCTATTG CAGGTGAACG AACTGGGAGT ACCGACCGCC ACAGAATTTC TTGATATGGT CACCGGCCAG TTTATTGCCG ATCTGATCAG TTGGGGAGCG ATTGGCGCGC GTACCACCGA AAGCCAAATC CATCGGGAAA TGGCTTCTGC GCTCTCTTGT CCGGTCGGCT TTAAAAATGG TACGGATGGC AATACTCGCA TTGCCGTTGA CGCTATTCGC GCCTCCCGCG CCAGCCATAT GTTTCTCTCG CCGGATAAAG ACGGACAGAT GACCATCTAC CAGACGAGTG GCAACCCGTA TGGGCACATC ATCATGCGCG GCGGTAAAAA ACCGAACTAC CACGCTGAAG ATATTGCCGC CGCCTGCGAC ACGTTGCATG AATTTGATCT GCCGGAACAT CTGGTCGTCG ACTTCAGCCA CGGCAACTGT CAAAAACAGC ATCGCCGCCA GTTGGAGGTA TGTGATGATA TTTGCCAGCA GATTCGTAAT GGCTCCACGG CAATAGCCGG GATTATGGCC GAGAGTTTTT TACGGGAAGG CACGCAAAAA ATTATCAGCG GTCAACCATT AATCTATGGT CAGTCCATTA CCGATCCCTG CCTGAACTGG GAAGATACGG AAGTTTTGTT GGAAAAACTT GCCGCGGCGG TAGATAGCCG CTTTTAA
|
Protein sequence | MNRTDELRTA RIDSLVTPTE LAQRYPVSSS VASHVTDSRR RIEKILNGED PRLLVVIGPC SIHDLNAAME YATQLQTQRQ KHQARLEIVM RTYFEKPRTV VGWKGLISDP DLNGSYRVNY GLELARRLLL QVNELGVPTA TEFLDMVTGQ FIADLISWGA IGARTTESQI HREMASALSC PVGFKNGTDG NTRIAVDAIR ASRASHMFLS PDKDGQMTIY QTSGNPYGHI IMRGGKKPNY HAEDIAAACD TLHEFDLPEH LVVDFSHGNC QKQHRRQLEV CDDICQQIRN GSTAIAGIMA ESFLREGTQK IISGQPLIYG QSITDPCLNW EDTEVLLEKL AAAVDSRF
|
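The gene and protein sequences above can be checked against each other, and against the GC content row, with a short script. This is a minimal sketch, not the database's own method. It assumes that the gene sequence is given in coding orientation (it starts with ATG even though the gene lies on the minus strand) and that translation table 11 assigns amino acids to codons exactly as the standard code does. The function names `strip_blocks`, `gc_percent` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_blocks(seq):
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a DNA sequence."""
    dna = strip_blocks(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = strip_blocks(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence and first block of the protein.
gene_start = "ATGAACAGAA CCGACGAACT CCGTACTGCG"
protein_start = "MNRTDELRTA"

print(translate(gene_start) == strip_blocks(protein_start))  # prints: True
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared with the full protein sequence and with the reported GC content.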
| |