Gene Information |
Locus tag | SeSA_A3598 |
Symbol | aroE |
ID | 6518953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | - |
Start bp | 3471180 |
End bp | 3471998 |
Gene Length | 819 bp |
Protein Length | 272 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642748581 |
Product | shikimate 5-dehydrogenase |
Protein accession | YP_002116345 |
Protein GI | 194735246 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0169] Shikimate 5-dehydrogenase |
TIGRFAM ID | [TIGR00507] shikimate 5-dehydrogenase |
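
The coordinate and length fields above are mutually consistent, and a short sketch can make the arithmetic explicit. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(3471180, 3471998)
print(length)                     # 819
print(protein_length_aa(length))  # 272
```

Both values match the Gene Length and Protein Length rows of this record.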
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00141014 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.799601 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACCT ATGCTGTTTT TGGAAATCCG ATTGCTCACA GCAAATCGCC ATTTATTCAT CAGCAGTTTG CTCAGCAGCT AGATATTGTT CACCCCTATG GTCGCGTGCT GGCCCCTATT AATAATTTCA TTAATACGCT TGATGCCTTT TTCGCGGCAG GGGGAAAAGG CGCAAACATC ACAGTACCTT TTAAAGAGGA GGCGTTTGCG CGATCGGATG AGTTAACGGA ACGAGCATCG CTGGCGGGAG CAGTCAATAC ATTAAAGCGG CTGGAAGATG GTCGTTTGCT TGGCGACAAT ACTGACGGTA TCGGTTTATT AAGCGATCTC AAACGGTTAA ATTTTATCCG CCCAGGATGG CGTATTTTGC TGATTGGCGC GGGCGGCGCA TCCCGGGGCG TGCTGTTACC TCTGCTTTCT TTGGATTGCG CGGTCACTAT CACTAACCGT ACAGCTTCAC GTGCCGAAGC GTTGGCGAAA ATCTTTGCTC ATACCGGCAG CGTTCATGCC ACGGATATGG ACAAGCTGCA TGGTTGTGAG TTTGACCTGA TTATTAATGC GACCTCCAGC GGCATACGGG GCGAAATCCC GGCGATTCCG GCGTCACTTA TTCACCCTTC CCTCTGTTGC TATGACATGT TCTATCAAAA AGGGAATACG CCATTTCTCT CCTGGTGTGT ACAACAGGGC GCAAAACGAT ACGCAGATGG GCTGGGAATG CTGGTGGGGC AGGCTGCACA TGCCGTTTTG CTTTGGCACG GTGTATTACC GCAGGTCGAG CCAGTGATTG AGCTGCTACA GCAGGAATTA TTAGCGTGA
|
Protein sequence | METYAVFGNP IAHSKSPFIH QQFAQQLDIV HPYGRVLAPI NNFINTLDAF FAAGGKGANI TVPFKEEAFA RSDELTERAS LAGAVNTLKR LEDGRLLGDN TDGIGLLSDL KRLNFIRPGW RILLIGAGGA SRGVLLPLLS LDCAVTITNR TASRAEALAK IFAHTGSVHA TDMDKLHGCE FDLIINATSS GIRGEIPAIP ASLIHPSLCC YDMFYQKGNT PFLSWCVQQG AKRYADGLGM LVGQAAHAVL LWHGVLPQVE PVIELLQQEL LA
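
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, assuming the spaces are display formatting only, the blocks can be joined before computing simple statistics such as length or GC fraction. The helper names are hypothetical, and the example string is just the first three blocks of the gene sequence.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace between display blocks and uppercase the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three ten-base blocks of the gene sequence in this record.
example = "ATGGAAACCT ATGCTGTTTT TGGAAATCCG"
seq = join_blocks(example)
print(len(seq))          # 30
print(gc_fraction(seq))  # 0.4
```

Applied to the full gene sequence, the same functions should report a length that can be compared with the Gene Length row and a GC fraction that can be compared with the GC content row.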