Gene Information |
Locus tag | SeHA_C1076 |
Symbol | aroA |
ID | 6492098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 1067023 |
End bp | 1068306 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642741318 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_002044970 |
Protein GI | 194449431 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
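The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon is not translated


# Values taken from the record above.
print(gene_length(1067023, 1068306))  # 1284
print(protein_length(1284))           # 427
```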
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.28186 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 0.115677 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGAATCCC TGACGTTACA ACCCATCGCG CGGGTCGATG GCGCCATTAA TTTACCTGGC TCCAAAAGTG TTTCAAACCG TGCTTTGCTC CTGGCGGCTT TAGCTTGTGG TAAAACCGTT CTGACGAATC TGCTGGATAG CGATGACGTC CGCCATATGC TCAATGCCCT GAGCGCGTTG GGGATCAATT ACACCCTTTC TGCCGATCGC ACCCGCTGTG ATATCACGGG TAATGGCGGC CCATTACGCG CGTCAGGCGC TCTGGAACTG TTTCTCGGTA ATGCCGGAAC CGCGATGCGT CCGTTAGCGG CAGCGCTATG TCTGGGGCAA AATGAGATAG TGTTAACCGG CGAACCGCGT ATGAAAGAGC GTCCGATAGG CCATCTGGTC GATTCGCTGC GTCAGGGTGG GGCGAATATT GATTACCAGG AGCAGGAAAA CTATCCGCCC CTGCGTCTGC GCGGCGGTTT TACCGGCGGC GACATTGAGG TTGATGGTAG CGTTTCCAGC CAGTTCCTGA CCGCTCTGCT GATGACGGCG CCGCTGGCGC CTGAAGACAC AACTATTCGC GTTAAAGGCG AACTGGTATC AAAACCTTAC ATCGATATCA CGCTAAATTT AATGAAAACC TTTGGCGTGG AGATAACGAA CCATCACTAC CAACAATTTG TCGTGAAGGG CGGTCAACAG TATCACTCTC CGGGTCGCTA TCTGGTCGAG GGCGATGCCT CGTCAGCGTC CTATTTTCTC GCCGCCGGGG CGATAAAAGG CGGCACGGTA AAAGTGACCG GAATTGGCCG CAAAAGTATG CAGGGCGATA TTCGTTTTGC CGATGTGCTG GAGAAAATGG GCGCGACCAT TACCTGGGGC GATGATTTTA TTGCCTGCAC GCGCGGCGAA TTGCACGCCA TAGATATGGA TATGAACCAT ATTCCGGATG CGGCGATGAC GATTGCCACC ACGGCGCTGT TTGCGAAAGG AACCACGACG TTGCGCAATA TTTATAACTG GCGAGTGAAA GAAACCGATC GCCTGTTCGC GATGGCGACC GAGCTACGTA AAGTGGGCGC TGAAGTCGAA GAAGGGCACG ACTATATTCG TATCACGCCG CCGGCGAAGC TCCAACACGC GGATATTGGC ACGTACAACG ACCACCGTAT GGCGATGTGC TTCTCACTGG TCGCACTGTC CGATACGCCA GTCACGATCC TGGATCCGAA ATGTACCGCA AAAACGTTCC CTGATTATTT CGAACAACTG GCGCGGATGA GTACGCCTGC CTAA
Protein sequence | MESLTLQPIA RVDGAINLPG SKSVSNRALL LAALACGKTV LTNLLDSDDV RHMLNALSAL GINYTLSADR TRCDITGNGG PLRASGALEL FLGNAGTAMR PLAAALCLGQ NEIVLTGEPR MKERPIGHLV DSLRQGGANI DYQEQENYPP LRLRGGFTGG DIEVDGSVSS QFLTALLMTA PLAPEDTTIR VKGELVSKPY IDITLNLMKT FGVEITNHHY QQFVVKGGQQ YHSPGRYLVE GDASSASYFL AAGAIKGGTV KVTGIGRKSM QGDIRFADVL EKMGATITWG DDFIACTRGE LHAIDMDMNH IPDAAMTIAT TALFAKGTTT LRNIYNWRVK ETDRLFAMAT ELRKVGAEVE EGHDYIRITP PAKLQHADIG TYNDHRMAMC FSLVALSDTP VTILDPKCTA KTFPDYFEQL ARMSTPA
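The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline: the `translate` helper is hypothetical, it uses the amino acid assignments shared by the standard code and translation table 11, and it ignores table 11's alternative start codons, which does not matter here because the gene starts with ATG. Only short excerpts from the start and end of the gene sequence are used as examples.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a plus-strand CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGGAATCCC TGACGTTACA ACCCATCGCG"))  # MESLTLQPIA
print(translate("AGTACGCCTGCCTAA"))                   # STPA
```

Passing the full gene sequence from the record to `translate` should yield the 427-residue protein listed above, under the assumptions stated.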