Gene Information |
Locus tag | SeD_A1042 |
Symbol | aroA |
ID | 6874734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 1042607 |
End bp | 1043890 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642784228 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_002214902 |
Protein GI | 198246031 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
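The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon. Both assumptions are consistent with the numbers in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1042607
END_BP = 1043890
GENE_LENGTH_BP = 1284
PROTEIN_LENGTH_AA = 427


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1043890 - 1042607 + 1 gives 1284 bp, and 1284 / 3 - 1 gives 427 aa, matching the Gene Length and Protein Length fields.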
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.195117 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 0.218839 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATCCC TGACGTTACA ACCCATCGCG CGGGTCGATG GCGCCATTAA TTTACCTGGC TCCAAAAGTG TTTCAAACCG TGCTTTGCTC CTGGCGGCTT TAGCTTGTGG TAAAACCGTT CTGACGAATC TGCTGGATAG CGATGACGTC CGCCATATGC TCAATGCCCT GAGCGCGTTG GGGATCAATT ACACCCTTTC TGCCGATCGC ACCCGCTGTG ATATCACGGG TAATGGCGGC CCATTACGTG CGCCAGGCGC TCTGGAACTG TTTCTCGGTA ATGCCGGAAC CGCGATGCGT CCGTTAGCGG CAGCGCTATG TCTGGGGCAA AATGAGATAG TGTTAACCGG CGAACCGCGT ATGAAAGAGC GTCCGATAGG CCATCTGGTT GATTCGCTGC GTCAGGGCGG GGCGAATATT GATTACCTGG AGCAGGAAAA TTATCCGCCC CTGCGTCTGC GCGGCGGTTT TATCGGCGGC GACATTGAGG TTGATGGTAG CGTTTCCAGC CAGTTCCTGA CCGCTCTGCT GATGACGGCG CCGCTGGCCC CTAAAGACAC AATTATTCGC GTTAAAGGTG AACTGGTATC AAAACCTTAC ATCGATATCA CGCTAAATTT AATGAAAACC TTTGGCGTGG AGATAGCGAA CCACCACTAC CAACAATTTG TCGTGAAGGG AGGTCAACAG TATCACTCTC CAGGTCGCTA TCTGGTCGAG GGCGATGCCT CGTCAGCGTC CTATTTTCTC GCCGCCGGGG CGATAAAAGG CGGCACGGTA AAAGTGACCG GAATTGGCCG CAAAAGTATG CAGGGCGATA TTCGTTTTGC CGATGTGCTG GAGAAAATGG GCGCGACCAT TACCTGGGGC GATGATTTTA TTGCCTGCAC GCGCGGTGAA TTGCACGCCA TAGATATGGA TATGAACCAT ATTCCGGATG CGGCGATGAC GATTGCCACC ACGGCGCTGT TTGCGAAAGG AACCACGACG TTGCGCAATA TTTATAACTG GCGAGTGAAA GAAACCGATC GCCTGTTCGC GATGGCGACC GAGCTACGTA AAGTGGGCGC TGAAGTCGAA GAAGGGCACG ACTATATTCG TATCACGCCG CCGGCGAAGC TCCAACACGC GGATATTGGC ACGTACAACG ACCACCGTAT GGCGATGTGC TTCTCACTGG TCGCACTGTC CGATACGCCA GTTACGATCC TGGACCCTAA ATGTACCGCA AAAACGTTCC CTGATTATTT CGAACAACTG GCGCGAATGA GTACGCCTGC CTAA
|
Protein sequence | MESLTLQPIA RVDGAINLPG SKSVSNRALL LAALACGKTV LTNLLDSDDV RHMLNALSAL GINYTLSADR TRCDITGNGG PLRAPGALEL FLGNAGTAMR PLAAALCLGQ NEIVLTGEPR MKERPIGHLV DSLRQGGANI DYLEQENYPP LRLRGGFIGG DIEVDGSVSS QFLTALLMTA PLAPKDTIIR VKGELVSKPY IDITLNLMKT FGVEIANHHY QQFVVKGGQQ YHSPGRYLVE GDASSASYFL AAGAIKGGTV KVTGIGRKSM QGDIRFADVL EKMGATITWG DDFIACTRGE LHAIDMDMNH IPDAAMTIAT TALFAKGTTT LRNIYNWRVK ETDRLFAMAT ELRKVGAEVE EGHDYIRITP PAKLQHADIG TYNDHRMAMC FSLVALSDTP VTILDPKCTA KTFPDYFEQL ARMSTPA
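The sequences above are printed in blocks of 10 characters, so the spaces must be stripped before any analysis. The following is a minimal sketch, not the database's own tooling, of how the gene sequence might be cleaned, its GC fraction computed, and its reading frame translated. The helper names are hypothetical. The codon-to-amino-acid mapping is the standard one, which translation table 11 shares (table 11 differs only in which codons may act as start codons).

```python
# Codon table in TCAG order; amino-acid assignments are the standard ones,
# which are also used by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace between the 10-character blocks and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence from this record.
    fragment = clean_sequence("ATGGAATCCC TGACGTTACA ACCCATCGCG")
    print(translate(fragment))  # MESLTLQPIA
```

Applied to the full gene sequence, `translate` should reproduce the protein sequence listed in the record once its spaces are removed, and `gc_fraction` can be compared against the GC content field.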