Gene Information |
Locus tag | SNSL254_A2573 |
Symbol | aroC |
ID | 6485537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2493511 |
End bp | 2494596 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642737908 |
Product | chorismate synthase |
Protein accession | YP_002041649 |
Protein GI | 194444824 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
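
The coordinate and length fields above appear to be mutually consistent, and a short sketch can confirm this. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2493511
end_bp = 2494596

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon,
# which does not encode an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1086 361
```

Under these assumptions the result matches the listed Gene Length (1086 bp) and Protein Length (361 aa).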
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 79 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGGAA ACACAATTGG ACAACTCTTT CGCGTAACCA CTTTCGGCGA ATCACACGGG CTGGCGCTTG GGTGTATCGT CGATGGCGTG CCGCCCGGCA TCCCGTTGAC GGAGGCCGAT CTGCAACACG ATCTCGACAG ACGCCGCCCC GGCACCTCGC GCTATACTAC CCAGCGCCGC GAACCGGACC AGGTAAAAAT TCTCTCCGGC GTGTTTGATG GCGTGACGAC CGGCACCAGC ATTGGCCTAC TGATTGAAAA CACCGATCAG CGCTCGCAGG ACTACAGCGC GATTAAAGAT GTTTTTCGTC CGGGACACGC GGATTACACC TATGAGCAGA AATACGGCCT GCGCGATTAC CGTGGCGGTG GACGTTCTTC CGCGCGTGAA ACCGCGATGC GCGTAGCGGC AGGGGCGATC GCCAAGAAAT ACCTGGCGGA AAAGTTCGGC ATCGAAATCC GCGGCTGCCT GACCCAGATG GGCGACATTC CGCTGGAGAT TAAAGACTGG CGTCAGGTTG AGCTTAATCC GTTCTTTTGT CCCGATGCGG ACAAACTTGA CGCGCTGGAC GAACTGATGC GCGCGCTGAA AAAAGAGGGT GACTCCATCG GCGCGAAAGT GACGGTGATG GCGAGCGGCG TGCCGGCAGG GCTTGGCGAA CCGGTATTTG ACCGACTGGA TGCGGACATC GCCCATGCGC TGATGAGCAT TAATGCGGTG AAAGGCGTGG AGATCGGCGA AGGATTTAAC GTGGTGGCGC TGCGCGGCAG CCAGAATCGC GATGAAATCA CGGCGCAGGG TTTTCAGAGC AACCACGCTG GCGGCATCCT CGGTGGCATC AGTAGCGGGC AACACATTGT GGCGCATATG GCGCTGAAAC CTACCTCCAG CATTACCGTG CCGGGACGTA CGATCAACCG GGCAGGTGAA GAAGTCGAAA TGATCACCAA AGGGCGCCAC GATCCGTGTG TGGGGATTCG CGCAGTGCCG ATCGCAGAAG CCATGCTGGC GATCGTGCTG ATGGATCACC TGCTGCGCCA TCGGGCACAG AATGCGGATG TAAAGACAGA GATTCCACGC TGGTAA
|
Protein sequence | MAGNTIGQLF RVTTFGESHG LALGCIVDGV PPGIPLTEAD LQHDLDRRRP GTSRYTTQRR EPDQVKILSG VFDGVTTGTS IGLLIENTDQ RSQDYSAIKD VFRPGHADYT YEQKYGLRDY RGGGRSSARE TAMRVAAGAI AKKYLAEKFG IEIRGCLTQM GDIPLEIKDW RQVELNPFFC PDADKLDALD ELMRALKKEG DSIGAKVTVM ASGVPAGLGE PVFDRLDADI AHALMSINAV KGVEIGEGFN VVALRGSQNR DEITAQGFQS NHAGGILGGI SSGQHIVAHM ALKPTSSITV PGRTINRAGE EVEMITKGRH DPCVGIRAVP IAEAMLAIVL MDHLLRHRAQ NADVKTEIPR W
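
The relationship between the two sequences can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record. It assumes the gene sequence is already given on the coding strand (it starts with ATG and ends with TAA even though the gene lies on the minus strand) and that the spaces only group bases in tens. The helper names `clean`, `translate`, and `gc_content` are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for grouping and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
prefix = "ATGGCAGGAA ACACAATTGG ACAACTCTTT"
print(translate(prefix))  # MAGNTIGQLF
```

The printed peptide matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.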
|
| |