Gene Information |
Locus tag | SbBS512_E2707 |
Symbol | aroC |
ID | 6273142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2507756 |
End bp | 2508841 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641726671 |
Product | chorismate synthase |
Protein accession | YP_001881151 |
Protein GI | 187732090 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
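The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a stop codon; neither is stated in the record, though both agree with the listed values. The function names are illustrative only.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = feature_length(2507756, 2508841)
print(gene_len)                           # 1086
print(expected_protein_length(gene_len))  # 361
```

Both results match the Gene Length (1086 bp) and Protein Length (361 aa) fields.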
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGGAA ACACAATTGG ACAACTCTTT CGCGTAACCA CCTTCGGCGA ATCGCATGGG CTGGCGCTCG GCTGCATCGT CGATGGTGTT CCGCCAGGCA TTCTGCTGAC GGAAGCGGAC CTGCAACATG ACCTCGACCG TCGTCGCCCT GGGACATCGC GCTATACCAC CCAGCGCCGC GAGCCGGATC AGGTCAAAAT TCTCTCCGGT GTTTTTGAAG GCGTTACTAC TGGCACCAGC ATTGGCTTGT TGATCGAAAA CACTGACCAG CGCTCTCAGG ATTACAGCGC AATTAAAGAC GTTTTCCGCC CAGGCCATGC TGATTACACC TACGAACAAA AATACGGTCT GCGCGATTAT CGCGGCGGTG GACGTTCTTC CGCCCGCGAA ACTGCCATGC GCGTGGCGGC AGGGGCGATT GCCAAAAAAT ATCTCGCCGA GAAATTTGGT ATTGAAATCC GCGGCTGCTT GACCCAGATG GGCGACATTC CGCTGGAAAT CAAAGACTGG TCGCTGGTTG AACAAAATCC GTTCTTCTGC CCGGATCCGG ACAAAATCGA CGCGTTAGAT GAACTGATGC GCGCGCTGAA AAAAGAGGGC GACTCCATCG GCGCGAAAGT CACCGTTGTT GCCAGTGGCG TCCCCGCCGG ACTTGGCGAG CCGGTATTTG ACCGCCTGGA TGCCGACATC GCCCATGCGC TGATGAGCAT CAACGCGGTG AAAGGCGTGG AAATTGGCGA CGGCTTTGAC GTAGTGGCGC TGCGCGGCAG CCAGAACCGC GACGAAATCA CCAAAGACGG TTTCCAGAGC AACCATGCGG GCGGCATTCT CGGCGGTATC AGCAGCGGGC AGCAAATCAT TGCTCATATG GCGCTGAAAC CGACCTCCAG CATTACCGTG CCGGGGCGCA CTATTAACCG CTTTGGCGAA GAAGTTGAGA TGATCACCAA AGGCCGTCAC GATCCCTGTG TCGGGATCCG TGCAGTGCCG ATCGCTGAAG CGATGCTGGC GATCGTTTTA ATGGATCACC TGTTACGGCA GCGGGCGCAA AATGCCGATG TGAAGACTGA TATTCCACGC TGGTAA
|
Protein sequence | MAGNTIGQLF RVTTFGESHG LALGCIVDGV PPGILLTEAD LQHDLDRRRP GTSRYTTQRR EPDQVKILSG VFEGVTTGTS IGLLIENTDQ RSQDYSAIKD VFRPGHADYT YEQKYGLRDY RGGGRSSARE TAMRVAAGAI AKKYLAEKFG IEIRGCLTQM GDIPLEIKDW SLVEQNPFFC PDPDKIDALD ELMRALKKEG DSIGAKVTVV ASGVPAGLGE PVFDRLDADI AHALMSINAV KGVEIGDGFD VVALRGSQNR DEITKDGFQS NHAGGILGGI SSGQQIIAHM ALKPTSSITV PGRTINRFGE EVEMITKGRH DPCVGIRAVP IAEAMLAIVL MDHLLRQRAQ NADVKTDIPR W
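The relationship between the two sequence fields can be checked by translating the gene sequence. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). The helper names are hypothetical, and ambiguous bases such as N are not handled.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence field.
print(translate("ATGGCTGGAA ACACAATTGG ACAACTCTTT"))  # MAGNTIGQLF
```

Passing the full Gene sequence field to `translate` and `gc_percent` gives values to compare against the Protein sequence and GC content fields.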