Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_0691 |
Symbol | aroC |
ID | 4206008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 811723 |
End bp | 812796 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642565251 |
Product | chorismate synthase |
Protein accession | YP_698017 |
Protein GI | 110803913 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0522046 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGGAG TTTGGGGTAA TAAAATAAAA TTATCTATAT TTGGAGAATC TCATGGAGAA GGAATAGGAA TAGTAATAGA TGGAATAGAA CCTGGAATAA AAATAAATAT GGATAACATA GAAAAAGATA TGGAAAGAAG AGCACCAGGA AGAAATAGTT TATCAACTCA AAGAAAAGAA GGGGATAAAC CAGAAATTTT AAGTGGAATA TTTAATGGAA TCACCACAGG GGCTCCTATT TCAATGATAA TAAGAAATAC AGATAAAAGA TCTAGGGATT ATTCAAAAAT AAAAGATGTA ATGAGACCAG GCCATGCAGA TTTCCCAGGA TATATAAGAT ATAATGGCTT TAATGATTAT AGAGGGGGAG GACATTTCTC AGGAAGAATA ACAGCGCCTT TAGTTTTTGC TGGAGCCTTA GCTAAGGAAA TGCTTAAGGA AAAAGGTATA ACTATTGGTT CTCATATTAA GCAAGTTGGA AAAGTTAAGG ATTCTTCCTT TGATGCATTA AATTTAAAGA AAGAAGATTT AGAAGAACTT TTAACTAAAG AACTTCCAGT AATAGATGCA AATAAAATAG AAGAAATTAA GGAAGAGATT ACTTCATATA GAATGGAAGG AGATTCTATT GGAGGAATTG TTGAGTGTGC CATAGTAGGA CTAGAGGCTG GTATAGGAAA TCCATTCTTT GATTCTTTAG AAAGTACCAT AGCTCATTTA GCTTTTTCAG TGCCTGCTGT AAAGGGAATT GAATTTGGAG AAGGTTTTGA CTTTGCAAAT ATGAAAGGAT CAGAAGCAAA TGACGAATAT TTCATAGAAG ATGAAAAAGT TAAGACATAC TCTAATAATA ATGGAGGAAT AACTGGTGGA ATATCAAATG GAATGCCAGT TATATTCAGA GTTGTTATAA AACCTACACC ATCTATTTCT AAAGAACAAA GAACTATAAA TATAAAAAAT ATGACAGAGG AAGTTCTAAG TGTAAATGGT AGACATGATC CTTGTATAGT TCAAAGAGCC TTAGTTGTTA TAGAGGCCAT TGCAGCTATT TCTATATTAG AGTTAATAAA ATAA
|
Protein sequence | MGGVWGNKIK LSIFGESHGE GIGIVIDGIE PGIKINMDNI EKDMERRAPG RNSLSTQRKE GDKPEILSGI FNGITTGAPI SMIIRNTDKR SRDYSKIKDV MRPGHADFPG YIRYNGFNDY RGGGHFSGRI TAPLVFAGAL AKEMLKEKGI TIGSHIKQVG KVKDSSFDAL NLKKEDLEEL LTKELPVIDA NKIEEIKEEI TSYRMEGDSI GGIVECAIVG LEAGIGNPFF DSLESTIAHL AFSVPAVKGI EFGEGFDFAN MKGSEANDEY FIEDEKVKTY SNNNGGITGG ISNGMPVIFR VVIKPTPSIS KEQRTINIKN MTEEVLSVNG RHDPCIVQRA LVVIEAIAAI SILELIK
|
| |