Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_0688 |
Symbol | aroF |
ID | 4205708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 808176 |
End bp | 809189 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 642565248 |
Product | 3-deoxy-7-phosphoheptulonate synthase |
Protein accession | YP_698014 |
Protein GI | 110802963 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase [TIGR01362] 3-deoxy-8-phosphooctulonate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGTTA TATTAAAACC AGGAACAAAG GAAGAGGAGA TTTTAAAATT TATAAAAAAG ATAGAATCAC TTGGAGTTGA GACTCAAAGA ATTTCTGGAA GTGAAATGTG TGTGATTGGT TTAGTTGGAG ATACAAGTAA AATAGATCCT GCTAAAGTAG AAGCAAACAA AAATGTAGAG AGAATAATGC CTGTTCAAGA GCCCTTTAAA AAGGCAAATA GATTATTCCA TCCAGAGAAC TCTATAATTG ATGTCTTAGG AAATAAAATT GGTGATAAGA AAATAGCATT AATAGCTGGC CCTTGTTCCG TAGAGAGTGA GGAACAAATA ACTGAAATAG CCAAAGAAGT TAAGGCGTTA GGGGCAAGTT TCTTAAGAGG TGGTGCATTT AAACCAAGAA CTTCACCATA CAGTTTCCAA GGGTTAGAAC TTGAAGGATT GGAGCTTCTA AAAAAGGCTA AGGCAAAAAC AGGCCTTCCT ATAGTTACAG AAATAATGTC AACTAGTATG ATAGAAAAGT TTATTGAAGA TGTTGATGTT ATTCAAGTTG GAGCAAGAAA TATGCAAAAC TTTGATCTTT TAAAAGAGCT TGGAAAGACA AATAAACCTA TTCTTTTAAA GAGAGGATTG TCAGCTACAA TAGAGGAACT TATAATGTCA GCAGAGTACA TAATGTCTGG TGGAAATGAA AATGTAATTC TTTGTGAAAG AGGAATTAGA ACCTTTGAAA CTTATACAAG AAATACCTTA GACTTAAGTG CAATACCTGC TATTAAAAAA CTAAGTCATT TACCAGTAAT TGTTGATCCA AGTCATGCAG CAGGAAAGTC ATGGATGGTA GAACCATTAT CAAAGGCAGC CATAGCTGTA GGTGCAGATG GATTAATAAT AGAAGTACAT AATGACCCTG CTAATGCCTT ATGTGATGGT CAACAGTCAA TTAAACCAGA AGAGTACGGA AAGCTTTTGG AAGATTTAAG AGCTATCGCA AAGGCTGTTG GTAGAGAATT ATAG
|
Protein sequence | MIVILKPGTK EEEILKFIKK IESLGVETQR ISGSEMCVIG LVGDTSKIDP AKVEANKNVE RIMPVQEPFK KANRLFHPEN SIIDVLGNKI GDKKIALIAG PCSVESEEQI TEIAKEVKAL GASFLRGGAF KPRTSPYSFQ GLELEGLELL KKAKAKTGLP IVTEIMSTSM IEKFIEDVDV IQVGARNMQN FDLLKELGKT NKPILLKRGL SATIEELIMS AEYIMSGGNE NVILCERGIR TFETYTRNTL DLSAIPAIKK LSHLPVIVDP SHAAGKSWMV EPLSKAAIAV GADGLIIEVH NDPANALCDG QQSIKPEEYG KLLEDLRAIA KAVGREL
|
| |