Gene CPR_0688 details


Gene Information

Locus tag: CPR_0688
Symbol: aroF
ID: 4205708
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 808176
End bp: 809189
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 35%
IMG OID: 642565248
Product: 3-deoxy-7-phosphoheptulonate synthase
Protein accession: YP_698014
Protein GI: 110802963
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
            [TIGR01362] 3-deoxy-8-phosphooctulonate synthase
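
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


length = gene_length(808176, 809189)
print(length, protein_length(length))  # 1014 337
```

Both values agree with the Gene Length and Protein Length fields listed in the record.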


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAGTTA TATTAAAACC AGGAACAAAG GAAGAGGAGA TTTTAAAATT TATAAAAAAG 
ATAGAATCAC TTGGAGTTGA GACTCAAAGA ATTTCTGGAA GTGAAATGTG TGTGATTGGT
TTAGTTGGAG ATACAAGTAA AATAGATCCT GCTAAAGTAG AAGCAAACAA AAATGTAGAG
AGAATAATGC CTGTTCAAGA GCCCTTTAAA AAGGCAAATA GATTATTCCA TCCAGAGAAC
TCTATAATTG ATGTCTTAGG AAATAAAATT GGTGATAAGA AAATAGCATT AATAGCTGGC
CCTTGTTCCG TAGAGAGTGA GGAACAAATA ACTGAAATAG CCAAAGAAGT TAAGGCGTTA
GGGGCAAGTT TCTTAAGAGG TGGTGCATTT AAACCAAGAA CTTCACCATA CAGTTTCCAA
GGGTTAGAAC TTGAAGGATT GGAGCTTCTA AAAAAGGCTA AGGCAAAAAC AGGCCTTCCT
ATAGTTACAG AAATAATGTC AACTAGTATG ATAGAAAAGT TTATTGAAGA TGTTGATGTT
ATTCAAGTTG GAGCAAGAAA TATGCAAAAC TTTGATCTTT TAAAAGAGCT TGGAAAGACA
AATAAACCTA TTCTTTTAAA GAGAGGATTG TCAGCTACAA TAGAGGAACT TATAATGTCA
GCAGAGTACA TAATGTCTGG TGGAAATGAA AATGTAATTC TTTGTGAAAG AGGAATTAGA
ACCTTTGAAA CTTATACAAG AAATACCTTA GACTTAAGTG CAATACCTGC TATTAAAAAA
CTAAGTCATT TACCAGTAAT TGTTGATCCA AGTCATGCAG CAGGAAAGTC ATGGATGGTA
GAACCATTAT CAAAGGCAGC CATAGCTGTA GGTGCAGATG GATTAATAAT AGAAGTACAT
AATGACCCTG CTAATGCCTT ATGTGATGGT CAACAGTCAA TTAAACCAGA AGAGTACGGA
AAGCTTTTGG AAGATTTAAG AGCTATCGCA AAGGCTGTTG GTAGAGAATT ATAG
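
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation such as GC content. The following is a minimal sketch of that cleanup; the helper names `normalize` and `gc_content` are hypothetical, and only the first row of the sequence is used here for brevity.

```python
def normalize(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a contiguous nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGATAGTTA TATTAAAACC AGGAACAAAG GAAGAGGAGA TTTTAAAATT TATAAAAAAG"
seq = normalize(first_row)
print(len(seq))  # 60
print(round(gc_content(seq), 1))
```

Running `gc_content` on the full 1014 bp sequence, rather than on one row, gives the figure to compare with the GC content field.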
 
Protein sequence
MIVILKPGTK EEEILKFIKK IESLGVETQR ISGSEMCVIG LVGDTSKIDP AKVEANKNVE 
RIMPVQEPFK KANRLFHPEN SIIDVLGNKI GDKKIALIAG PCSVESEEQI TEIAKEVKAL
GASFLRGGAF KPRTSPYSFQ GLELEGLELL KKAKAKTGLP IVTEIMSTSM IEKFIEDVDV
IQVGARNMQN FDLLKELGKT NKPILLKRGL SATIEELIMS AEYIMSGGNE NVILCERGIR
TFETYTRNTL DLSAIPAIKK LSHLPVIVDP SHAAGKSWMV EPLSKAAIAV GADGLIIEVH
NDPANALCDG QQSIKPEEYG KLLEDLRAIA KAVGREL
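
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming that translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in which codons may initiate translation, which does not matter here because the gene starts with ATG). The `translate` helper is illustrative; a library such as Biopython could be used instead.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq_text: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon terminates the protein
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence, copied from the record.
first_row = "ATGATAGTTA TATTAAAACC AGGAACAAAG GAAGAGGAGA TTTTAAAATT TATAAAAAAG"
print(translate(first_row))  # MIVILKPGTKEEEILKFIKK
```

The output matches the first 20 residues of the protein sequence listed in the record.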