Gene CPR_1787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1787 
Symboldxs 
ID4205334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1986054 
End bp1987913 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content30% 
IMG OID642566337 
Product1-deoxy-D-xylulose-5-phosphate synthase 
Protein accessionYP_699102 
Protein GI110802844 
COG category[H] Coenzyme transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG1154] Deoxyxylulose-5-phosphate synthase 
TIGRFAM ID[TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAAA TTCTACAGAA AATAACTGAT CCTAAAGAAA TAAAGGATTT AGATGAGAAA 
GAATTAGAAA TGTTAGCTAA AGAGCTAAGA GAATTTTTAA TAGAAAGCGT TTCAAATACT
GGTGGACATT TTGCTTCAAA CTTAGGTGTT ATAGATCTAA CAGTAGCTTT ATTTAAAAAT
TTTGATTTTA GTGAAAATAG AATAATATGG GATGTAGGAC ATCAATCTTA TGCTTATAAG
ATATTAACTG GAAGAAAAGA TAAATTTAAT ACTTTAAGAC AATATGGTGG ATTGTGTGGT
TTTCCTAAGA GGACAGAAAG CGAATATGAT TTTTTTGCCA CTGGACATAG TAGTACATCA
TTATCTTCAG CAGCAGGTAT GGCTAGAGCT CAGAAGATTC TTGGAAAAGA TAATAAGGTC
ATAGCAGTTA TAGGTGATGG AGCCTTAACT GGAGGTATGG CTTTAGAAGC CTTAAACGAT
ATTGGATATA GAAAAGATAA TCTTATAATA ATATTAAATG ATAATCAAAT GTCTATATGT
AAAAATGTTG GAGGACTTGC AACCTATTTA AATAAGCTTA GAATGGGTGT AGGTTATAAT
AAATTAAAAT CAGATATTGG ATCAACTTTA GATACAACTT CTTTGGGCAA AAGAGTAAAG
AACTCTCTTT CAAAATTAAA AGATGGTATC AAAAAGATTG TTGTACCAAG TATGTACTTT
GAGGATATTG GATTAAAATA TTTTGGCATA GTAGATGGAC ATAACATTAG AGAATTAAAT
GAAGTTTTAA GTATAGCTAA AAGTATAAAA GGACCAGTTA TAATACATAC AGTTACTAAA
AAAGGAAAAG GATATGAATT AGCAGAAAAA AATCCTAATA AATATCATGG AGTATCTCCT
TTTGATTTAG GAGAAGGAGT GATTTCAAAG TTTGCAAATA GAAATTATTC TTCTACCTTT
GGAGAAGAAA TGATTAAATT AGCTAAAAAT GATGACAAAG TTGTTGCAAT TACAGCTGCT
ATGCCAGATG GAACAGGATT AAAAGACTTT AGAGAAGAAT TTCCTGATAG ATTTTTTGAT
GTAGGTATAG CGGAACAACA TGCTGTTACA TTAGCTGCTG GAATGGCGGC AGAAGGTTTA
AAACCATTTT TTGCAGTCTA TTCTACTTTC TTACAAAGAG CTTATGACCA AGTTTTACAT
GATGTATGCA TACAAAATCT ACCTGTTACA CTTTGTCTAG ATAGAGCTGG CTTAGTTGGA
GAAGATGGAG AAACTCATCA AGGTATATTC GATATTTCAT TTTTATCTCC AATGCCTAAT
ATGACTATTG TTGCACCTAA GTGTATAGAT GAAATGGAAG TTATCTTAAA ATGGGCAAGT
AATTTTAATG CACCTTTAGC TATAAGATAT CCAAGAGGTG GAGATATTGA TGTTAATTTA
AAACCATTAA GTAAAATAGA ATATGGAAAA TGGGAAAAGG TTCAAGAGGG AGAGAAGATA
GCAATAGTTG CTACTGGTAA AATGGTTCAA CATGCTATGA TTGCTGCACA AAAGATAAAA
GAAGAAAAAA ATATAGATAT TTTAATTATA AATGCAACCT TTATAAAACC AATAGATAAA
GAATTATTAA ATTCCTTGTC AAAGGATGGA TTTAAGATTG TAACTATTGA AGATAATATT
AAAAAAGGTG GCTTTGGAGA AGGCGTTCTA GAGTATTTAA ATGAAATTGG ACATGAAGAA
AAAATTGTTA CATTAGCATT TAATGATAAG TTTATAGAAC ATGGTAAGCC TGATATTTTA
TATAGAATTA ATGGATTAGA TGCAGAGGGA ATAAAAAACA CATTAATTGA ATTACTTTAA
 
Protein sequence
MSEILQKITD PKEIKDLDEK ELEMLAKELR EFLIESVSNT GGHFASNLGV IDLTVALFKN 
FDFSENRIIW DVGHQSYAYK ILTGRKDKFN TLRQYGGLCG FPKRTESEYD FFATGHSSTS
LSSAAGMARA QKILGKDNKV IAVIGDGALT GGMALEALND IGYRKDNLII ILNDNQMSIC
KNVGGLATYL NKLRMGVGYN KLKSDIGSTL DTTSLGKRVK NSLSKLKDGI KKIVVPSMYF
EDIGLKYFGI VDGHNIRELN EVLSIAKSIK GPVIIHTVTK KGKGYELAEK NPNKYHGVSP
FDLGEGVISK FANRNYSSTF GEEMIKLAKN DDKVVAITAA MPDGTGLKDF REEFPDRFFD
VGIAEQHAVT LAAGMAAEGL KPFFAVYSTF LQRAYDQVLH DVCIQNLPVT LCLDRAGLVG
EDGETHQGIF DISFLSPMPN MTIVAPKCID EMEVILKWAS NFNAPLAIRY PRGGDIDVNL
KPLSKIEYGK WEKVQEGEKI AIVATGKMVQ HAMIAAQKIK EEKNIDILII NATFIKPIDK
ELLNSLSKDG FKIVTIEDNI KKGGFGEGVL EYLNEIGHEE KIVTLAFNDK FIEHGKPDIL
YRINGLDAEG IKNTLIELL