Gene CPR_1664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1664 
SymbolispG 
ID4206119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1860187 
End bp1861236 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content32% 
IMG OID642566214 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase 
Protein accessionYP_698979 
Protein GI110803791 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.704095 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATAGAA AAGAAACTAG AAAGGTTAAG ATTGGAAACA TATATGTTGG AGGAGATTTT 
AGAGTTTCCA TTCAATCTAT GACCAATACA GATACAAAAG ATGTAGAATC TACAGTTAAA
CAAATAAAAG AGCTTCAAGA AGCTGGATGT GATATTGTTA GATGTGCAGT TTTAGATATG
GATGCAGCTT GCGCTATAAA AGATATAGTG GCAAAAATTA ATATACCACT AGTTGCAGAC
ATTCATTTTG ATTATAAATT AGCTTTAAAA GCAATAGAAA ATGGAGTTTC TGCAATAAGA
ATAAATCCAG GAAATATTGG ATCTAGGGAA AAGGTAGAAG CTGTAGTAAA AGCTTGTAAA
GAAAAAAATA TTCCTATAAG AATAGGGGTT AACTCAGGGT CATTATCAAA AGAGCTTTTA
GCAAAATACG GAAAACCTAC CCCAGATGCC CTAGTTGAAA GTGCATTAGA ACATGTTAAA
ATATTAGAAG AGTTAGATTT TCATGATATA GTAATTTCAA TGAAATCATC AAATGTTGAA
ACTATGATAG AAAGTTATAG AATAGCTTCA CAAAAAACAA ATTATCCTCT TCACTTAGGG
GTTACTGAGG CTGGTACACC TTGGAGAGGA ACAATAAAAT CTGCTATAGG AATAGGAACT
TTACTTGCAG AAGGAATAGG TGATACTATA AGAGTTTCTT TAACTGGAGA TCCTGTTGAA
GAGATAAAAG TAGGTAAAGA AATTCTTAAA AACTTTGGAT ATGTAAAAGA AGGAATAGAG
TTTATATCAT GTCCTACATG TGGAAGAACT CAAATAGACT TAATAAACAT AGCTAAAGAA
GTAGAAGAAA GATTAAGTTC TTGCAAGAAA AACATAAAGG TTGCAGTAAT GGGCTGTGTT
GTAAATGGAC CAGGAGAAGC AAGAGAGGCA GATATTGGAA TAGCTGGGGG TAAAGGCGAA
GGTCTTATCT TTAGAAAAGG TGAAATAATT AAAAAGGTAA AAGAAGAAGA CTTAGTTGAA
GAGCTTATAA AGATAATAGA AACAATATAA
 
Protein sequence
MNRKETRKVK IGNIYVGGDF RVSIQSMTNT DTKDVESTVK QIKELQEAGC DIVRCAVLDM 
DAACAIKDIV AKINIPLVAD IHFDYKLALK AIENGVSAIR INPGNIGSRE KVEAVVKACK
EKNIPIRIGV NSGSLSKELL AKYGKPTPDA LVESALEHVK ILEELDFHDI VISMKSSNVE
TMIESYRIAS QKTNYPLHLG VTEAGTPWRG TIKSAIGIGT LLAEGIGDTI RVSLTGDPVE
EIKVGKEILK NFGYVKEGIE FISCPTCGRT QIDLINIAKE VEERLSSCKK NIKVAVMGCV
VNGPGEAREA DIGIAGGKGE GLIFRKGEII KKVKEEDLVE ELIKIIETI