Gene CPR_1167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1167 
Symbol 
ID4206611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1311834 
End bp1312979 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content32% 
IMG OID642565723 
Productmalate oxidoreductase (NAD) (malic enzyme) 
Protein accessionYP_698489 
Protein GI110801879 
COG category[C] Energy production and conversion 
COG ID[COG0281] Malic enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.641999 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAAGG ACGAATTATT AAAACAAAGA GAATTAGCTC ATGGATTAAT AAGTATAAAA 
CCTAATTTTG ATATAAATAA TAGAGAACAA TTATCACAAA TATATACTCC TGGAGTATCA
ACTATTTGTA AAGAGGTTGA GCATCATCCT AGTATGCTGA AAACACTAAC TTCTGTTGGA
AATTCAATAG CTGTTATAAC AGATGGTACT GCGGTTTTAG GTCTTGGGAA TATAGGTACC
CTTGCAGGAT ATCCTATAGT AGAGGCTAAA GCTTTAGTTT ATAAAGATCT AGCTGGTGTA
AACGCTATCC CATTATGCGT TGATCAAATA GGATGCAATG AATTAATTAA AACAATAAAA
AATATATATT CAAGTTTTAG TGGAATTCAT CTTGAAGATA TAAAGGCACC GGAATGCTTT
TATATAGAAG ATGAACTTAA AAAAACTTTA AATATTCCAG TTTATCATGA TGACCAACAT
GGTACTGCCA TTGCTGTTTT AGGGGCTCTT TATAATGCAT CTAAGGTAGT TAATAAGGAT
TTTTCAAAGT TAAAGGTATT AATTTTAGGG GCAGGAGCTT CAGGAATTGC AACAGCAAAA
TTATTATTAA AGGCTGGAAT AGAAGATATT ATATTAGTTG ATAAGAATGG AGCTTTAGTT
AGTGGTGATG AAACTCTTAA TGATCCTCAA AAAGAAATGG CTAAAATAAC AAATAAAGAA
TTAAAAAAAG GAACTTTGGA AGAAGTAATT AAGGGAAGAG ATGTATTTAT AGGTTTATCA
GAAGGGAATC TTGTAACTAA GGAAATGGTA GAAAGTATGA ATGAGGATCC TATAATATTC
GCTTTAGCTA ATCCAACGCC AGAGATAAAA CCTGAAATTG CAAAGGAAGC TGGTGCAAGG
GTTATTGCAA CAGGTGGACC TTCTTATCCA AATCAGATTA ATAATATATT GGTTTTCCCA
GGACTATTTA AAGGATTATT AGAAGCTAAG GCAACTGATG TAACTTATGG TGTAATGATA
GCAGTTAGTA AAAAATTAGC TTCCTTAGTT GAAAATCCAA CTGCTGAAAA AATAATACCT
GGAGTATTTG ATGGTGATAT AGTTAAGTCT GTTTCTGAAA CTGTGGTAAA AAATATTGAG
AAGTAG
 
Protein sequence
MTKDELLKQR ELAHGLISIK PNFDINNREQ LSQIYTPGVS TICKEVEHHP SMLKTLTSVG 
NSIAVITDGT AVLGLGNIGT LAGYPIVEAK ALVYKDLAGV NAIPLCVDQI GCNELIKTIK
NIYSSFSGIH LEDIKAPECF YIEDELKKTL NIPVYHDDQH GTAIAVLGAL YNASKVVNKD
FSKLKVLILG AGASGIATAK LLLKAGIEDI ILVDKNGALV SGDETLNDPQ KEMAKITNKE
LKKGTLEEVI KGRDVFIGLS EGNLVTKEMV ESMNEDPIIF ALANPTPEIK PEIAKEAGAR
VIATGGPSYP NQINNILVFP GLFKGLLEAK ATDVTYGVMI AVSKKLASLV ENPTAEKIIP
GVFDGDIVKS VSETVVKNIE K