Gene CPR_1114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1114 
Symbol 
ID4206483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1257753 
End bp1258817 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content26% 
IMG OID642565670 
Productthreonine-phosphate decarboxylase 
Protein accessionYP_698436 
Protein GI110803458 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01140] L-threonine-O-3-phosphate decarboxylase
[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.677622 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTAG GACATGGTGG TAATGTAGAA GAAATTAAGA GGATTTATGG CTTAAAAGAA 
AATGAAATTA TTGATTTTTC AGCTAATATA AACCCTATAG GAATCTCTGA AAGAGTAAAG
GAAGCTATGG TAAGAGGAAT AAATTCTATA GAGAGATATC CAGATATAAC TTATTATGAG
CTTAAAAAGG GGATTTCAAA CTTTGAAAAG GTAGCTTTAG AAAATATAGT TCTAGGAAAT
GGTGCGGCAG AGGTTATTTT TAATATAGTA AGAGGAATAA AACCTAAAAA GGCTCTTATA
GTATCACCAA CCTTTTCAGA GTATGAGGAT GCTTTAAATT CTGTTGAGTG TGAAGTTAAC
CATTATATAT TAAAAGATAA TTATTCAGTA GATAATGGAT TTTTAGAAGA AATCAAGGAA
GAATTAGATA TTATATTTTT ATGTAATCCC AATAATCCTA CAGGTGCTTT AATAGAAAAA
GATTTTTCAT TAAAAGTATT AGAAAAAGCA AAAAAAATGA ATATTACAGT TGTTTTTGAT
GAATCTTTTT TAGACTTTGT TGAAGATAAT TATAAATACT CTCTTATTAA AGAACTTGAT
AATTTCCAAA ATTTAATTGT AGTAAAATCT TTAACTAAAT TATTTGCCTT TCCTGGAATA
AGATTAGGTT ATGGATTAAG TTCTAATAAT AATTTTATAG AAAAAATAAA TTTAGTAGGA
ACTCCTTGGA GTGTAAATAC CGTAGCAGAC TATGCAGGAA GAGAAGCATT AAAAGATGTT
GAGTATATTA AAAACAGCGT AGCTTACATA AAAAAAGAAA ATGAATTTTT ATATAATGGC
CTAGAGAATT TTAAAGATAT TATTACTTTT AAAGGTGCAG TAAATTTTAT ATTTTTTAAA
TTGAATAAAA ATATAAATTT AAGAGAAGAA TTAATTAGAA GAGGTATAAT AATAAGAAGT
TGTAGTAATT ATATTGGCTT AGACCATAGC TTCTATAGAG TGGCAGTTAG AACAAGGGAA
GAAAACAAAA AGCTTTTAGA AGAACTTTCA AAGGTTTTAA AATAA
 
Protein sequence
MNLGHGGNVE EIKRIYGLKE NEIIDFSANI NPIGISERVK EAMVRGINSI ERYPDITYYE 
LKKGISNFEK VALENIVLGN GAAEVIFNIV RGIKPKKALI VSPTFSEYED ALNSVECEVN
HYILKDNYSV DNGFLEEIKE ELDIIFLCNP NNPTGALIEK DFSLKVLEKA KKMNITVVFD
ESFLDFVEDN YKYSLIKELD NFQNLIVVKS LTKLFAFPGI RLGYGLSSNN NFIEKINLVG
TPWSVNTVAD YAGREALKDV EYIKNSVAYI KKENEFLYNG LENFKDIITF KGAVNFIFFK
LNKNINLREE LIRRGIIIRS CSNYIGLDHS FYRVAVRTRE ENKKLLEELS KVLK