Gene CPR_1200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1200 
SymbolpyrC 
ID4206448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1345144 
End bp1346343 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content31% 
IMG OID642565756 
Productdihydroorotase, multifunctional complex type 
Protein accessionYP_698522 
Protein GI110801777 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0128604 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTGC TAATTAAAAA TGTAAATTTA ATAGATGAAA GCAACAACTT TTTTGGTGAT 
ATATATATAG AAAAAGGGGT AATAAAAGAA CTTGGAACTG AACTAAATAA AGAATGCGAA
ACTCTAGATG GAAAAGGCTT AGTACTTATG CCTGCATTTA TAGATACTCA TGCACACTTT
AGAGATCCAG GCTTTGAATA TAAAGAGGAT ATTGAAAGTG GATCTAAGGC TGCAGTTAGG
GGTGGATACA CAACAGTAAC CTTAATGCCA AACACAAAAC CCGTTTGTAG TTCAAAAGAA
ATTTTAGATT ATGTGGTTAA TAAGGGTAAA GAGGTAGACT TAGTAGATCT ATATCAAACA
GTTTCCATAA CAAAGAATTT ATCAGGTGAA GAAATAAATC ATCTTAGAGA ATTTGAGGGA
AATCCTAATG TTAAGGCAAT AACAGATGAT GGTAAAGGTG TATCAGATTC TAAGATTATG
ATGGAGGCTA TGAAAATAGC TAAGGAAAAT AACTGGATAG TAATGTCCCA TGCTGAAAGT
CCAGAATTCT CAAAAGTTGA TATGAGATTA GCTGAAAATA TGATGACATG GAGAGATATT
ACATTAGCAA AGTTTATAGA TTGTAGACTT CACATGTCTC ATGTAAGTAC TAAGGAAGCT
ATGAAATATA TAATAGAAGG AAAAAATGAT GGAGTTAAAG TAACTTGCGA AATAACTCCT
CACCATTTAG CTTTAAATAA TAAGATTAGT AATTATAGAG TTAATCCTCC TATAAGAGAA
GAAGAGGATG TAAATTTCTT AATAAAGGCA ATAAAAATGA ACTATGTTGA TTGTATAGGA
ACAGATCATG CTCCTCATTC AAAGGAAGAT AAGGAAAAAG GAGCACCTGG CATGATTGGA
ATTGAACAAG CTTTCTCAAT ATGTTATACC AAGCTAGTTA AGGAAAATCA CATAAGCTTA
AATAAGCTAA GTCAATTAAT GAGTGGAAAT GCTGCTAAAT TATTAAACTT AAATAAAGGA
AAACTTCAAC CAGGTTTTCT TGGAGATTTA GTTCTTATAG ATTTAAACAA GAAAAGAATA
TTCAAAGAAG AAGATATAGT ATCTAGAAGT AAAAACACAC CATTTAATGG AATGGAGTTT
TATGGAGATG TAGTACTAAC AATAAAGAAT GGAAAAATAG TTTACAAGGG TGAATTTTAG
 
Protein sequence
MNLLIKNVNL IDESNNFFGD IYIEKGVIKE LGTELNKECE TLDGKGLVLM PAFIDTHAHF 
RDPGFEYKED IESGSKAAVR GGYTTVTLMP NTKPVCSSKE ILDYVVNKGK EVDLVDLYQT
VSITKNLSGE EINHLREFEG NPNVKAITDD GKGVSDSKIM MEAMKIAKEN NWIVMSHAES
PEFSKVDMRL AENMMTWRDI TLAKFIDCRL HMSHVSTKEA MKYIIEGKND GVKVTCEITP
HHLALNNKIS NYRVNPPIRE EEDVNFLIKA IKMNYVDCIG TDHAPHSKED KEKGAPGMIG
IEQAFSICYT KLVKENHISL NKLSQLMSGN AAKLLNLNKG KLQPGFLGDL VLIDLNKKRI
FKEEDIVSRS KNTPFNGMEF YGDVVLTIKN GKIVYKGEF