Gene CPR_0737 details

Gene Information

Locus tag: CPR_0737
Symbol: codA
ID: 4206223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 862996
End bp: 864258
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 30%
IMG OID: 642565297
Product: cytosine deaminase
Protein accession: YP_698063
Protein GI: 110803700
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
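
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon contributing no residue. The variable names are illustrative, not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 862996
end_bp = 864258
gene_length_bp = 1263
protein_length_aa = 420

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# One codon is 3 bp; the last codon is assumed to be the stop codon.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print(span, codons, remainder, coding_codons)  # 1263 421 0 420
```

Under these assumptions the span equals the listed gene length, and the number of coding codons equals the listed protein length.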


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.159604
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCTA TTTTATTTAA AAATGCTAGA TTAAAAGGAA ATGAAACACT AGTTGATTTA 
CTTGTTGAAA ATGGAGTTTA TAAGGAAATA GGTCCAAATC TTTCTGAAAA ATATAAAGAT
GTAGAAACTT ATGATTTAAA AGGAGATCTT GTTGTACCAC CTTATGTTGA TCCACATATT
CATTTAGACT ATGTATATAC AGCTCGTATG CCTGGTGCAA ATAATGGAAC AGGAACTCTT
TTTGAAGGAA TTCAAAGATG GTCTGAAACA AAAGGAAACA TGACAATAGA TGAAATTAAA
GAACGTGCAA GAATAGCTCT TAAAAAAGAG ATTTTATATG GTACTCAATA TATGAGAACA
CATGTTGACG TAACAGATCC TAAGTTTACA GGATTAAAAG CTATTATGGA ATTAAAAGAA
GAATATAAAG ATATTATAGA TATTCAAATC ATAGCTTTCC CACAAGAAGG AATGTATTCT
TATAAAGGTG GAGATGAATT AGTAGAAGAA GCTTTAAAAA TGGGTGCTGA TGTAGTTGGT
GCTATTCCTC ACTTTGAATT CACAAGAGAA ATGGGAGAAA AATCAGTTAA GAAAACTGTA
GAGCTTGCAA TGAAATATAA TAAATTAATT GACGTTCACT GTGATGAAAC TGATGATGAC
CAATCAAGAT TTGTTGAATT ATTAGCAGCA GAAGCTTACT TAAATGGAAT TGGAGAACTT
ACAACTGCAA GCCATACTTG TGCTATGGGT TCATATAATA ATGCTTATGC ATTTAAATTA
TTCAAACTTT TAAAATTATC AAAAATGAAC TTCATATCAT GTCCAACAGA AAATATTCAC
TTACAAGGAA GATATGACAC TTATCCAAAG AGAAGAGGTC TTACAAGAGT TAAGGAATTA
AATGATGCAG GAATTAATGT TTGTTTTGCT CAAGACTCAA TTTCAGACCC ATGGTACCCA
TTAGGAAATG GTAACCTAAT GAATATCTTA GATGCTGGTA TTCATATATG CCATATGATG
TCTGTTGATG AAATTAATAA TGCCTTAGAT TTAATTACAA CAAATGGTGC CAAAACTCTT
CATATACAAG ATAAATATGG TATAGAAGTA GGAAAAGATG CTAACTTCAT AGTTTTAAAT
GCTAAAAATG AATTTGATGC AATCCTTGAA AGAGTTGGAG TTAACTGCTC TGTAAGAAGA
GGAGAATTCC TATTTAAGAG AGAACCTGAA GTAATAGACA CAAAAATAAC TCTATTAAAA
TAG
 
Protein sequence
MKAILFKNAR LKGNETLVDL LVENGVYKEI GPNLSEKYKD VETYDLKGDL VVPPYVDPHI 
HLDYVYTARM PGANNGTGTL FEGIQRWSET KGNMTIDEIK ERARIALKKE ILYGTQYMRT
HVDVTDPKFT GLKAIMELKE EYKDIIDIQI IAFPQEGMYS YKGGDELVEE ALKMGADVVG
AIPHFEFTRE MGEKSVKKTV ELAMKYNKLI DVHCDETDDD QSRFVELLAA EAYLNGIGEL
TTASHTCAMG SYNNAYAFKL FKLLKLSKMN FISCPTENIH LQGRYDTYPK RRGLTRVKEL
NDAGINVCFA QDSISDPWYP LGNGNLMNIL DAGIHICHMM SVDEINNALD LITTNGAKTL
HIQDKYGIEV GKDANFIVLN AKNEFDAILE RVGVNCSVRR GEFLFKREPE VIDTKITLLK
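
The protein sequence can be checked against the gene sequence by translating codon by codon. The following is a minimal sketch, not the database's own method: it builds the codon table from the standard amino acid assignments, which translation table 11 shares for internal codons, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that does not matter here). The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon order T, C, A, G with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, copied from the record.
first_line = "ATGAAAGCTA TTTTATTTAA AAATGCTAGA TTAAAAGGAA ATGAAACACT AGTTGATTTA"
print(translate(first_line))  # MKAILFKNARLKGNETLVDL
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` would allow the same comparison over all 420 residues.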