Gene CPR_1951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1951 
Symbol 
ID4203970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2155036 
End bp2156193 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content32% 
IMG OID642566501 
Productamidohydrolase, putative 
Protein accessionYP_699261 
Protein GI110801746 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.140573 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAATTA AAAATGGGAA AATATTTACC TGTGAAGAAG GTAAGATATA TGAAAAAGGT 
GATATTCTAA TTAAGGATGG AAAGATAAGT AGAATTGGGG AAGATTTAAG TCAATACATA
GGAGAAGAAG AGGTTATTGA TGCTAAAGGA CTATTAATAT TTCCAGGGTT TATTGAAGCA
CATTGTCATT TAGGACTACA TGAAGAAGGA AATAATGGGG CAGGAAATGG AACCAATGAA
GCTAGTGAAC CTATAACCCC ACAAATGAGA GCTATAGATG GAATAAATCC TTTTGATGGA
GGATTCCAAT CTGCAAGGGA AGCAGGGGTT ACCACAGCTG TAATTGGGCC TGGAAGCGCT
AATGTAATAG GAGGACAGTT TGCCGCTGTA AAAACAAGTG GAATATGTAT TGATGATATG
ATAATAAAGG AACCTGTAGC AATAAAGGTT GCCTTTGGAG AAAATCCAAA AAGGGTTTAT
TCTGGAAAGA ATAAAATGCC TAATACAAGA ATGGCTATTG CAGCTTTATT AAGAGAAACT
TTAACAGAGG CTGTTAATTA TAAAAATAGA AAAATTGATG CTGAAATAGA GGATAGGGAT
TTTAGTAAGA ATTTAAAATA TGAGGCTTTA CTTCCACTAA TTAACAAAGA AATACCTATG
AAAGCTCATG CCCATAGAGC AGATGATATT TTAACTGCCA TAAGAATAGC TAAGGAATTT
AATCTTAAAT TAACTTTAGA TCATTGTACA GAAGGAGATT TGATAAGTGA TTATATTAAA
AGAGAAAACT TAGATGCTAT AGTTGGACCA ACTTTAAGTT TTAATGGAAA GGCAGAGACT
TTAAATAAAA CCTTTAAGAC TCCAAAGGCC TTAATAGATA AAGGAATTAA AGTAGCAATA
ACTACAGACC ATCCAGTAGT AACAATAGAC AATCTTCCAC TATGTGCAGC TATGGCTATG
AAAGAAGGAA TTACTTTTAA TGAAGCCTTA GAAGCAATAA CAATAAATCC AGCTAAAATA
ATAGGTATTG ATGAAAGAGT TGGAAGCTTA AAGGAAGGAA AGGATGGAGA TTTAGTAATT
TTAAATGGAA GTCCTTTTGA AATAGCTACA AAAACTATTT ATACAATTAT AAATGGAGAG
GTAGTTTATA AAGACTAG
 
Protein sequence
MLIKNGKIFT CEEGKIYEKG DILIKDGKIS RIGEDLSQYI GEEEVIDAKG LLIFPGFIEA 
HCHLGLHEEG NNGAGNGTNE ASEPITPQMR AIDGINPFDG GFQSAREAGV TTAVIGPGSA
NVIGGQFAAV KTSGICIDDM IIKEPVAIKV AFGENPKRVY SGKNKMPNTR MAIAALLRET
LTEAVNYKNR KIDAEIEDRD FSKNLKYEAL LPLINKEIPM KAHAHRADDI LTAIRIAKEF
NLKLTLDHCT EGDLISDYIK RENLDAIVGP TLSFNGKAET LNKTFKTPKA LIDKGIKVAI
TTDHPVVTID NLPLCAAMAM KEGITFNEAL EAITINPAKI IGIDERVGSL KEGKDGDLVI
LNGSPFEIAT KTIYTIINGE VVYKD