Gene Information |
Locus tag | CPR_1951 |
Symbol | |
ID | 4203970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 2155036 |
End bp | 2156193 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642566501 |
Product | amidohydrolase, putative |
Protein accession | YP_699261 |
Protein GI | 110801746 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
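The length fields above are consistent with the coordinates. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the terminal stop codon; the function names are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(2155036, 2156193)
print(length)                  # 1158
print(protein_length(length))  # 385
```

Both values agree with the Gene Length and Protein Length rows of the record.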
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.140573 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAATTA AAAATGGGAA AATATTTACC TGTGAAGAAG GTAAGATATA TGAAAAAGGT GATATTCTAA TTAAGGATGG AAAGATAAGT AGAATTGGGG AAGATTTAAG TCAATACATA GGAGAAGAAG AGGTTATTGA TGCTAAAGGA CTATTAATAT TTCCAGGGTT TATTGAAGCA CATTGTCATT TAGGACTACA TGAAGAAGGA AATAATGGGG CAGGAAATGG AACCAATGAA GCTAGTGAAC CTATAACCCC ACAAATGAGA GCTATAGATG GAATAAATCC TTTTGATGGA GGATTCCAAT CTGCAAGGGA AGCAGGGGTT ACCACAGCTG TAATTGGGCC TGGAAGCGCT AATGTAATAG GAGGACAGTT TGCCGCTGTA AAAACAAGTG GAATATGTAT TGATGATATG ATAATAAAGG AACCTGTAGC AATAAAGGTT GCCTTTGGAG AAAATCCAAA AAGGGTTTAT TCTGGAAAGA ATAAAATGCC TAATACAAGA ATGGCTATTG CAGCTTTATT AAGAGAAACT TTAACAGAGG CTGTTAATTA TAAAAATAGA AAAATTGATG CTGAAATAGA GGATAGGGAT TTTAGTAAGA ATTTAAAATA TGAGGCTTTA CTTCCACTAA TTAACAAAGA AATACCTATG AAAGCTCATG CCCATAGAGC AGATGATATT TTAACTGCCA TAAGAATAGC TAAGGAATTT AATCTTAAAT TAACTTTAGA TCATTGTACA GAAGGAGATT TGATAAGTGA TTATATTAAA AGAGAAAACT TAGATGCTAT AGTTGGACCA ACTTTAAGTT TTAATGGAAA GGCAGAGACT TTAAATAAAA CCTTTAAGAC TCCAAAGGCC TTAATAGATA AAGGAATTAA AGTAGCAATA ACTACAGACC ATCCAGTAGT AACAATAGAC AATCTTCCAC TATGTGCAGC TATGGCTATG AAAGAAGGAA TTACTTTTAA TGAAGCCTTA GAAGCAATAA CAATAAATCC AGCTAAAATA ATAGGTATTG ATGAAAGAGT TGGAAGCTTA AAGGAAGGAA AGGATGGAGA TTTAGTAATT TTAAATGGAA GTCCTTTTGA AATAGCTACA AAAACTATTT ATACAATTAT AAATGGAGAG GTAGTTTATA AAGACTAG
|
Protein sequence | MLIKNGKIFT CEEGKIYEKG DILIKDGKIS RIGEDLSQYI GEEEVIDAKG LLIFPGFIEA HCHLGLHEEG NNGAGNGTNE ASEPITPQMR AIDGINPFDG GFQSAREAGV TTAVIGPGSA NVIGGQFAAV KTSGICIDDM IIKEPVAIKV AFGENPKRVY SGKNKMPNTR MAIAALLRET LTEAVNYKNR KIDAEIEDRD FSKNLKYEAL LPLINKEIPM KAHAHRADDI LTAIRIAKEF NLKLTLDHCT EGDLISDYIK RENLDAIVGP TLSFNGKAET LNKTFKTPKA LIDKGIKVAI TTDHPVVTID NLPLCAAMAM KEGITFNEAL EAITINPAKI IGIDERVGSL KEGKDGDLVI LNGSPFEIAT KTIYTIINGE VVYKD
|
| |
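The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG even though the gene is on the minus strand) and uses the codon assignments of translation table 11, which for elongation are the same as those of the standard code. The helper name `translate` and the compact codon-table encoding are illustrative choices, not something taken from the record.

```python
from itertools import product

# Codon table in TCAG order. Translation table 11 shares these
# codon-to-residue assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # The record groups bases in blocks of ten, so drop all whitespace first.
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGTTAATTA AAAATGGGAA AATATTTACC"))  # MLIKNGKIFT
```

Passing the full gene sequence to `translate` in the same way would be expected to give the 385-residue protein listed above, under the stated assumptions.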