Gene CPR_1106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1106 
Symbol 
ID4204449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1250870 
End bp1252168 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content31% 
IMG OID642565662 
Productamidohydrolase family protein 
Protein accessionYP_698428 
Protein GI110801523 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.181704 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGATATTG AGAAGTTTTT AAGGGAGATT AAAGAAAAAA TAATTAATTA TAGAAGGGAT 
TTTCATAAAT ATCCTGAAAG TGCATGGAAA GAATTTAGAA CATCTTCTTT AATTGGAAAA
TATTTAATAG AAATGGGTTA TGAAATAAAA ATAGGAAAAG AAATAATAAG TGAAGAGCAT
AGATTAGATG TATCAAATGA ACTTGATTTA AAGAAAAGTT ATGAAAAATC ATTAAATCAA
GGTGCCTATA AAGAATTAAG TGATTATATG ATTGGGGGAT ACACTGGAGT AGTTGGGATT
TTACAAAATG GAGAGGGTCC AACTATTGCT TTAAGATTTG ATATAGATGG AGTAAAGGTC
CTTGAATGTG AAAATAATGA GCATTTTCCT TTTAAAAATG GTTTTTCCTC TTTAAATAGA
GGGATTATGC ATGCCTGTGG ACATGATGGT CATATAGCTA TTGGTCTTGG AGTTGCAGAA
GGGATTATTA GATTTAAGGA ATATTTAAGT GGAACAATAA AACTTATTTT TCAACCTGGA
GAAGAAAGTG TATGTGGTGG AAATCCAATG GCTCAATCAG GAATCTTAGA TGATGTAGAT
TATTTATTAA GTGGACACAT AGGAATAAAG GCAAGGAAGT CAGGAGAAAT AATCTGTGGA
ACAAAGGGAT TTTTAGCAAC ATCTAAAATT AATGCTGAAT TTACAGGGAA GTCTTCCCAT
GCCGCAGTTG CACCTGAAAA AGGGCATAAT GCTTTGCTTT CTGCATCTAC AGCAGTATTA
AACTTAGATG CTATACCAAG AAGTGGTAAA GGCGTAACAA GAATAAATGT AGGAAAGTTA
ATTGCAGGGA GTGGTAGAAA TGTAATTCCT GGAAAAGCTT TTATGGAAGT AGAAACTAGA
GGAGAAACTA CAGAATTAAA TGAATATATG GAGAGTTATG CAACTAGAAT CTTAAAAGCT
TCTGCAGATA TGCATGATAA CCATGTGAAA ATATCTTGTT TAGGAAGATC TATAAGTGGT
AGTAGTGATA GAGAGTTAAT TGATATTATT AAATCGGAGG CAAAAAATAT AAATGATTAT
AACAATATAA TAGATGAGGA AGATTTTGGT GCTAGTGAAG ATATATGTTA TATGATGAAA
AGAGTACAGA AAAATGGAGG AAAAGCATCT TATATAATGT TTGGAAGTGA GTTAAAGGAT
GAACATCATT CTATTAGGTT TGATTTTAAT GAAGAAGATA TGTTTCCAGC AATAAATTTA
TATTTAAGAG TAGTAATGAA TCTTTGCAAT AAAAGATAA
 
Protein sequence
MDIEKFLREI KEKIINYRRD FHKYPESAWK EFRTSSLIGK YLIEMGYEIK IGKEIISEEH 
RLDVSNELDL KKSYEKSLNQ GAYKELSDYM IGGYTGVVGI LQNGEGPTIA LRFDIDGVKV
LECENNEHFP FKNGFSSLNR GIMHACGHDG HIAIGLGVAE GIIRFKEYLS GTIKLIFQPG
EESVCGGNPM AQSGILDDVD YLLSGHIGIK ARKSGEIICG TKGFLATSKI NAEFTGKSSH
AAVAPEKGHN ALLSASTAVL NLDAIPRSGK GVTRINVGKL IAGSGRNVIP GKAFMEVETR
GETTELNEYM ESYATRILKA SADMHDNHVK ISCLGRSISG SSDRELIDII KSEAKNINDY
NNIIDEEDFG ASEDICYMMK RVQKNGGKAS YIMFGSELKD EHHSIRFDFN EEDMFPAINL
YLRVVMNLCN KR