Gene CPR_1225 details


Gene Information

Locus tag: CPR_1225
Symbol:
ID: 4206126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1376330
End bp: 1377526
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 33%
IMG OID: 642565781
Product: amidohydrolase family protein
Protein accession: YP_698547
Protein GI: 110803798
COG category: [R] General function prediction only
COG ID: [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase
TIGRFAM ID: [TIGR01891] amidohydrolase
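
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information listing above.
START_BP = 1376330
END_BP = 1377526


def gene_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues in a complete CDS: number of codons minus one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")  # prints "1197 bp; 398 aa"
```

Under these assumptions the result agrees with the listed Gene Length (1197 bp) and Protein Length (398 aa).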


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0413476
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATATAA ATTTAATGAA TGAAGCTCAA GAAATAAAAG ACTTACTTGT AGCTTTAAGA 
AGAGATTTTC ATGAAAATCC TGAATTAGGT TTTGAAGAAT GGAGAACTTC AGGAAAAATA
AAGGAATTTT TAACTAATGA AGGTATTGAA TATATAGAAA CTGCTAAAAC AGGAGTATGT
GGCATAATAA AGGGCACACT AAAGGATGAC TCTAAAAAAG ATAGATGCAT AGCTTTAAGA
GCTGACATTG ATGGTCTTCC TATGGATGAT AAAAAGACTT GTTCATATTC ATCAAAGGTT
AAAGGAAGAA TGCATGCTTG TGGACATGAT GCCCACACAA CAATATTGTT AGGTGCAGCT
AAATTATTAA GTAGACATAG AGATAAGTTT AGTGGTACTG TTAAGTTACT CTTTGAACCA
GCAGAGGAAA CAACAGGCGG AGCTCCTATA ATGATAGAAG AAGGAGTTTT AGAAAATCCT
AGAGTAGAAA AAATAATAGG CCTTCATGTT GAAGAAACTT TAGATGCCGG AGAAATAATG
ATAAAAAAAG GAGTAGTTAA TGCAGCATCT AATCCTTTCA CAATAAAGAT AAAAGGAAGA
GGAGGACATG GAGCTTATCC TCACATGGCT GTAGACCCTA TAGTTATGGC TTCTCAAGTT
GTTTTAGGAT TACAAACAAT AGTAAGTAGA GAAATAAAGC CTGTAAATCC AGCAGTTGTT
ACAGTAGGAA GTATAAATGG AGGAACTGCT CAGAATATAA TACCAGATGA GGTTATATTA
AAAGGTGTTA TAAGAACAAT GACTCTAGAA GATAGAGCTT ATGCTAAGGA AAGACTAAGA
GAAATAGCTA CATCTATTTG TACAGCCATG AGAGGTGAAT GTGAAATAGA TATAGAAGAA
AGCTATCCAT GTCTTTATAA TAATAGCTCC GTTGTAGATT TAGTAACTGA AGCTGCAAAA
GAAATTATTG GGTCTCAAAA TGTTAAGGAA CAAGAAGCAC CAAAGCTTGG AGTTGAAAGC
TTTGCATATT TTGCCCTAGA AAGAGATTCA GCTTTTTATT TCTTAGGAGC TAGAAATGAG
GAAAGAAATA TTATTTATTC AGCTCATAAT AGTAGATTCG ATATAGACGA GAATTTATTA
CCAATTGGAG TTTCAATTCA ATGTAAAGCA GCATTAAATT ATTTGACAAG GGAGTAA
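
The gene sequence above is laid out in blocks of 10 bases, so the spaces and line breaks have to be removed before it can be analysed. As a minimal sketch (the helper names `clean_sequence` and `gc_fraction` are hypothetical), this is one way to compute a GC fraction comparable to the GC content field. Only the first row of the sequence is used here for brevity; pasting in the full sequence would give the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied as displayed (blocks of 10 bases).
first_row = "ATGAATATAA ATTTAATGAA TGAAGCTCAA GAAATAAAAG ACTTACTTGT AGCTTTAAGA"

seq = clean_sequence(first_row)
print(len(seq), "bases, GC fraction:", f"{gc_fraction(seq):.0%}")
```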
 
Protein sequence
MNINLMNEAQ EIKDLLVALR RDFHENPELG FEEWRTSGKI KEFLTNEGIE YIETAKTGVC 
GIIKGTLKDD SKKDRCIALR ADIDGLPMDD KKTCSYSSKV KGRMHACGHD AHTTILLGAA
KLLSRHRDKF SGTVKLLFEP AEETTGGAPI MIEEGVLENP RVEKIIGLHV EETLDAGEIM
IKKGVVNAAS NPFTIKIKGR GGHGAYPHMA VDPIVMASQV VLGLQTIVSR EIKPVNPAVV
TVGSINGGTA QNIIPDEVIL KGVIRTMTLE DRAYAKERLR EIATSICTAM RGECEIDIEE
SYPCLYNNSS VVDLVTEAAK EIIGSQNVKE QEAPKLGVES FAYFALERDS AFYFLGARNE
ERNIIYSAHN SRFDIDENLL PIGVSIQCKA ALNYLTRE
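
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is an illustration, not the database's own pipeline: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons). The gene here begins with ATG, so no special start-codon handling is included. The function name `translate` is a hypothetical helper.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with their amino acids.
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored, so blocked sequences can be pasted in directly.
    Codons containing unknown characters are translated as 'X'.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "ATGAATATAA ATTTAATGAA TGAAGCTCAA GAAATAAAAG ACTTACTTGT AGCTTTAAGA"
print(translate(first_row))  # prints "MNINLMNEAQEIKDLLVALR"
```

The output matches the first 20 residues of the protein sequence listed above (MNINLMNEAQ EIKDLLVALR).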