Gene CPR_B0004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_B0004 
Symbol 
ID4206641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008264 
Strand
Start bp4586 
End bp5614 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content28% 
IMG OID642567236 
ProductN-acetylmuramoyl-L-alanine amidase domain-containing protein 
Protein accessionYP_699923 
Protein GI110804011 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0860] N-acetylmuramoyl-L-alanine amidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones207 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATAG CTATACGTGG TGGACATAAC TTTTTAGCAA AAGGTGCTTG TGGGTTAATA 
GATGAAACTA TAGAAGATAG AAAAGTTTAT AAAGCTGTTA TTAAAAACTT AATTGAAAAT
AATTTTGAAG TTTTAGATGT TACTCCAGAT GACTGTGATG TAAATACAGA CTTAAAATTT
GGAGTAGAAA AAGCTAATAA TTTTAATGCA GATTTATTCG TTTCTATACA CTTTGATAAA
TGTTATGACA AATTTGATGG TCCTTTAGGT ACAGGAACTT GGGTTTGTGA AAAAGGGGGA
AAAGCAGAAA TATATGCTCA AAATATAGTA GATACAATCT CTGAAGGAAC TTCCTTAAAA
AATAGGGGAG TTAAAACTAA TGCTAAGCTT TATGAATTAA ATAAAACAAT CATGCCAGCT
GTAATAGTTG AAGTTTGTTT CTGCGAATCT AAAGTAGATG TAGATATTTA TAGAGAAAAA
GGGTCTGATT TAATTGGTTA TTTAATAGCT AAAGGAATTT GTAAATCTGT AAATAAAGAA
ATTAGTTCTG ATTTGCCTCA AGTTAATCTT GAGAACACTA CTAATTCACA AAATAATAAT
TTATTTAAAA CTAATGCAAC AGCAAAGGTT GCTTTAGATC CTAGGGATAA CCCTAGTAAT
AACTACAAAG ATTTAGGTGA AATCTATGCA AATGAAAGAA TAAAAATTTT AGCTGAAGTT
TGTGATTTAA AATTTTTCTT ACCAGCAACA TATTGGCAAG ATGCATTAAA TAAAGAATCT
TCTCCTATAT GGGTAAATTC AAAACAGACA GTACTTAATG TTGATACCAA TGCAACTGTA
ATCAATGTTT TAACCGAATT AGATGCTAGA TATACTCCTT CTCCTGATTC AAATAGAATG
GGATATGTAA AAAATCAAGA AAGACTTTTT GTTCATAAAA TTGAAAATAA TTATGCTTTA
GCTACTTACT TAGCTAGTGA AGGATATAAA ACAGCATGGT TTACAGCAGA GTATATAAAA
TTAGATTAA
 
Protein sequence
MKIAIRGGHN FLAKGACGLI DETIEDRKVY KAVIKNLIEN NFEVLDVTPD DCDVNTDLKF 
GVEKANNFNA DLFVSIHFDK CYDKFDGPLG TGTWVCEKGG KAEIYAQNIV DTISEGTSLK
NRGVKTNAKL YELNKTIMPA VIVEVCFCES KVDVDIYREK GSDLIGYLIA KGICKSVNKE
ISSDLPQVNL ENTTNSQNNN LFKTNATAKV ALDPRDNPSN NYKDLGEIYA NERIKILAEV
CDLKFFLPAT YWQDALNKES SPIWVNSKQT VLNVDTNATV INVLTELDAR YTPSPDSNRM
GYVKNQERLF VHKIENNYAL ATYLASEGYK TAWFTAEYIK LD