Gene CPR_1943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1943 
Symbol 
ID4206211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2146315 
End bp2147586 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content31% 
IMG OID642566493 
ProductD-alanyl-D-alanine carboxypeptidase family protein 
Protein accessionYP_699253 
Protein GI110803688 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1686] D-alanyl-D-alanine carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATTAAAT TTAAGAAAAA GCTTTTAAGT TCAATATTGA TTGCTTTATC AGTATCACTT 
TTTGCGCCAA TAAAAGCTAC TGTTCAAGCT GCTGAAATTT CCCAGCCTAA TATAGTTGGA
AAGTATGCTG TCACTTTAGA TTATGAAACT GGTGAAATTA TTTATGCAAA GGGAATAGAT
GAAAAAGCAT ACCCTGCTAG TACAACAAAG GTAATGACAA GTCTTTTATT TGCTGAACAT
GCTTCAAAGA ATGATTCTTT TCCATATACA GCAGATGCAA AAGTTCAACA ACCTTATACA
TTAAACGATA GCTTTGGACC AATACCTGTT GGCGAAGGAA TGAATGCTAA TGATTTAATG
AAAGCTTTAC TTATGTTTTC AGCCAATGAT GCTGCCGCTG TAATTGCAGA TGGCGTGGCT
GGAAGTACTG AAAAATTTAG TGTAATGATG AATGACGAAG TAAAGAAATT AGGACTTAAA
AATACTCACT TTGTTACTCC AAACGGCTTA CATAATGATG ATCACTATTC AACAGCTTAC
GATTTAGCTG TTATTTTACA AAATGCCTAT AAAAATCCTT GGGTAAGAGA AACTATGGCA
CTTAAGGATA GCGACATAAC TGTAAATGGA AAAAAAGTAC TTTTAGAAAA TAGAAATAAA
GAGCTTGGCA TCAACGGAAA TATTGGTGGA AAAACTGGGT TTACAACTCC TGCTGGGAGA
TGTTTAGTGT CAGTATACGA AAGAAATGGT AGAAAAATAA TAGGTACTGT TTTAAATTCT
CAATATGATG CTAAGGATGA AATTGTATTT AATGATATGA ATAAAATTAT TGATTACAGT
TACTCTGTAG ATAAAGTTCC ATACATAAAG GCTGGTACAA CAATAGATAC TATTCCAGTT
GAATATAAAC TTTTTAGATG GTTTGGACCA ACTAAAAAAA TAGACGTTCC TTTTGTTGCA
ACTGAAAACA TAGATTATTA TAAAAATTAT GTAAATGAAA AAGAAACATC AAAATCAATA
AACTTAAATG ATATGAATGC TTGGCAATTA GCTTCTAATC CTGAATCAGC TGCTGTTACT
GTTACTCAAA GAGCTTACGT TAAGGATTAT CCAGTAAAAG CTGATATAGG TACTTTTACT
CTTATAAAAG CTAATTTCTT AAGTTATTTA GGAATAATTG TTCTAGTAGC TATTGCGATT
GTATTAATAT TACTTATTAT AAGAGCAATA AATTTAAGAA AACGTAAAAA ACGTAGAAGA
AATATATTTT AA
 
Protein sequence
MIKFKKKLLS SILIALSVSL FAPIKATVQA AEISQPNIVG KYAVTLDYET GEIIYAKGID 
EKAYPASTTK VMTSLLFAEH ASKNDSFPYT ADAKVQQPYT LNDSFGPIPV GEGMNANDLM
KALLMFSAND AAAVIADGVA GSTEKFSVMM NDEVKKLGLK NTHFVTPNGL HNDDHYSTAY
DLAVILQNAY KNPWVRETMA LKDSDITVNG KKVLLENRNK ELGINGNIGG KTGFTTPAGR
CLVSVYERNG RKIIGTVLNS QYDAKDEIVF NDMNKIIDYS YSVDKVPYIK AGTTIDTIPV
EYKLFRWFGP TKKIDVPFVA TENIDYYKNY VNEKETSKSI NLNDMNAWQL ASNPESAAVT
VTQRAYVKDY PVKADIGTFT LIKANFLSYL GIIVLVAIAI VLILLIIRAI NLRKRKKRRR
NIF