Gene CPF_2231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2231 
Symbol 
ID4202850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2476905 
End bp2478176 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content31% 
IMG OID638083096 
ProductD-alanyl-D-alanine carboxypeptidase family protein 
Protein accessionYP_696655 
Protein GI110801081 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1686] D-alanyl-D-alanine carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATTAAAT TTAAGAAAAA GCTTTTAAGT TCAATATTGA TTGCTTTATC AGTATCACTT 
TTTACGCCAA CAAAAGCTAC TGTTCAGGCT GCCGAAATTT CCCAGCCTAA TATAGTTGGA
AAGTATGCTG TCACTTTAGA TTATGACACT GGTGAAATTA TTTATGCAAA GGGAATAGAT
GAAAAAGCAT ATCCTGCTAG TACAACAAAG GTAATGACAA GTCTTTTATT TGCTGAACAT
GCTTCAAAGA ATGATTCTTT TCCATATACA GCAGATGCAA AAGCTCAACA ACCTTATACA
TTAAACAATA GCTTTGGACC AATACCTGTT GGCGAAGGAA TGAATGCTAA TGATTTAATG
AAAGCTTTAC TTATGTTTTC AGCTAATGAT GCTGCCGCTG TAATTGCAGA CGGGTTGGCT
GGAAGTGCTG AAAAATTTAG TGTAATGATG AATGACGAAG TAAAAAAATT AGGATTAAAA
AATACTCACT TTGTTACTCC AAATGGCTTA CATAATGATG ATCACTATTC AACAGCTTAC
GATTTAGCTG TCATTTTACA AAATGCTTAT AAAAATCCTT GGGTAAGCGA AACTATGGCA
CTTAAGGATA GCGACATAAC TGTAAACGGA AAAAAAGTAC TTTTAGAAAA TAGAAATAAA
GAACTTGGTA TCGACGGAAA TATTGGTGGA AAGACTGGAT TTACAACTCC TGCTGGAAGA
TGTTTAGTAT CAGTATACGA AAGAAATGGT AGAAAAATAA TAGGTGCTGT TTTAAATTCT
CAATATGATG CTAAGGATGA AATTGTATTT AATGATATGA ATAAAATTAT TGATTACAGT
TACTCTGTAG ATAAAGTTCC ATACATAAAG GCTGGTACAA CAATAGATAC TATTCCAGTT
GAATATAAAC TTTTTAGATG GTTTGGACCA ACTAAGAAAA TAGATGTTCC TTTTGTTGCA
ACTGAAAACA TAGATTATTA TAAAAATTAT GTAAATGAAA AAGAAACATC AAAATCAATA
AACTTAAATG ATATGAACGC TTGGCAATTA GCTTCTAATC CTGAATCAGC TGCTGTTACT
GTTACTCAAA GAGCTTACGT TAAGGATTAT CCAGTAAAAG CTGATATAGG TACTTTTACT
CTTATAAAAG CTAATTTCTT AAGTTATTTA GGAATAATTG TTCTAGCAAT AGTTGTAATT
GTATTAATAT TACTTATTAT AAGAGCAATA AATTTAAGAA AACGTAGAAG ACGTAGAAGA
AATATATTTT AA
 
Protein sequence
MIKFKKKLLS SILIALSVSL FTPTKATVQA AEISQPNIVG KYAVTLDYDT GEIIYAKGID 
EKAYPASTTK VMTSLLFAEH ASKNDSFPYT ADAKAQQPYT LNNSFGPIPV GEGMNANDLM
KALLMFSAND AAAVIADGLA GSAEKFSVMM NDEVKKLGLK NTHFVTPNGL HNDDHYSTAY
DLAVILQNAY KNPWVSETMA LKDSDITVNG KKVLLENRNK ELGIDGNIGG KTGFTTPAGR
CLVSVYERNG RKIIGAVLNS QYDAKDEIVF NDMNKIIDYS YSVDKVPYIK AGTTIDTIPV
EYKLFRWFGP TKKIDVPFVA TENIDYYKNY VNEKETSKSI NLNDMNAWQL ASNPESAAVT
VTQRAYVKDY PVKADIGTFT LIKANFLSYL GIIVLAIVVI VLILLIIRAI NLRKRRRRRR
NIF