Gene CPR_0340 details


Gene Information

Locus tag: CPR_0340
Symbol:
ID: 4204350
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 409999
End bp: 412350
Gene Length: 2352 bp
Protein Length: 783 aa
Translation table: 11
GC content: 33%
IMG OID: 642564897
Product: DNA polymerase III, alpha subunit, interruption-C
Protein accession: YP_697669
Protein GI: 110801675
COG category: [L] Replication, recombination and repair
COG ID: [COG0587] DNA polymerase III, alpha subunit
TIGRFAM ID: [TIGR00594] DNA-directed DNA polymerase III (polc)
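
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is not part of the record; it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. All variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 409999
end_bp = 412350
stated_gene_length = 2352      # bp
stated_protein_length = 783    # aa

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2352 0 783
print(gene_length == stated_gene_length, protein_length == stated_protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.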


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTTCTTAA ATCCGGAGAG AGTAAGTATG CCAGATATAG ATTCCGATTT CTGTTATGAG 
GGAAGACAAA GAGTTATAGA CTATGTTGTT GAGAAGTATG GTCACGACAA TGTATCACAG
ATTATAACTT TTGGAACAAT GGCAGCAAGA GCTTGTATTA GGGACGTAGG TAGAGCTATG
AATTACACTT ATGCAGAGGT TGATAGAATA GCTAAACAAA TACCTACAGT TCTTGGAGTA
ACAATAGATA AGGCTTTAGA TTTAAATCCA GAGCTTAAAA CAATATATGA TACTGAAGAG
AGAGTAAAAG AACTTATTGA TGTAAGTAGA AGGCTAGAAG GGCTTCCAAG ACATTCATCA
ACTCACGCTG CGGGAGTAGT TATAGCTTCT CAGCCTTTAG TTGAGTATGT TCCACTTCAA
AAAAATGATG AATCCATTGT AACTCAATTT GATATGACAA CACTTGAAGA GCTTGGTTTA
TTAAAAATGG ACTTCTTAGG TCTTAGAACC TTAACTGTTA TGCGTGATGC TGTGGATATG
ATCAAGTATA ACAGGGGAGT AGACATAGAT TTAGATAATT TAGACTTTGA TGATAAAGAA
GTTTATAAAA TGATTGGTGA AGGAAATACA GTAGGGGTAT TCCAGTTAGA GTCAGCAGGA
ATGACCTCTT TCATGAAAGA GCTTAAGCCA GACTGTTTAG AGGATATCAT AGCGGGAATA
TCATTATATA GACCTGGTCC TATGGCAGAA ATACCAAGAT ATATAAGTGG TAAAAGGGAT
CCAAAGTCAG TAGAATATAT AGTTCCTGAG CTTGAAAACA TATTAAATGT AACCTATGGG
GTTATGGTTT ATCAAGAGCA GGTTATGGAG ATTGTTAGAA AATTAGCAGG ATATTCCATG
GGTAGAAGTG ACCTTGTTAG AAGAGCTATG TCTAAGAAAA AGCATAAGGT CATGGAAGAA
GAGAGAAAGA ACTTCATTTA TGGAATTGAA GATGAAAATG GAAATATAGA GGTTCCAGGT
TGCTTAAGAA ATGGTATTTC AGCTGAGGCA GCTAATAAAA TATTTGACTC TATGATGGAT
TTTGCATCCT ATGCATTTAA CAAAAGTCAT GCTGCTGCCT ATGCCGTAAT AGGATTCCAG
ACAGCTTATC TTATGAGATA TTATCCAGTA GAGTTTTTAG CTGCTATGCT TAACTCTGTT
ATGGGTAACA GTGATAAGGT ATCTGAATAT ATAAGAAGTG CAGAAAAGCT TGGAATTCAG
GTATTACCAC CAGATATAAA TGAGAGTTAT ACTAAGTTTA CAGTTAAGGG TGATACTATA
AGATTTGGTA TGGGAGCTAT AAAAAATGTA GGGGTAAATG TTGTAGAAAA CATAGCTAAA
TCAAGAGATG AAAAGGGTAA ATTCACTTCA CTTATGGATT TTTGCAATAA AATAGACTTA
AGTATTGTAA ATAAAAGATC AGTAGAAAGT TTAATAAAAG CTGGTACATT TGATAGCTTA
AATGTTTATA GATCACAATT ATTATCTGTT TTTGAAAAAA TAATGGATGG AGTATATTCT
CAGAGAAAGA AAAATATAGA TGGGCAAATG TCATTGTTTG GAGCGCTTCA AGAGGAAAGT
GAATCAAACT TAGAAATAAG ATACCCAAAC ATAAAGGAAT TCAACAAGAA ATATATGCTT
GCTATGGAAA AGGAAATGAC AGGACTTTAT ATGAGTGGTC ATCCACTAGA TGATTATGAA
AAGCCTTTAA AGGAACAAAC ATCAATAACT ATAGAAAAAA TAATAGAAGC TCAAAAGAAT
TTAAATGAGC AAAAAAATGT TGAATTAGAA GATTTAGTCT TAGACAGTTC TTTAACTGAT
GGAACTAGAG TGATTATAGG GGGAATTCTT ACTAATGTTT CTAGAAAGGT AACTAGAAAT
AATACTCTTA TGGCATTTGC AAAGGTAGAG GACTTAACTG GCTATTTAGA ATGTGTTATA
TTCCCTAAAA CCTTAGAAAA ATGTAATGCA CTAGTAAATG AGGACTCCTT TGTTTTAATA
AGAGGAAGAG TTTCTCTAAA GGAAGATGAG GAGCCAAAAA TACTTTGTGA AGATATACAA
CCACTAGAAT TAATAAATTC TTCAAAGGTG TATATAAAGG TTGAAGATAG AGAAAAAGCT
AATATGATAG TTAAGCCTTT AAGAGTTTTA TTATCACAAT ATAAAGGTGA TTCTCCAGTT
TATATATTTG CAGCAAAGGA AAAAGCATCT TTTAGATTAA ACAGAGATAT GTGGGTTGAT
TTAGATACAG ATGTAATAGA TTTTTTAATT AGCAAATTTG GAGAAGGAAA TGTAAAAGTA
GTTGAAGGTT AA
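
A GC content figure like the one in the Gene Information block can be recomputed from a sequence laid out in space-separated blocks of ten. The following is a minimal sketch, not the database's own method: `gc_fraction` is a hypothetical helper, and it is shown here only on the first row of the gene sequence, so its result is not expected to equal the whole-gene value.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First row of the gene sequence above (60 bases).
first_row = "TTGTTCTTAA ATCCGGAGAG AGTAAGTATG CCAGATATAG ATTCCGATTT CTGTTATGAG"
print(f"{gc_fraction(first_row):.1%}")  # 36.7%
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content field.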
 
Protein sequence
MFLNPERVSM PDIDSDFCYE GRQRVIDYVV EKYGHDNVSQ IITFGTMAAR ACIRDVGRAM 
NYTYAEVDRI AKQIPTVLGV TIDKALDLNP ELKTIYDTEE RVKELIDVSR RLEGLPRHSS
THAAGVVIAS QPLVEYVPLQ KNDESIVTQF DMTTLEELGL LKMDFLGLRT LTVMRDAVDM
IKYNRGVDID LDNLDFDDKE VYKMIGEGNT VGVFQLESAG MTSFMKELKP DCLEDIIAGI
SLYRPGPMAE IPRYISGKRD PKSVEYIVPE LENILNVTYG VMVYQEQVME IVRKLAGYSM
GRSDLVRRAM SKKKHKVMEE ERKNFIYGIE DENGNIEVPG CLRNGISAEA ANKIFDSMMD
FASYAFNKSH AAAYAVIGFQ TAYLMRYYPV EFLAAMLNSV MGNSDKVSEY IRSAEKLGIQ
VLPPDINESY TKFTVKGDTI RFGMGAIKNV GVNVVENIAK SRDEKGKFTS LMDFCNKIDL
SIVNKRSVES LIKAGTFDSL NVYRSQLLSV FEKIMDGVYS QRKKNIDGQM SLFGALQEES
ESNLEIRYPN IKEFNKKYML AMEKEMTGLY MSGHPLDDYE KPLKEQTSIT IEKIIEAQKN
LNEQKNVELE DLVLDSSLTD GTRVIIGGIL TNVSRKVTRN NTLMAFAKVE DLTGYLECVI
FPKTLEKCNA LVNEDSFVLI RGRVSLKEDE EPKILCEDIQ PLELINSSKV YIKVEDREKA
NMIVKPLRVL LSQYKGDSPV YIFAAKEKAS FRLNRDMWVD LDTDVIDFLI SKFGEGNVKV
VEG
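
The gene sequence begins with TTG while the protein begins with M, which is consistent with translation table 11, where TTG can serve as an initiator codon read as methionine. The sketch below illustrates this on the first and last rows of the gene sequence. It is a simplified illustration under stated assumptions: the codon table is the standard amino acid assignment (which table 11 shares), `START_CODONS` lists only three common bacterial initiators rather than the full table 11 set, and `translate` is a hypothetical helper.

```python
# Standard codon assignments, built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a simplified subset of the initiator codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str, initiator: bool = True) -> str:
    """Translate DNA to protein, stopping at the first stop codon.

    If `initiator` is true, a recognized start codon in the first
    position is read as methionine regardless of its usual meaning.
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - len(bases) % 3, 3):
        codon = bases[pos:pos + 3]
        if pos == 0 and initiator and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "TTGTTCTTAA ATCCGGAGAG AGTAAGTATG CCAGATATAG ATTCCGATTT CTGTTATGAG"
last_row = "GTTGAAGGTT AA"

print(translate(first_row))                  # MFLNPERVSMPDIDSDFCYE
print(translate(last_row, initiator=False))  # VEG
```

The two outputs match the first twenty residues and the final three residues of the protein sequence above; the last codon of the gene, TAA, is a stop codon and contributes no residue.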