Gene CPF_1233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1233 
Symbol 
ID4201081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1401770 
End bp1403710 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content31% 
IMG OID638082114 
Productputative tetracycline resistance protein 
Protein accessionYP_695679 
Protein GI110800811 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0480] Translation elongation factors (GTPases) 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00484] translation elongation factor EF-G 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.139219 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAGA CTATTGGTAT ATTAGCTCAT GTTGATGGAG GAAAAACCAC TTTTTCTGAG 
CAACTTTTAT ATCATACAAA GAGTATAAGA AATAGAGGAA GAGTTGATCA TAAAAATTCT
TATTTAGATA ATAATGAAAT AGAGAGGGAT AGGGGGATAA CTATATATTC TGAGGTGGGG
AAATTTTCTA TAGAAAATCA AGAATATTAT CTTATAGATA CTCCAGGGCA TATAGATTTT
TCACCAGAAA TGGAAAGAGC TATTAGTGTT TTAGATTATG CTATTTTGAT TATAAGTGCA
GTAGAAGGGG TTCAGGGACA TAGTGAAACA ATATGGGAAT TATTAAATAA GTATAAGGTT
CCTACTTTTA TTTTCATAAA TAAGATTGAT AGAGAAGGAG CAGAAGTAAA TAAAGCTATA
AATGAAATGA AACACAAACT TAGTGAAGAT ATTATTTTCT TTTCAAGTAA ACTAGAGGAG
GGGACTATAG AAGAAGTAGT TGAAAGCGAT GAGGACTTAC TAAACTTATA TTTAGAAGGA
AATTTAAGTG AAGAAGAATT ATTAAATAAA ATACCAAGCA TGATAAAGGA ACTTAAAATT
TTTCCTTGCT TATGTGGTTC TGCTTTACTA GATGAGGGCA TAGAAGATTT TATAAGGTGG
TTTCACAACT TATCATTTAC TAACTATGAG GAAAGAGAAG ATTCTTTTAG AGGAAGAGTT
TTTAAAGTAA GACATGATGA AAAGGGAAAT AGATTAACTT TTATAAAAGC TCTAAGTGGA
ACTTTAAGAA CTAAGGAAGA ATTGACATAT TTAAAAGAAG GAAAAGAGTC TTTAGAGAAA
GTAAATGAAG TTAGAATATA TAATGGAAGT AAATATGAAC TTGTAAATGA AGTCAGGGCA
GGAGATATAT TTGCGGTGGT AGGAGTTAAA GGACTAGAAT CAGGTGATGG GATTGCTATA
GAAAATATTG ATTCATATGA TATGATTCCT ACTTTGAAGT CTAAGGTGGT TTATAGAGAA
GGGTTAAATC CAAAGGAAGT ACTTTCATGG TTTAAAATTT TAGAAAGTGA AGAGAGTACT
TTAAGTGTAT CTTGGGATGA AAGATTAAAA GAAATTCACG TTAATATTAT GGGAAAAGTT
CAATTGGAAG TTCTTAAAGA AGTTATGAAA AATAGATTTA ATGAAGAAAT AGAATTTGGA
ACTCCAGAGA TATTATATAA GGAAACATTA AATGAAGAAG TGATAGGATA TGGCCATTTT
GAGCCTTTAG GACATTATAG TGAGGTTCAC TTAAAAATTG AGCCTGGGGA AAGAAATTCA
GGAGTAGTAT TTGAAAATAA GTGCCATGCA GATGATCTTA CACCAGGAAA TCAAAATTTA
ATAAGGACTC ATATATTTGA ATGTGAGCAT AAGGGAATAT TAACAGGTTC GCCTATTACG
GATCTTAAAA TAACCTTATT AACAGGAAGA GCTCACAATA AACATACAAG TGGTGGAGAT
TTTAGAGAAG CTACCAAGAG AGCTTTAAGA CAGGGATTAG AAAGTGGGGA AAATAAACTT
TTGGAGCCCT ATTATAAGTT TAAAATAGAT GTGGATCTTA ACCTAATAGG AAGAGTAATG
AATGATATAC AAAGGATGAA TGGAGAGTTT AAGGATCCTA TTATAGATGG AGAGAGAGCA
ACCATAGAAG GAAAGGGGCC TGTTTCTACA TTTACAAATT ATGGTATGGA GTTTCAGTCA
TTTACTAAGG GAAAGGGAGG ACTTTCTCTT AAGTTTCATG GGTATGATTT ATGTCATAAT
GAAGAAGAGA TTATTGAAAA GAGGGCATAT GATAGAAATG CAGACATTGA TTATACTTCT
ACTTCTATAT TTTGTTCAAA GGGTCAAGCT TATTTAGTTA AAGGAGAAGA GGCAAAAGAA
CATATGCATT GTTTAGTTTA G
 
Protein sequence
MKKTIGILAH VDGGKTTFSE QLLYHTKSIR NRGRVDHKNS YLDNNEIERD RGITIYSEVG 
KFSIENQEYY LIDTPGHIDF SPEMERAISV LDYAILIISA VEGVQGHSET IWELLNKYKV
PTFIFINKID REGAEVNKAI NEMKHKLSED IIFFSSKLEE GTIEEVVESD EDLLNLYLEG
NLSEEELLNK IPSMIKELKI FPCLCGSALL DEGIEDFIRW FHNLSFTNYE EREDSFRGRV
FKVRHDEKGN RLTFIKALSG TLRTKEELTY LKEGKESLEK VNEVRIYNGS KYELVNEVRA
GDIFAVVGVK GLESGDGIAI ENIDSYDMIP TLKSKVVYRE GLNPKEVLSW FKILESEEST
LSVSWDERLK EIHVNIMGKV QLEVLKEVMK NRFNEEIEFG TPEILYKETL NEEVIGYGHF
EPLGHYSEVH LKIEPGERNS GVVFENKCHA DDLTPGNQNL IRTHIFECEH KGILTGSPIT
DLKITLLTGR AHNKHTSGGD FREATKRALR QGLESGENKL LEPYYKFKID VDLNLIGRVM
NDIQRMNGEF KDPIIDGERA TIEGKGPVST FTNYGMEFQS FTKGKGGLSL KFHGYDLCHN
EEEIIEKRAY DRNADIDYTS TSIFCSKGQA YLVKGEEAKE HMHCLV