Gene CPR_1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1042 
Symbol 
ID4204322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1187247 
End bp1189187 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content29% 
IMG OID642565599 
Producttranslation elongation factor G, putative 
Protein accessionYP_698365 
Protein GI110801473 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0480] Translation elongation factors (GTPases) 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00484] translation elongation factor EF-G 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000207187 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA CTATTGGTAT ATTAGCTCAT GTTGATGGAG GAAAAACAAC TTTTTCTGAG 
CAACTTTTAT ATCATACAAA TAGTATAAGA AATAGGGGAA GAGTTGATCA TAAGAATTCT
TATTTAGATA ATAATGAAAT AGAGAAAGAC AGAGGTATAA CTATATATTC TGAGGTAGGT
AAATTTTCTA TAGAAAATCA AGAATATTAT CTTATAGATA CTCCAGGGCA TATAGATTTT
TCACCAGAAA TGGAAAGAGC TATTAGTGTT TTAGATTATG CTATTTTGAT TATAAGTGCA
GTAGAAGGGG TTCAGGGACA TAGTGAAACA ATATGGGAAT TATTAAATAA GTATAAGATT
CCTACTTTTA TTTTCATAAA TAAGATTGAT AGAGAAGGAG CAGAAGTAAA TAAGGTTATA
AATGAAATGA AACACAAACT TAGTGAAGAT ATTATTTTCT TTTCAAGTGA ATTAGAAGAT
GAGACTATAG AAGAAGTAGT TGAAAGAGAT GAGGACTTAT TAAACTTATA TTTAGAAGGA
AATTTAAGTG AAGAAGAATT ATTAAATAAA ATACCAAGTA TGATAAAGGA ACTTAAAATT
TTTCCTTGTT TATGTGGTTC TGCTTTACTA GATGAGGGCG TAGAAGATTT TATAAGTTGG
TTTCATAACT TATCATTTAC TAACTATGAG GAATCGGAAG ATTCTTTTAA AGGAAGAGTT
TTTAAAGTAA GACATGATGA AAAGGGAAAT AGATTAACCT TTATAAAAGC TCTAAGTGGA
ATTTTAAGAA TTAAGGAAGA ATTGACATAT TTAAAAGAAG GAAAAGAGTT TTTAGAGAAA
GTAAATGAAA TTAGAATATA TAATGGAAGT AAATATGAAC TTGTAAATGA AGTTAAGGCA
GGAGATATAT TTGCAGTGGT AGGAGTTAAA GGACTAGAAT CTGGTGATGG AATTGGTATA
GAAAATATTG ATTCATATGA TATGGTTCCT ACTTTGAAGT CTAAGGTGGT TTATAGAGAA
GGGTTAAATC CAAAGGAAGT ACTTTCATGG TTTAAAATCT TAGAAAGTGA AGAGAGTACC
TTAAGTATAT CTTGGGATGA AAGATTAAAA GAAATTCACG TTAATATTAT GGGAAAAGTT
CAATTGGAAG TTCTTAAAGA AGTTATGAAA AATAGATTTA ATGAAGAAAT AGAATTTGGA
ACTCCAGAGA TATTATATAA AGAGACATTA AATGAAGAAG TAATAGGATA TGGCCATTTT
GAGCCTTTAG GACATTATAG TGAGGTTCAC TTAAAAATTG AGCCTTTAGA AAGAAATTCA
GGAATAGTAT TTGAAAATAA ATGCCATGCA GATGATCTTA CAGTTGGAAA TCAAAATTTA
ATAAGGACTC ATATATTTGA ATGTGAGCAT AAAGGAATAT TAACAGGTTC GCCTATTACG
GATCTTAAAA TAACCTTATT AACAGGAAAA GCTCACAACA AACACACAAG TGGTGGGGAT
TTTAGAGAAG CTACAAAGAG AGCTTTAAGA CAGGGATTAG AAAGTGGAGA AAATAAACTT
TTGGAGCCCT ATTATAAGTT TAAAATAGAT GTGGATCTTA ACTTAATAGG AAGAGTAATG
AATGATATAC AAAAGATGCA TGGGGAGTTT AAGGATCCAA TTATAGATGG AGAGAGAGCA
ACTATAGAAG GAAATGGACC TGTTTCTACA TTTATAAATT ATGGTATGGA GTTTCAGTCA
TTTACTAAGG GAAAGGGAGG ACTTTCTCTT AAGTTTCATG GATATGATTT ATGTCATAAT
GAAGAAGAGA TTATAAAAAA GAGGGCATAT GATAGAAATG CAGACATTGA TTATACTTCT
ACTTCTATAT TTTGTTCAAA GGGTCAAGCT TATTTAGTTA AAGGAGAAGA GGCAAAAGAA
CATATGCATT GTTTAGTTTA G
 
Protein sequence
MKKTIGILAH VDGGKTTFSE QLLYHTNSIR NRGRVDHKNS YLDNNEIEKD RGITIYSEVG 
KFSIENQEYY LIDTPGHIDF SPEMERAISV LDYAILIISA VEGVQGHSET IWELLNKYKI
PTFIFINKID REGAEVNKVI NEMKHKLSED IIFFSSELED ETIEEVVERD EDLLNLYLEG
NLSEEELLNK IPSMIKELKI FPCLCGSALL DEGVEDFISW FHNLSFTNYE ESEDSFKGRV
FKVRHDEKGN RLTFIKALSG ILRIKEELTY LKEGKEFLEK VNEIRIYNGS KYELVNEVKA
GDIFAVVGVK GLESGDGIGI ENIDSYDMVP TLKSKVVYRE GLNPKEVLSW FKILESEEST
LSISWDERLK EIHVNIMGKV QLEVLKEVMK NRFNEEIEFG TPEILYKETL NEEVIGYGHF
EPLGHYSEVH LKIEPLERNS GIVFENKCHA DDLTVGNQNL IRTHIFECEH KGILTGSPIT
DLKITLLTGK AHNKHTSGGD FREATKRALR QGLESGENKL LEPYYKFKID VDLNLIGRVM
NDIQKMHGEF KDPIIDGERA TIEGNGPVST FINYGMEFQS FTKGKGGLSL KFHGYDLCHN
EEEIIKKRAY DRNADIDYTS TSIFCSKGQA YLVKGEEAKE HMHCLV