Gene Information |
Locus tag | CPR_1726 |
Symbol | engA |
ID | 4204226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 1922415 |
End bp | 1923731 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 642566276 |
Product | GTP-binding protein EngA |
Protein accession | YP_699041 |
Protein GI | 110803315 |
COG category | [R] General function prediction only |
COG ID | [COG1160] Predicted GTPases |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain; [TIGR03594] ribosome-associated GTPase EngA |
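The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the Gene Information table.
length = gene_length_bp(1922415, 1923731)
print(length)                     # compare with the Gene Length field
print(protein_length_aa(length))  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (1317 bp) and Protein Length (438 aa) fields.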
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.138722 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAC CAATAGTTGC TATGGTTGGA AGACCGAACG TAGGTAAGTC GACTCTTTTC AATAAATTAG CAGGAAAAAG AATTTCAATA GTACAAGATA CACCAGGGGT TACTAGAGAC AGAGTATATG CAGAATCAGA ATGGTTAAAC AGAAAATTCA CAATGATAGA TACAGGTGGA ATAGAGCCTG AAAGTAGTGA TATAATTGTT AAACAAATGA GAAGACAAGC GCAAATTGCT ATAGAAATGG CTGATGTAAT AGTATTCGTT GTTGATGGTA AGGAAGGACT TACTGCTGCT GACCAAGAAG TTGCACAAAT GCTTAGAAAA AGTAAAAAGC CTGTTGTTTT AGTAGTTAAT AAAATAGATA GATTAGCTTT AGAAGAAAAT AGCTATGAGT TCTATAATTT GGGAATTGGA GATCCTATAA CTATATCAGC ATCTCAAGGA TTAGGACTTG GAGATATGCT AGATGAGGTT GTTAAATATT TTAATGATCC TTCAGAAGAT GAAGAGGATG ATGAATATAT TAGAATAGCT ATGATAGGTA AACCAAATGT AGGTAAATCA TCACTTATAA ATAGATTATT AGGTGAAGAG AGAGTTATAG TAAGTAATGT TCCAGGAACA ACAAGAGATT CTATAGATAG TTACTTAGAA ACAGAAGATG GAAAGTTCAT CTTAGTTGAT ACTGCTGGAT TAAGAAGAAA AAGTAAAGTA AAAGAAGAAA TAGAAAGATA TAGTGTAATC AGAACTTATG CTGCCATAGA GAAAGCTGAT GTAGCTATAC TTGTAATAGA TGCTGAGCAA GGAATAACTG AGCAAGATGA AAAAATAATA GGATATGCTC ATGAAATGAA TAAAGCAATT ATGGTTGTTG TAAATAAATG GGATCTTATT GAAAAAGATG ATAAAACATT AAGTAATTAT CAAAAAGACT TACAACAAAA ACTTAAGTTT ATGCCATATG CTAAATACTT ATTCATATCA GCTTTAACAG GACAAAGAGT ACATAAAATA TTATCAACAG CTAAATATTG TTATGATAAT TACTCTAAGA GAGTTTCAAC TGGATTATTA AATGATGTTA TAAGTAAGGC TGTTTTAATG AAAGAGCCAC CAGTTGTAGC CTTAAAGAGA TTAAAAATAT ACTATGCTAC TCAGGTTGCT ACAAAGCCAC CTAAGTTTGT GTTCTTTGTA AATGACCCTA ATTTATTACA TTTCTCATAT GGTAGATATT TAGAAAACCA ATTAAGAGAA AGTTTTGATT TTGATGGAAC TGGTATAGAA ATAGAATATA GAGCTAGAAA GGAGTAA
|
Protein sequence | MSKPIVAMVG RPNVGKSTLF NKLAGKRISI VQDTPGVTRD RVYAESEWLN RKFTMIDTGG IEPESSDIIV KQMRRQAQIA IEMADVIVFV VDGKEGLTAA DQEVAQMLRK SKKPVVLVVN KIDRLALEEN SYEFYNLGIG DPITISASQG LGLGDMLDEV VKYFNDPSED EEDDEYIRIA MIGKPNVGKS SLINRLLGEE RVIVSNVPGT TRDSIDSYLE TEDGKFILVD TAGLRRKSKV KEEIERYSVI RTYAAIEKAD VAILVIDAEQ GITEQDEKII GYAHEMNKAI MVVVNKWDLI EKDDKTLSNY QKDLQQKLKF MPYAKYLFIS ALTGQRVHKI LSTAKYCYDN YSKRVSTGLL NDVISKAVLM KEPPVVALKR LKIYYATQVA TKPPKFVFFV NDPNLLHFSY GRYLENQLRE SFDFDGTGIE IEYRARKE
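The gene and protein sequences above can be related to each other, and to the GC content field, with a short script. This is a minimal sketch, not the method used to build the record: the helper names are hypothetical, the codon table is the standard amino acid assignment (which translation table 11 shares; table 11 differs in its permitted start codons), and the example uses only the first 30 nt of the gene sequence.

```python
from itertools import product

# Standard codon table in TCAG order; translation table 11 assigns the
# same amino acids to these codons.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the blocks-of-ten spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the Gene sequence field.
fragment = "ATGAGTAAAC CAATAGTTGC TATGGTTGGA"
print(translate(fragment))   # MSKPIVAMVG, the first block of the protein
print(gc_fraction(fragment))
```

Pasting the full Gene sequence field into `translate` and `gc_fraction` allows a comparison against the Protein sequence and GC content fields.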