Gene Information |
Locus tag | CPR_2198 |
Symbol | |
ID | 4206089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 2427188 |
End bp | 2428279 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 642566748 |
Product | ethanolamine utilization protein EutH |
Protein accession | YP_699498 |
Protein GI | 258676973 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3192] Ethanolamine utilization protein |
TIGRFAM ID | |
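The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function name `check_cds_lengths` is hypothetical and not part of any database API.

```python
def check_cds_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Cross-check CDS coordinates against the reported lengths.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon (assumptions, not stated in the record).
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    codons, remainder = divmod(span, 3)   # number of codons, leftover bases
    return {
        "span_bp": span,
        "span_matches_gene_length": span == gene_len_bp,
        "whole_codons": remainder == 0,
        # One codon is the stop codon, so it is not counted in the protein.
        "protein_matches": codons - 1 == protein_len_aa,
    }


# Values taken from the record for CPR_2198.
report = check_cds_lengths(2427188, 2428279, 1092, 363)
print(report)
```

For this record, the inclusive span is 1092 bp, which is 364 codons; subtracting the stop codon gives the 363 aa listed as the protein length.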
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.846609 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGAGAAAT TAGTGCTTGG TATAATAGGT ATTTTCTTTA TAATTAGTGG ATTTGATTAC ATAAATAATA ATAAATTGGG ATTAGGAGAT AAATTTAAAG AGGGAATGAT ATCTATGGGA TCAATAGCTA TATCCATGAT AGGGATATAT TCTCTTTCAC CTTTAATAGG AGAAGGAATA GGATTCTTAT TAACTCCAAT AAGCAATTTT CTGGGAATTG ATTCATCCAT ATTTCCATCA ATGTTTTTAG CTGTAGATAT GGGAGCCTTA GGAATTGCAG AAAGTTTATC ATCAAATATT CACATGTATT GGATTTCTGG AGTAATAATA GCTTCGACTT TAGGGGCGAC CATAAGCTTT TCTATTCCCT TAGCTTTAGG AATCATAGAG GAGAAGTATC TTGAAGACTT AACAACAGGT TTATTATATG GAATAATGAC TTTACCTATA GCGCCAATAG TTGCAGGTTT ATTTTTAGGA GTTGATATTA AATTATTACT ATTTAATATT TTTCCATTAA TAATATTTGC TGTATTATTA GCAGTTTTTA TGAATAGGTT TAAAGATACT ACAGTAAAAT TCTTTATTAA ATTAGGTAAG CTTATACAAC TTGTTAGTAT ATTAGGGCTT TTAGTTTTAG GATTTTTATC TATTATAGGA GTAAAGCCTA TAGGAAGTAT TTTACCTATA GATGAGGCTT TAAGTGTGGT TGGTAAAATA TCTATATTCT TAGGAGGAGC ATATCCTTTA ATTAATTTTA TAACAGAAAA ATTTTCAAAA ATCTTAAGTA GACTAGGAGA AAAGATAAAT ATAGATGAGT TTTCTATTGT AGTTTTTTTA GGAACTCTTG CCTCAAATAT AATATTATTC CAAAGCTTTG ATAAGATGAG CTCTAAGGGA AGAATGGCTT TAACTGCTTT TAGTGTAAGT GGAGCCTTTG TAATTGGAGG ACAGCTAGGA TTTGTATCTC TTAAGACACC TGAGATTATA AATATTTATA TAGCATCAAA ATTAATAGCT GGTATAACTG CCATGGCTGT AACATTAATA TTATATAGAA AAACAGAGGA AAACTTGAAT AATGAAAGTT AA
|
Protein sequence | MEKLVLGIIG IFFIISGFDY INNNKLGLGD KFKEGMISMG SIAISMIGIY SLSPLIGEGI GFLLTPISNF LGIDSSIFPS MFLAVDMGAL GIAESLSSNI HMYWISGVII ASTLGATISF SIPLALGIIE EKYLEDLTTG LLYGIMTLPI APIVAGLFLG VDIKLLLFNI FPLIIFAVLL AVFMNRFKDT TVKFFIKLGK LIQLVSILGL LVLGFLSIIG VKPIGSILPI DEALSVVGKI SIFLGGAYPL INFITEKFSK ILSRLGEKIN IDEFSIVVFL GTLASNIILF QSFDKMSSKG RMALTAFSVS GAFVIGGQLG FVSLKTPEII NIYIASKLIA GITAMAVTLI LYRKTEENLN NES
|
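The gene sequence begins with TTG while the protein begins with M, which is consistent with translation table 11, where TTG can act as an initiator codon. The sketch below illustrates that relationship under stated assumptions: the elongation codon assignments are those of the standard genetic code (which table 11 shares), the start codon set is the one NCBI lists for table 11, and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. It is an illustration, not the pipeline the database used.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiator codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon in first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks and the final bases of the gene sequence above.
print(translate("TTGGAGAAAT TAGTGCTTGG"))  # MEKLVL
print(translate("AATGAAAGTT AA"))          # NES
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the protein sequence and GC content fields of the record.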
| |