Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_0895 |
Symbol | |
ID | 4203908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 1060624 |
End bp | 1062102 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 638081777 |
Product | ethanolamine utilization protein |
Protein accession | YP_695344 |
Protein GI | 110799892 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02518] acetaldehyde dehydrogenase (acetylating) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGTTTT TAGATAAAGA CTTAGTTTCC ATACAGGAAA CTAGGGGACT TATAAGAAAG GCTAAAGAAG CTCAAAGTAA GTTAGCCCGT ATGAGTCAAC AAGACATAGA TAGAATAGTT AAAGCTATAT CTGATGCTGC TTATGAAAAT TCTGAAAAAC TTGCTAAGAT GGCAAATGAG GAAACAGGAT TTGGTAAATG GGAAGATAAA GTATTAAAAA ATGTTTTTGC TGCAAGAACA GTTTATGAAT CAATAAAGGA TACAAAAACT GTTGGAATAG TAGAGGAAAA TGTTGAGAAA AGAGCATTTA AAATTATGGT GCCAGTAGGA GTAGTTGCAG GACTTATACC ATCAACAAAT CCAACTTCAA CAGCAATCTA CAAAGCTATG ATTTCAATAA AAGCTAGAAA TGCAATTGTT TTATCACCAC ATCCAAGTGC AAAAAAATGT ATTATTGAAA CAGCTAAGAT AATAGCTAAG GCTGCTGAGA GAGCAGGTTG TCCAGAAGGA GCTATAGGAT GCATTACAGT ACCTTCAATA GAAGGAACTA ATGAACTTAT GAAGAATAAA GATACTTCAT TAATCCTAGC AACAGGTGGA GAAGCAATGG TTAGAGCTGC TTATTCATCA GGAACACCAG CTATAGGTGT TGGACCAGGA AATGGACCAG CATTCATAGA TAAGAGTGCG GATGTTAAGT TAGCAGTTAA GAGAATATTA GATTCTAAAA CTTTTGATAA TGGAACAATA TGTGCTTCAG AACAATCAAT AGTAGTTGAA AAAGCTATGG AAGATGAAGT TGTTAGAGAA TTAAAAGAAC AAGGAGCTTA CTTCTTAACT GAAGAACAAG CAAATCAACT ATCTAAATTT GTAATGAGAG CAAATGGAAC TATGAATCCA CAAATAGTTG GAAAAACTCC TCAAGACATA GCTAAATTAG CTGGTTTAGA AGGAATTCCA TCTTGGGCAA GAGTTTTAAT TCCAAGAGAG TGCCATGTTG GACATAAGTA TCCATTCTCA AGAGAAAAGT TAACAACTAT TTTAACTTTC TTTGTAGAAG AGAATGTAGA TGCTGTATTA AATAGATGTA GAGAAATATT ACTAAATGAA GGAGCTGGAC ATACATTCTG TATGCATGCA AATAATGAGG AGCTTGTTAA GAGATTCGCA TTAGAAATGC CGGTTTCAAG AATGGTAGTA AATTCACCAG GAGCTCTTGG TGGAATAGGA GCTACAATCA ACTTAGTACC TGCCTTAACA TTAGGATGTG GTGCAGTTGG AGGAAGTTCT ACATCACATA ACATAGGACC ATTAGATTTA ATGAATACGA GAAATGTAGC TTATGGAGTT AGAGAACTTG AAGATATTAG AGAGTTAGCT GTAGGAGCTA CAAAGGAAGT TGCTGCAAGT TTAGATTTAG GTGACAAGGA AGAATTAATA AACTTATTAG TTAAGAAGAT TATTGAGGAA CTTAAGTAA
|
Protein sequence | MEFLDKDLVS IQETRGLIRK AKEAQSKLAR MSQQDIDRIV KAISDAAYEN SEKLAKMANE ETGFGKWEDK VLKNVFAART VYESIKDTKT VGIVEENVEK RAFKIMVPVG VVAGLIPSTN PTSTAIYKAM ISIKARNAIV LSPHPSAKKC IIETAKIIAK AAERAGCPEG AIGCITVPSI EGTNELMKNK DTSLILATGG EAMVRAAYSS GTPAIGVGPG NGPAFIDKSA DVKLAVKRIL DSKTFDNGTI CASEQSIVVE KAMEDEVVRE LKEQGAYFLT EEQANQLSKF VMRANGTMNP QIVGKTPQDI AKLAGLEGIP SWARVLIPRE CHVGHKYPFS REKLTTILTF FVEENVDAVL NRCREILLNE GAGHTFCMHA NNEELVKRFA LEMPVSRMVV NSPGALGGIG ATINLVPALT LGCGAVGGSS TSHNIGPLDL MNTRNVAYGV RELEDIRELA VGATKEVAAS LDLGDKEELI NLLVKKIIEE LK
|
| |