Gene CPF_0895 details


Gene Information

Locus tag: CPF_0895
Symbol:
ID: 4203908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1060624
End bp: 1062102
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 35%
IMG OID: 638081777
Product: ethanolamine utilization protein
Protein accession: YP_695344
Protein GI: 110799892
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR02518] acetaldehyde dehydrogenase (acetylating)
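
The coordinates and lengths in this record agree with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1060624
end_bp = 1062102

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1479
print(protein_length)  # 492
```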


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGTTTT TAGATAAAGA CTTAGTTTCC ATACAGGAAA CTAGGGGACT TATAAGAAAG 
GCTAAAGAAG CTCAAAGTAA GTTAGCCCGT ATGAGTCAAC AAGACATAGA TAGAATAGTT
AAAGCTATAT CTGATGCTGC TTATGAAAAT TCTGAAAAAC TTGCTAAGAT GGCAAATGAG
GAAACAGGAT TTGGTAAATG GGAAGATAAA GTATTAAAAA ATGTTTTTGC TGCAAGAACA
GTTTATGAAT CAATAAAGGA TACAAAAACT GTTGGAATAG TAGAGGAAAA TGTTGAGAAA
AGAGCATTTA AAATTATGGT GCCAGTAGGA GTAGTTGCAG GACTTATACC ATCAACAAAT
CCAACTTCAA CAGCAATCTA CAAAGCTATG ATTTCAATAA AAGCTAGAAA TGCAATTGTT
TTATCACCAC ATCCAAGTGC AAAAAAATGT ATTATTGAAA CAGCTAAGAT AATAGCTAAG
GCTGCTGAGA GAGCAGGTTG TCCAGAAGGA GCTATAGGAT GCATTACAGT ACCTTCAATA
GAAGGAACTA ATGAACTTAT GAAGAATAAA GATACTTCAT TAATCCTAGC AACAGGTGGA
GAAGCAATGG TTAGAGCTGC TTATTCATCA GGAACACCAG CTATAGGTGT TGGACCAGGA
AATGGACCAG CATTCATAGA TAAGAGTGCG GATGTTAAGT TAGCAGTTAA GAGAATATTA
GATTCTAAAA CTTTTGATAA TGGAACAATA TGTGCTTCAG AACAATCAAT AGTAGTTGAA
AAAGCTATGG AAGATGAAGT TGTTAGAGAA TTAAAAGAAC AAGGAGCTTA CTTCTTAACT
GAAGAACAAG CAAATCAACT ATCTAAATTT GTAATGAGAG CAAATGGAAC TATGAATCCA
CAAATAGTTG GAAAAACTCC TCAAGACATA GCTAAATTAG CTGGTTTAGA AGGAATTCCA
TCTTGGGCAA GAGTTTTAAT TCCAAGAGAG TGCCATGTTG GACATAAGTA TCCATTCTCA
AGAGAAAAGT TAACAACTAT TTTAACTTTC TTTGTAGAAG AGAATGTAGA TGCTGTATTA
AATAGATGTA GAGAAATATT ACTAAATGAA GGAGCTGGAC ATACATTCTG TATGCATGCA
AATAATGAGG AGCTTGTTAA GAGATTCGCA TTAGAAATGC CGGTTTCAAG AATGGTAGTA
AATTCACCAG GAGCTCTTGG TGGAATAGGA GCTACAATCA ACTTAGTACC TGCCTTAACA
TTAGGATGTG GTGCAGTTGG AGGAAGTTCT ACATCACATA ACATAGGACC ATTAGATTTA
ATGAATACGA GAAATGTAGC TTATGGAGTT AGAGAACTTG AAGATATTAG AGAGTTAGCT
GTAGGAGCTA CAAAGGAAGT TGCTGCAAGT TTAGATTTAG GTGACAAGGA AGAATTAATA
AACTTATTAG TTAAGAAGAT TATTGAGGAA CTTAAGTAA
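
The record reports a GC content of 35%. A minimal sketch of how such a figure could be recomputed from a sequence printed in 10-base groups is shown here. The function name `gc_percent` is hypothetical, and how the database rounds its value is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks used to group the printed sequence.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)

# First ten bases of the gene sequence above: 3 of 10 are G or C.
print(gc_percent("ATGGAGTTTT"))  # 30.0
```

Passing the full gene sequence, with its spaces and line breaks left in, gives the value for the whole gene.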
 
Protein sequence
MEFLDKDLVS IQETRGLIRK AKEAQSKLAR MSQQDIDRIV KAISDAAYEN SEKLAKMANE 
ETGFGKWEDK VLKNVFAART VYESIKDTKT VGIVEENVEK RAFKIMVPVG VVAGLIPSTN
PTSTAIYKAM ISIKARNAIV LSPHPSAKKC IIETAKIIAK AAERAGCPEG AIGCITVPSI
EGTNELMKNK DTSLILATGG EAMVRAAYSS GTPAIGVGPG NGPAFIDKSA DVKLAVKRIL
DSKTFDNGTI CASEQSIVVE KAMEDEVVRE LKEQGAYFLT EEQANQLSKF VMRANGTMNP
QIVGKTPQDI AKLAGLEGIP SWARVLIPRE CHVGHKYPFS REKLTTILTF FVEENVDAVL
NRCREILLNE GAGHTFCMHA NNEELVKRFA LEMPVSRMVV NSPGALGGIG ATINLVPALT
LGCGAVGGSS TSHNIGPLDL MNTRNVAYGV RELEDIRELA VGATKEVAAS LDLGDKEELI
NLLVKKIIEE LK
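
The protein sequence can be related to the gene sequence by translating it codon by codon. The sketch below is one way to do this in plain Python, not the method used by the database. It builds the codon table from the NCBI-style amino acid string, which is identical for translation tables 1 and 11 (the two tables differ only in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the printed sequence.
    bases = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First line of the gene sequence above.
first_line = "ATGGAGTTTT TAGATAAAGA CTTAGTTTCC ATACAGGAAA CTAGGGGACT TATAAGAAAG"
print(translate(first_line))  # MEFLDKDLVSIQETRGLIRK
```

The output matches the first 20 residues of the protein sequence, and the last nine bases of the gene (`CTTAAGTAA`) translate to `LK` followed by a stop codon, matching its final two residues.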