Gene Cphy_3041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3041 
Symbol 
ID5743367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp3718159 
End bp3719544 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content37% 
IMG OID641294142 
Productaldehyde dehydrogenase 
Protein accessionYP_001560137 
Protein GI160881169 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAGA TAGAAACAGG GTATCAAAAA CTTGTAACAA ACCAAAGAGA ATTCTTTCGT 
ACTGGCAAGA CCAAACAAGT GGATTTTCGA ATTCAAGCAC TAAAGAAACT TCAGTCCGAG
ATTAAAAATC GAGAAGCAGA AATAATGGAG GCTTTAAAGA AAGACCTAAA TAAATCAAGT
TTTGAATCTT ATATGACAGA AATCGGTATG GTACTTGATG AAATCCGTCA TTGTATCGCA
CATGTGAAAA AGTGGTCCAA ACCAAAGAGT GTTAAAACTC CACTTGCGCA ATTTCCTTCA
AAGAGTTTTA CCATATCGGA ACCATATGGT GTTGTACTAA TTATGTCTCC TTGGAATTAT
CCATTTCAAT TATGCATAGA ACCATTAATT GGTGCTATTA CAGCAGGAAA CTGTGCAGTA
TTAAAGCCGT CCGCATATGC AGCAGAAACG TCTAAAGTAA TCAATACCTT AATACGTGCT
TGCTTTCCAA AGGAGTACGT TACGGTAATT GAAGGCGGTA GAAAAGAGAA TCAGGGATTA
CTGGCTACGA GATTTGATTA TATCTTCTTT ACCGGTGGTG TCGAAGTCGG AAAGATTGTT
ATGGAAGCAG CAGCTCAATT CCTAACTCCA GTGTCATTAG AGCTTGGAGG TAAGAGCCCT
TGTATTATTG AGAAATCAGC AGATATCAAT CTTGCTGCAA AGCGTGTTGC TTTTGGAAAG
TATCTCAATG CTGGTCAGAC ATGTGTTGCA CCTGATTATG TTTTCGTTCA GAAAGAAGTG
GAAGAGGAAT TTTTTAAGAA ATTAGGGTTG TGGGTACACA AATTCTTTGG TGAAGAACCT
TTAAAGAATG AAAATCTTCC GAAAATTATT AATGAACATC ATTATCATAG ATTACTTTCC
CTTCTTGAGG GAGAAGATAT TGTCATCGGT GGAAAAGGAC AGGATAATAT AAGAAAGATT
GAACCTACGG TACTAAAAAG TGTATCAACG GATTCCAATA TAATGCAAGA AGAAATTTTT
GGACCGATTC TCCCTGTACT TAGCTATAAG ACAATAGAGG AAGTAATAGA GTATGTCACA
GCACACGAAA AGCCATTGGC ATGCTATTTA TTTACAACGA ATGTACAGAT AGAAAAGAAA
GTATTAAAGC ACGTTTCTTT TGGTGGTGGA TGTGTCAACG ATACCATTAT TCATCTTGCA
ACACCTTATA TGGGATTTGG TGGTGTTGGT GCTAGTGGTA TGGGAAGTTA TCATGGATTT
GAAAGTTTTC GCACGTTTAG TCATACTAAG AGCATTGTGA AAAAAGCAAA TTGGCTTGAT
CTTCCGATGA GATACCATCC ATATACAGAG AAGAATTTGA AAATGATTCG TAAATTCTTA
AAATAG
 
Protein sequence
MSEIETGYQK LVTNQREFFR TGKTKQVDFR IQALKKLQSE IKNREAEIME ALKKDLNKSS 
FESYMTEIGM VLDEIRHCIA HVKKWSKPKS VKTPLAQFPS KSFTISEPYG VVLIMSPWNY
PFQLCIEPLI GAITAGNCAV LKPSAYAAET SKVINTLIRA CFPKEYVTVI EGGRKENQGL
LATRFDYIFF TGGVEVGKIV MEAAAQFLTP VSLELGGKSP CIIEKSADIN LAAKRVAFGK
YLNAGQTCVA PDYVFVQKEV EEEFFKKLGL WVHKFFGEEP LKNENLPKII NEHHYHRLLS
LLEGEDIVIG GKGQDNIRKI EPTVLKSVST DSNIMQEEIF GPILPVLSYK TIEEVIEYVT
AHEKPLACYL FTTNVQIEKK VLKHVSFGGG CVNDTIIHLA TPYMGFGGVG ASGMGSYHGF
ESFRTFSHTK SIVKKANWLD LPMRYHPYTE KNLKMIRKFL K