Gene Cphy_1416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_1416 
Symbol 
ID5742366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp1753956 
End bp1755491 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content44% 
IMG OID641292519 
Productaldehyde dehydrogenase family protein 
Protein accessionYP_001558530 
Protein GI160879562 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTTG TAGATAAAGA CCTGCTCTCC ATACAGGAGG CAAGGATTCT AATGGAAAGT 
GCACGAGATG CTAGAAACAC CATGCTCACA TTTCCTCAGG AAAGACTGGA TCTCATCGTA
GATGTTTTAG CCGCTGCAGC GAAGGAAATT GCTGAGGAAC TGGCAGTTAT GTCCGCAGAA
GAAACCGGAT ATGGACGATT TCAAGATAAG TATGTGAAGA ATAGATTTGT CTGCGATTAC
CTGCCAAAAC GCTTAAAAGC CATGAAATGT GTAGGCTTTT TAAATGAAAA TACTCAGTTA
AAGACCATGG ATGTAGGCGT TCCTGTAGGT GTGCTTGTTG CACTTGCTCC GCCGGTAAGC
CCGGTTTCCA CTGCGCTATA TAACGTACTG GTGGCTGTGA AGTCTGGTAA CCCCATTGTC
ATTGCTCCTC ATGAGAGAGC AAGAAAGGTA ACCGGTAAAC TTCTTGATCG TCTGTTAGAA
GCGGGAAAAT GCTACGGGCT TCCGGAAGGA GCAATCGGCT ATTTAAAGAC AGTAACAAGA
CCAGGAGCTC TGGAGCTAAT CCATCACCCA GCAGCAGCCA TGATTATAAA CACCGGAGTT
CCGGAGCTTA GTAGCGAAGC ATCTAAAAGC GGAAAGCCTT ATATTTACGG CGGAACGGGA
AATGGACCAG TCTTTATTGA ACGTACCGCT GATGTAAGAA AAGCGGTAGA GGATATCATT
GCAAGCCGCA CCTTTGATTA CGGAATCGTG TCTGCGGCAG AACAATATAT GGTAGTAGAC
AGTCTTATTG CAGCTGAAGT AAAAGCTGAG ATGTTAAGAA ACGGTGCCTA CTTCATGAAC
GAGGAAGAGG AGAAAAAGCT AATAGACCTC CTAAACCTTA CGAGTGGAAA GGCAGATACA
GAAATTATGG GAAGACCAGC CGAAGAACTT GCCAAACGAG CAGGATTTAT GGTACCTAAT
ACCACGACTG TGCTGGTTTC CGAACAGAAA TATATTTCCG ACAGGAACCC ATTTGCAAAA
GAGCTTCTTT GTCCTGTATT GGCTTACTAC ATCGAAAATG ACTGGATGCA TGCTTGTGAG
AAGTGCATGA GTCTTTTAGT AAACGAAAGC CATGGACATA CCCTGGTGAT TCATTCCAGG
GATGAAGAAG TAATAGGCCA GTTCGCCTTA AAGAAACCAG TAGGCAGAGT ACTTGTAAAT
ACCCCCGCTA CCCTGGGTAG TATGGGTGCA ACCACAAACT TGTTTCCGGC TATGACCCTA
GGAAGCATTA CAGCAGGCGC CGGAATCACA GCGGACAATG TTTCTCCTAT GAATTTCATA
TACATTCGTA AAGTAGGATA TGGAGTTCGG GGAGTACAAG AATTTCTTGG TTCGGTTGAG
AAAACCTCAA GCGGATACGC GAAAGCTCCT GAAACAATCA GGAACAATGC CCTTGAAACA
AACAAGGTCA ATGCCTTTGA AACAAGCAAA GGCATGGAAG ATGCTAGAGA TCTTTTGAAA
CAGATTTTAC AAGCCTTGTC CAAAGAACTG GATTAA
 
Protein sequence
MSFVDKDLLS IQEARILMES ARDARNTMLT FPQERLDLIV DVLAAAAKEI AEELAVMSAE 
ETGYGRFQDK YVKNRFVCDY LPKRLKAMKC VGFLNENTQL KTMDVGVPVG VLVALAPPVS
PVSTALYNVL VAVKSGNPIV IAPHERARKV TGKLLDRLLE AGKCYGLPEG AIGYLKTVTR
PGALELIHHP AAAMIINTGV PELSSEASKS GKPYIYGGTG NGPVFIERTA DVRKAVEDII
ASRTFDYGIV SAAEQYMVVD SLIAAEVKAE MLRNGAYFMN EEEEKKLIDL LNLTSGKADT
EIMGRPAEEL AKRAGFMVPN TTTVLVSEQK YISDRNPFAK ELLCPVLAYY IENDWMHACE
KCMSLLVNES HGHTLVIHSR DEEVIGQFAL KKPVGRVLVN TPATLGSMGA TTNLFPAMTL
GSITAGAGIT ADNVSPMNFI YIRKVGYGVR GVQEFLGSVE KTSSGYAKAP ETIRNNALET
NKVNAFETSK GMEDARDLLK QILQALSKEL D