Gene Cphy_1029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_1029 
Symbol 
ID5741865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp1301846 
End bp1303036 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content39% 
IMG OID641292136 
Productiron-containing alcohol dehydrogenase 
Protein accessionYP_001558148 
Protein GI160879180 
COG category[C] Energy production and conversion 
COG ID[COG1454] Alcohol dehydrogenase, class IV 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000715077 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACGTT TTACACTACC AAGAGATTTA TATCATGGAA AGGGTTCTCT TGCGGAACTA 
AAAAATTTAA CAGGTAAAAA AGCAATTATC GTTGTTGGAG GCGGCTCCAT GAAACGTTTT
GGATTTTTGG ATAGAGCCAT TGATTACATA AAAGAAGCTG GTATGGAAGT CTCTTTGTTT
GAAAATGTAG AGCCAGACCC TAGTGTAGAA ACTGTAATGA AGGGTGCTGC TGCGATGAGA
GAATTCGAGC CGGATTGGAT TATATCCATG GGTGGCGGTT CTCCAATTGA TGCAGCAAAA
GCAATGTGGG CATTCTATGA ATATCCAGAC ACAACATTCG AAGATTTGAT TGTTCCATTT
AACTTCCCAA CCCTACGTAC AAAAGCAAAA TTCTGTGCTA TCCCATCTAC CTCTGGAACA
GCAACTGAAG TGACTGCTTT TAGCGTAATT ACAGACTATC ACAAGGGTAT TAAATATCCT
CTGGCAGACT TTAATATTAC ACCAGATGTT GCAATCGTAG ATCCTGATTT AGCAGAGACA
ATGCCTGCAA AACTCACCGC ACATACTGGC ATGGATGCTA TGACACACGC TGTGGAAGCA
TATGTTTCCA CACTACATTG CGATTATACC GATCCTCTTG CAATGCATGC TATCCGTATG
GTTCATGAAT ATTTAAAGTC TTCTTATGAT GGCAATATGG ATGCACGTGA TAAGATGCAC
AATGCACAAT GTTTAGCTGG TATGGCATTC TCCAACGCAT TACTTGGTAT TGTTCACTCC
ATGGCTCATA AAACCGGCGC TGCCTACTCA GGAGGTCATA TTGTTCATGG TTGTGCAAAT
GCAATGTATC TACCAAAAGT TATTAAATTT AATTCTAAAA ATGAAGATGC AGCGAAACGT
TACGCTGAAA TCGCAACTGC ACTTTTCTTA AAAGGCAATA CGACTACAGA ACTTGTAGAT
GCTCTAATTG AAGAATTAAA TCAGATGAAC CGCTCCTTGA ATATTCCAAG CTGTATCAAG
GAATATGAAA ATGGTATCAT CGATGAAAAA GAATTCTTAG AAAAATTACC TGAAGTCGCT
GCAAATGCTA TCTCTGATGC TTGTACTGGA TCAAATCCAA GAATCCCAAC ACAAGAAGAG
ATGGAGAAGT TATTAAAAGC ATGCTTCTAT AACGAAGAGA TTACTTTCTA A
 
Protein sequence
MARFTLPRDL YHGKGSLAEL KNLTGKKAII VVGGGSMKRF GFLDRAIDYI KEAGMEVSLF 
ENVEPDPSVE TVMKGAAAMR EFEPDWIISM GGGSPIDAAK AMWAFYEYPD TTFEDLIVPF
NFPTLRTKAK FCAIPSTSGT ATEVTAFSVI TDYHKGIKYP LADFNITPDV AIVDPDLAET
MPAKLTAHTG MDAMTHAVEA YVSTLHCDYT DPLAMHAIRM VHEYLKSSYD GNMDARDKMH
NAQCLAGMAF SNALLGIVHS MAHKTGAAYS GGHIVHGCAN AMYLPKVIKF NSKNEDAAKR
YAEIATALFL KGNTTTELVD ALIEELNQMN RSLNIPSCIK EYENGIIDEK EFLEKLPEVA
ANAISDACTG SNPRIPTQEE MEKLLKACFY NEEITF