Gene Cphamn1_1084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1084 
Symbol 
ID6374758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1172611 
End bp1173639 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content54% 
IMG OID642683585 
Productzinc-binding alcohol dehydrogenase family protein 
Protein accessionYP_001959503 
Protein GI189500033 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID[TIGR02822] zinc-binding alcohol dehydrogenase family protein 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.31 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.438887 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCGC AGGTTATTGA AAGGATTACA GACTTGTCAG AGAGCTCTGA ACCGCTGAGG 
ATGGTTGAGA TGCCGGTTCC TGAGCCTGCT GCTGGTGAGG TGCTGCTGAA GGTGCTGACC
TGCGGGGTTT GTCATACGGA GCTGGATGAG ATAGAAGGGA GAACCCCGCC TGCTTTTTTT
CCGATCGTTC CGGGCCATCA GGTTGTGGGA GAGGTTGTCG CTCAAGGAGC AGGGGTAAGC
CAACCCGAAA TCGGGAGCAG GGTAGGGGTA GCCTGGATAT ATTCCGCCTG CGGGAAATGT
GAACTGTGCC TCGACGGTAA AGAGAATCTG TGCCTGGAGT TTCGTGCCAC CGGACGGGAC
GCTCATGGGG GGTATGCGGA ATATATGACT GTTCCCGTTT CTTCTGCCTA TTCACTTCCT
GATCTCTTTT CCGATGCTGA AGCCGCGCCT CTTCTGTGTG CGGGTGCTGT CGGGTATCGG
TCACTGAAGC TGCTGAATCT GCAAAACGGC CAGCCTGCGG GGTTGACAGG TTTCGGGGCT
TCAGCGCATC TTGTTTTGAA ATTGATGCGG TTTCTCTACC CTGATTCGCC GGTTCATGTT
TTTGCCCGAA ACCTGCAAGA GCGTGAATTC TCCCTTGCTC TCGGAGCAGT CTGGGCTGGA
GATACAACCG ATTCATCTCC GGAACTCCTT GCCGGTATCA TCGACACCAC GCCGGTCTGG
CTGCCCGTCC TGTCCGCACT TGAGAATCTC AGACCATCAG GCCGTCTGGT CATCAATGCG
ATCCGCAAAG AAGCGTCGGA TACAGATGTG CTTACGCAGC TCGATTATGC GAAGCATCTC
TGGATGGAGA AGGAGATCAA AAGCGTGGCC AACGTTGCCG CTGAGGATGT CAGGCAGTTC
CTGAAGATTG CTGCATCCAT GCACATGAAG CCGGAAGTGC AGATCTATTC TTTTGAAGAG
GCGAACAGAG CCCTTATTGA CATAAAGCAG CGCCGGATCA GGGGCGCGAA AGTGCTTCAG
ATTGCCTGA
 
Protein sequence
MKAQVIERIT DLSESSEPLR MVEMPVPEPA AGEVLLKVLT CGVCHTELDE IEGRTPPAFF 
PIVPGHQVVG EVVAQGAGVS QPEIGSRVGV AWIYSACGKC ELCLDGKENL CLEFRATGRD
AHGGYAEYMT VPVSSAYSLP DLFSDAEAAP LLCAGAVGYR SLKLLNLQNG QPAGLTGFGA
SAHLVLKLMR FLYPDSPVHV FARNLQEREF SLALGAVWAG DTTDSSPELL AGIIDTTPVW
LPVLSALENL RPSGRLVINA IRKEASDTDV LTQLDYAKHL WMEKEIKSVA NVAAEDVRQF
LKIAASMHMK PEVQIYSFEE ANRALIDIKQ RRIRGAKVLQ IA