Gene Cpha266_1004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1004 
Symbol 
ID4570926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1141515 
End bp1142666 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content51% 
IMG OID639765606 
Productiron-containing alcohol dehydrogenase 
Protein accessionYP_911475 
Protein GI119356831 
COG category[C] Energy production and conversion 
COG ID[COG1454] Alcohol dehydrogenase, class IV 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGCAA TTCATTTTGG CGCCGGTATG TTTGATCGCC TTGTCGATTT TTCCCTTCCT 
TATGGTCGAA ACGCTCTTCT TGTTACCGGC AGAGGTTCGC TGCAACGGCA GGGTTATCTC
CACTCTCTTT TTAATTCGTT CAGCAAGGCA GGTATACAGT ATGCCCATAT CATTGTTGAC
CATGAGCCAT CTCCCGAGCT GATAGATCAT GCAGTGTCCT GTTACAGGGA TCTTTCGATT
GATGTTGTTC TTGCCGTTGG CGGCGGAAGC GTGATCGATG CAGGCAAGGC TATTGCAGCA
ATGCTTCCCT CTGGTGATGC GGTTGAGCGA TATATTGAAG GTTTACCGGG CGCTGTTCCG
CACAATGGAA GAAAAGTGCC GTTTATTGCT GTTCCGACCA CCTCCGGCAC TGGCAGTGAG
GTGACCAATA ATGCTGTTAT CAGCCGAACC GGAAAAAACG GGTTCAAGCG TTCGCTTCGT
CACTCATCCT TTGTACCGGA TGTTGCCGTT GTTGATCCTT TGTTGATGTG TTCTGCTTCC
CGGGAACTTA CCGCCTCGTC AGGTATGGAT GCCTGCACCC AGCTTCTTGA AGCGTACGTG
TCGCCTTTTG CGACGCCGTA TACCGATCTT CTTGCCTTTC AGGGGCTTGA ATATTTTGCA
CGATCATTTC TCTCGGCCTG TTCTGACGGT GCCGATGATT CTGCGGTACG TGCCGATATG
GCATATGCTG CGCTTCTTTC GGGTGTTGTC CTTTCTAATG CCGGACTTGG TATTGTGCAC
GGGTTTGCCT CTTCTGTTGG CGGGGAGTAC GATATTCCCC ATGGAATGCT ATGCGCCACC
CTGCTTGCCG AAGCGACACG GACAAACATT CGCGAGTTGC GTGCCAGCCG AAACGAATAT
CCTGCTTTGA AAAAATATGC GCAGGCAGGA CAACTGTTTT CAGGCAGAGA TGCAACGGAT
TGTTCTGAAG GGTGCAATAT GCTGATTGAA AAGCTTGAGG AGTGGCAGCA GCAGCTTGCA
ATACCAAAGC TCAGCGCTTA TGGAATCGGT ATCTTCGACA CAGAGCGCCT TGCCTCTCGA
ACTCGCAGCA AGAGCAACCC TGTTGATCTC ACTCTTCAAA GTATGCAAGC TATTCTCGCT
GCGAGAGTTT AG
 
Protein sequence
MPAIHFGAGM FDRLVDFSLP YGRNALLVTG RGSLQRQGYL HSLFNSFSKA GIQYAHIIVD 
HEPSPELIDH AVSCYRDLSI DVVLAVGGGS VIDAGKAIAA MLPSGDAVER YIEGLPGAVP
HNGRKVPFIA VPTTSGTGSE VTNNAVISRT GKNGFKRSLR HSSFVPDVAV VDPLLMCSAS
RELTASSGMD ACTQLLEAYV SPFATPYTDL LAFQGLEYFA RSFLSACSDG ADDSAVRADM
AYAALLSGVV LSNAGLGIVH GFASSVGGEY DIPHGMLCAT LLAEATRTNI RELRASRNEY
PALKKYAQAG QLFSGRDATD CSEGCNMLIE KLEEWQQQLA IPKLSAYGIG IFDTERLASR
TRSKSNPVDL TLQSMQAILA ARV