Gene Bind_2341 details


Gene Information

Locus tag: Bind_2341
Symbol:
ID: 6200120
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 2678198
End bp: 2679244
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 60%
IMG OID: 641706326
Product: alcohol dehydrogenase
Protein accession: YP_001833443
Protein GI: 182679297
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
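
The start, end, gene length and protein length fields can be cross-checked against each other. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (the record does not say so, but the listed numbers are consistent with that convention) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(2678198, 2679244)
print(length)                  # 1047
print(protein_length(length))  # 348
```

Both results agree with the Gene Length and Protein Length fields listed above.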


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.231034
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGCTC TCGTTCTCGA GCGCAAGGGC GAATTGTCTC TGCGCGATGT GGACCTGCCC 
TTGGCGGTCG GGCCGGGCCA AGTGAAAATC GCGGTCCATA CGGTGGGTAT CTGCGGCAGC
GATGTGCATT ATTTCACGCA TGGCAGTATT GGGCCCTATA TCGTTGAAAA ACCGATGGTC
CTTGGGCATG AGGCGACCGG CACGATCGTC GAAGTCGGAC CGAATGTCAG CACTTTAAAG
GTCGGAGACC GGGTCTGTAT GGAACCTGGG GTGCCCGACA TGTCGTCCCG GGCGTCCAAG
CTTGGTCTCT ATAATGTAGA CCCATCCGTG ACCTTCTGGG CGACGCCTCC CGTCCATGGT
GTGCTGACAC CTTATGTCGT TCACCCTGCC GCCTTCACCT ATAAGTTGCC CGCCAATGTA
TCCTTCGCGG AAGGCGCTTT GGTCGAGCCC TTCGCGATTG GCATGCAGGC GGCAACGCGG
GCGCGGATCG CTCCGGGCGA TGTTGCAGCA GTTATCGGTG CAGGTACGAT TGGCATCATG
ACGGCTCTCG CGGCGGTGGC TGGCGGCTGC TCGCGCGTCT TCATCTCCGA TTTCAGCAAG
GAGAAACTGG CGATCGCTGG GGGCTATGAT TGCATTGTTC CCGTCAATGC CGGTGAGGAA
TCGCTGGCCG ACGTCGTCGC CAGGGAGACG GAGAACTGGG GGGCCGACGT TGTGTTCGAG
GCCAGCGGCA GCCCCAAGGC CTATGGCGAT CTCTTCCGGA TCGTCCGTCC GGGCGGCGCC
GTGGTGCTCG TTGGCCTGCC GGTGGAGCCC GTGGCCTTTG ATGTGTCGAG TGCCATTTCC
AAGGAAGTGC GGATCGAGAC AGTGTTTCGC TACGCCAATA TTTTCGATCG CGCCTTGGCC
CTGATCGCAT CCGGCAAGGT CAATCTGAAG CCTCTGATTA CAGGCACATT CCCCTTTTCG
GATAGTGTCG TTGCTTTTGA GCGGGCCGCT GCCGGCCGGC CGACGGATGT GAAGCTGCAG
ATCGAGGTCG TCAGCGAAAA CGCCTGA
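
The gene sequence is printed in space-separated blocks of ten bases, so any computation on it has to strip the whitespace first. As a rough sketch of how a GC content figure like the one in the gene information could be recomputed, the hypothetical helpers below clean a pasted sequence and return the fraction of G and C bases. How the database rounds its reported percentage is not stated in the record and is not assumed here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a pasted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence above: 10 of 20 bases are G or C.
print(gc_fraction("ATGCAAGCTC TCGTTCTCGA"))  # 0.5
```

Passing the full gene sequence as one multi-line string gives the value for the whole gene.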
 
Protein sequence
MQALVLERKG ELSLRDVDLP LAVGPGQVKI AVHTVGICGS DVHYFTHGSI GPYIVEKPMV 
LGHEATGTIV EVGPNVSTLK VGDRVCMEPG VPDMSSRASK LGLYNVDPSV TFWATPPVHG
VLTPYVVHPA AFTYKLPANV SFAEGALVEP FAIGMQAATR ARIAPGDVAA VIGAGTIGIM
TALAAVAGGC SRVFISDFSK EKLAIAGGYD CIVPVNAGEE SLADVVARET ENWGADVVFE
ASGSPKAYGD LFRIVRPGGA VVLVGLPVEP VAFDVSSAIS KEVRIETVFR YANIFDRALA
LIASGKVNLK PLITGTFPFS DSVVAFERAA AGRPTDVKLQ IEVVSENA
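
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons), and it assumes the sequence is given in the coding orientation and starts with ATG, as it does here. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Codon assignments in TCAG order; shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCAAGCTC TCGTTCTCGA GCGCAAGGGC"))  # MQALVLERKG
# Last six codons of the gene, ending in the TGA stop codon.
print(translate("GTCAGCGAAAACGCCTGA"))  # VSENA
```

Both fragments match the corresponding ends of the protein sequence listed above.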