Gene Gdia_1118 details


Gene Information

Locus tag: Gdia_1118
Symbol:
ID: 6974522
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1255759
End bp: 1256748
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 71%
IMG OID: 643390647
Product: zinc-binding alcohol dehydrogenase family protein
Protein accession: YP_002275516
Protein GI: 209543287
COG category: [R] General function prediction only
COG ID: [COG1064] Zn-dependent alcohol dehydrogenases
TIGRFAM ID: [TIGR02822] zinc-binding alcohol dehydrogenase family protein
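
The start, end, gene length and protein length fields can be cross-checked against each other. The sketch below assumes that the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Coordinates copied from the record above.
START_BP = 1255759
END_BP = 1256748


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 990
print(protein_length(length_bp))  # 329
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.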


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.439125
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCACG CGATGCGATT GAATGCCCCG CATACCGACC TGGAATGGGT GGAACTGCCC 
GACCGCCTGC CCGGCCCCGG CGAGATCCGG GTGCGCGTCG GGGCCTGCGG CGTGTGCCGC
ACCGACCTGC ACGTGGTGGA TGGCGACCTG CCCTTTCCCG GCCATCCGGT CATTCCGGGG
CACGAGATCG TGGGCCGGAT CGAGGCGCTG GGCGAGGGTG TGCAGGACCT GAAGATCGGC
CAGCGGGTCG GCGTGCCGTG GCTGGGCCAT ACCTGCGGCA TCTGCCGCTA CTGCCACAGC
GGGCATGAAA ACCTGTGCGA CCATCCGCTT TTCACCGGCT ACACCCGCGA CGGCGGCTAT
GCCACCGCCG CCATCGCCGA TGCCCGCTAT GCGTTTCCGC TGGGCGAGGA AGGCAGCGAC
GTGGACCTGG CCCCCCTGCT GTGCGCGGGG CTGATCGGCT GGCGGTCGCT GGTGATGGCG
GGCGAGGACG CGAAGACGGT GGGGCTGTAC GGCTTCGGCG CCGCCGCGCA CATCATCGCC
CAGGTGGCGC TGTGGCAGGG CCGCACCGTC TATGGCTTCA CCCGCCCGGG CGACCGCCCG
ACGCAGGATT TCGCCCGGTC GCTGGGCGCG ACCTGGGCCG GCGGATCGGA CGAGGCGCCG
CCGGAGAAGC TGGACGCCGC CATCATCTTC GCCCCCGTGG GCGCGCTGGT TCCGGCGGCC
CTGCGCGCGG TGCGCAAGGG CGGCCGCGTG GTCTGTGCCG GTATCCACAT GAGCGACATC
CCCAGCTTCC CCTACGATTT GTTTTGGGAG GAACGGCAAC TGGTTTCGGT CGCCAACCTG
ACACGGCAGG ACGGTATCGA TTTCCTCTCG CTGGCGCCCA GGATCGGCGT CCGCACCAAG
ACGACGCGCT ATGACCTGCG CGATGCCAAC CGCGCGCTGG CCGACCTGCG GGCCGGACGG
TTCGAGGGCG CGGCGGTGCT GGTGCCCTGA
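
The gene sequence is printed in blocks of 10 bases, so the spacing has to be stripped before any computation. A minimal sketch of how a GC fraction such as the one in the GC content field could be recomputed from the listing follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch assumes GC content is defined as the share of G and C among all bases.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage with the last line of the listing above; paste the full listing
# into `clean_sequence` to evaluate the whole gene.
last_line = clean_sequence("TTCGAGGGCG CGGCGGTGCT GGTGCCCTGA")
print(len(last_line))  # 30
print(f"{gc_fraction(last_line):.2f}")
```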
 
Protein sequence
MMHAMRLNAP HTDLEWVELP DRLPGPGEIR VRVGACGVCR TDLHVVDGDL PFPGHPVIPG 
HEIVGRIEAL GEGVQDLKIG QRVGVPWLGH TCGICRYCHS GHENLCDHPL FTGYTRDGGY
ATAAIADARY AFPLGEEGSD VDLAPLLCAG LIGWRSLVMA GEDAKTVGLY GFGAAAHIIA
QVALWQGRTV YGFTRPGDRP TQDFARSLGA TWAGGSDEAP PEKLDAAIIF APVGALVPAA
LRAVRKGGRV VCAGIHMSDI PSFPYDLFWE ERQLVSVANL TRQDGIDFLS LAPRIGVRTK
TTRYDLRDAN RALADLRAGR FEGAAVLVP
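
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the method used by the source database. It builds the codon table from the usual TCAG-ordered amino-acid string; translation table 11 uses the same codon-to-amino-acid assignments as the standard code for elongation and differs in which start codons are permitted, which this sketch does not model. The function name `translate` is hypothetical.

```python
from itertools import product

# Codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; stop codons are shown as '*'."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 21 bases and last 30 bases of the gene sequence in this record.
print(translate("ATGATGCACG CGATGCGATT G"))           # MMHAMRL
print(translate("TTCGAGGGCG CGGCGGTGCT GGTGCCCTGA"))  # FEGAAVLVP*
```

The two fragments reproduce the first seven and last nine residues of the protein sequence in this record, with the trailing `*` marking the TGA stop codon.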