Gene Gdia_2378 details

Gene Information

Locus tag: Gdia_2378
Symbol:
ID: 6975808
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2634083
End bp: 2635738
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 66%
IMG OID: 643391902
Product: Choline dehydrogenase
Protein accession: YP_002276744
Protein GI: 209544515
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
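
The start, end, gene length and protein length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon; both are assumptions, not something the record states. The function names are illustrative only.

```python
# Coordinates copied from the Start bp / End bp fields of the record.
START_BP = 2634083
END_BP = 2635738


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming one terminal stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1656 551
```

Under those assumptions the result matches the listed Gene Length (1656 bp) and Protein Length (551 aa).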


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.448643
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCATATG ACTATGTGAT TGTGGGCGGA GGGCCGGCGG GCTGCGTTCT CGCCGCCCGC 
CTGAGCGAGG ACCCCCGGGT CCGGGTCCTC CTGCTCGAGG CCGGCGGAAG CGACCGGAAC
ATGCTGTATC GCATCCCCGC CGGCTTCGCG AAAATGACCA AGGGCATCGG CAGCTGAGGG
TGGGAGACCG TTCCCCAGAG GCACATGCAG GGCCGCGTGC TGCGCTATAC GCAGGCCATG
GTGATCGGGG GCGGATCGTC GATCAACGCG CAGATCTACA CCCGTGGCAA CGCGGGCGAT
TATGACGGCT GGGCACGGGA AAAGGGCTGC GAGGCCTGGG AATATCGTCG CGTCCTGCCT
TATTTCAAAC GGGCGGAAAA CAACCAGCGC TTCCTCGACG ACTATCATGG TGCCGGGGGG
CCGCTGGGTG TGTCGATGCC CGCGGCGCCC CTGCCGATCT GCGAGGCCTA TATCAAGGCC
GCCCAGGAAC TTGGTATTCC CTACAACCAT GATTTCAATG GACCCCGTCA GGCCGGCATC
GGGTTCTTCC AGCTGACGCA GCGCAATCAC GAACGGTCGT CGGCATCCCG TGCCTATCTC
GGCGCGGCGC GGGGGCGGAA AAACCTGACC GTGCGGCTCA ATGCCCAGGT GCTGCGGGTC
GTGGTCGAGA AGGGGCGGGC AATCGGGGTC GAGCTTTCGT TTTCCGGCCG GACGGGATTC
GTCCGGGCGG AGCGCGAGGT CATTCTCTGC TCGGGGGCCA TAGGCTCGCC CAAGCTGCTG
CTGCAATCGG GCATCGGCCC GGCCGACGAA CTGTGCGCCC TGGATATCCC CGTCATGCAC
GATCTGCCGG GCGTGGGCCG CAACCTGCAG GACCATCTGG ATCTTTTCGT CATTGCCGAA
TGTAGGGGCG ATTTCACCTA TGACGGTGTC GCGCGGCCGC ATCGGACGCT TGCCGCCGGC
CTGCAATACC TGATCTACAG AAACGGCCCG GCAGCCTCGA GCCTTTTCGA GACGGGAGGG
TTCTGGTACG TCGATCCCAG GGCCGCATAT CCGGATCTTC AGTTTCACCT GGGCCTGGGT
TCGGGGATCG AGGCAGGCGT CGCGCGGCTT CGGAACGCGG GCGTGACCCT GAATACCGCC
TATCTGCGCC CCCGGTCGCG CGGCACCGTG ACGCTGCGGT CCGCCGACCC GGCGGCCGCC
CCGCTGATCG ATCCGAATTA TTTCAGCGAT CCGCATGATC GAACCATGTC GATCGAGGGC
CTGAAGATCG CGCGCGAGAT CATCCTGCAG CCGGCGATGC AGGATTTCGT CCTGGCCGAG
CGTCTGCCCG GTCCCGCCGT GCGCACCGAC GCCGAACTGT TCGATTACGC GTGCCGGAAC
GCCAAGACCG ACCACCATCC GGTGGGGACG TGCCGGATGG GCGTCGGGGC GGATGCCGTG
GTGGACCCGG AACTGCGCCT GCACGGCATT GCCGGGCTGC GCGTCTGCGA TGCGTCGGTG
ATGCCGAAGA TACCCTCATG CAACACCAAC AGCCCGACCA TCATGGTGGG CGAGAAAGGT
GCGGACATGA TCCTCGGCCG GCAGCCCCTG GCGCCGGCGA TCCTTGACGA CCAGCGCAAC
GATATCCCGC AGCACGCGCG GCGCGAGGTC GCCTGA
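
The record lists a GC content of 66%. A minimal sketch of how such a figure could be recomputed from the gene sequence as printed (blocks of ten bases separated by spaces) follows. The helper names are hypothetical, and the record does not say how its own value was calculated or rounded.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGGCATATG ACTATGTGAT TGTGGGCGGA GGGCCGGCGG GCTGCGTTCT CGCCGCCCGC"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence text to `gc_percent` gives the whole-gene value to compare against the listed figure.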
 
Protein sequence
MAYDYVIVGG GPAGCVLAAR LSEDPRVRVL LLEAGGSDRN MLYRIPAGFA KMTKGIGSUG 
WETVPQRHMQ GRVLRYTQAM VIGGGSSINA QIYTRGNAGD YDGWAREKGC EAWEYRRVLP
YFKRAENNQR FLDDYHGAGG PLGVSMPAAP LPICEAYIKA AQELGIPYNH DFNGPRQAGI
GFFQLTQRNH ERSSASRAYL GAARGRKNLT VRLNAQVLRV VVEKGRAIGV ELSFSGRTGF
VRAEREVILC SGAIGSPKLL LQSGIGPADE LCALDIPVMH DLPGVGRNLQ DHLDLFVIAE
CRGDFTYDGV ARPHRTLAAG LQYLIYRNGP AASSLFETGG FWYVDPRAAY PDLQFHLGLG
SGIEAGVARL RNAGVTLNTA YLRPRSRGTV TLRSADPAAA PLIDPNYFSD PHDRTMSIEG
LKIAREIILQ PAMQDFVLAE RLPGPAVRTD AELFDYACRN AKTDHHPVGT CRMGVGADAV
VDPELRLHGI AGLRVCDASV MPKIPSCNTN SPTIMVGEKG ADMILGRQPL APAILDDQRN
DIPQHARREV A
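
The protein sequence shows U, the one-letter code for selenocysteine, at position 59, and the corresponding codon in the gene sequence is TGA, which is normally a stop codon under translation table 11. The sketch below locates in-frame stop codons in the first 180 bases, copied from the gene sequence above. The function name is illustrative, and the record itself does not say whether the U reflects a real selenocysteine or an annotation artifact.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}

# First three lines of the gene sequence, copied as displayed.
FIRST_180_BASES = """
ATGGCATATG ACTATGTGAT TGTGGGCGGA GGGCCGGCGG GCTGCGTTCT CGCCGCCCGC
CTGAGCGAGG ACCCCCGGGT CCGGGTCCTC CTGCTCGAGG CCGGCGGAAG CGACCGGAAC
ATGCTGTATC GCATCCCCGC CGGCTTCGCG AAAATGACCA AGGGCATCGG CAGCTGAGGG
"""


def in_frame_stops(text: str) -> list[tuple[int, str]]:
    """Return (1-based codon number, codon) for each stop codon in frame 1."""
    seq = "".join(text.split()).upper()
    hits = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if codon in STOP_CODONS:
            hits.append((i // 3 + 1, codon))
    return hits


print(in_frame_stops(FIRST_180_BASES))  # [(59, 'TGA')]
```

Run on the full gene sequence, the same function would also report the terminal stop codon at the end of the reading frame.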