Gene Information |
Locus tag | Gdia_2378 |
Symbol | |
ID | 6975808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 2634083 |
End bp | 2635738 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643391902 |
Product | Choline dehydrogenase |
Protein accession | YP_002276744 |
Protein GI | 209544515 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
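The coordinate and length fields above can be cross-checked against each other. The following Python sketch does so under two assumptions: that Start bp and End bp are 1-based inclusive coordinates, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for locus tag Gdia_2378.
START_BP, END_BP = 2634083, 2635738

gene_len = gene_length_bp(START_BP, END_BP)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)
```

Under these assumptions the computed values agree with the Gene Length (1656 bp) and Protein Length (551 aa) fields of the record.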
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.448643 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATATG ACTATGTGAT TGTGGGCGGA GGGCCGGCGG GCTGCGTTCT CGCCGCCCGC CTGAGCGAGG ACCCCCGGGT CCGGGTCCTC CTGCTCGAGG CCGGCGGAAG CGACCGGAAC ATGCTGTATC GCATCCCCGC CGGCTTCGCG AAAATGACCA AGGGCATCGG CAGCTGAGGG TGGGAGACCG TTCCCCAGAG GCACATGCAG GGCCGCGTGC TGCGCTATAC GCAGGCCATG GTGATCGGGG GCGGATCGTC GATCAACGCG CAGATCTACA CCCGTGGCAA CGCGGGCGAT TATGACGGCT GGGCACGGGA AAAGGGCTGC GAGGCCTGGG AATATCGTCG CGTCCTGCCT TATTTCAAAC GGGCGGAAAA CAACCAGCGC TTCCTCGACG ACTATCATGG TGCCGGGGGG CCGCTGGGTG TGTCGATGCC CGCGGCGCCC CTGCCGATCT GCGAGGCCTA TATCAAGGCC GCCCAGGAAC TTGGTATTCC CTACAACCAT GATTTCAATG GACCCCGTCA GGCCGGCATC GGGTTCTTCC AGCTGACGCA GCGCAATCAC GAACGGTCGT CGGCATCCCG TGCCTATCTC GGCGCGGCGC GGGGGCGGAA AAACCTGACC GTGCGGCTCA ATGCCCAGGT GCTGCGGGTC GTGGTCGAGA AGGGGCGGGC AATCGGGGTC GAGCTTTCGT TTTCCGGCCG GACGGGATTC GTCCGGGCGG AGCGCGAGGT CATTCTCTGC TCGGGGGCCA TAGGCTCGCC CAAGCTGCTG CTGCAATCGG GCATCGGCCC GGCCGACGAA CTGTGCGCCC TGGATATCCC CGTCATGCAC GATCTGCCGG GCGTGGGCCG CAACCTGCAG GACCATCTGG ATCTTTTCGT CATTGCCGAA TGTAGGGGCG ATTTCACCTA TGACGGTGTC GCGCGGCCGC ATCGGACGCT TGCCGCCGGC CTGCAATACC TGATCTACAG AAACGGCCCG GCAGCCTCGA GCCTTTTCGA GACGGGAGGG TTCTGGTACG TCGATCCCAG GGCCGCATAT CCGGATCTTC AGTTTCACCT GGGCCTGGGT TCGGGGATCG AGGCAGGCGT CGCGCGGCTT CGGAACGCGG GCGTGACCCT GAATACCGCC TATCTGCGCC CCCGGTCGCG CGGCACCGTG ACGCTGCGGT CCGCCGACCC GGCGGCCGCC CCGCTGATCG ATCCGAATTA TTTCAGCGAT CCGCATGATC GAACCATGTC GATCGAGGGC CTGAAGATCG CGCGCGAGAT CATCCTGCAG CCGGCGATGC AGGATTTCGT CCTGGCCGAG CGTCTGCCCG GTCCCGCCGT GCGCACCGAC GCCGAACTGT TCGATTACGC GTGCCGGAAC GCCAAGACCG ACCACCATCC GGTGGGGACG TGCCGGATGG GCGTCGGGGC GGATGCCGTG GTGGACCCGG AACTGCGCCT GCACGGCATT GCCGGGCTGC GCGTCTGCGA TGCGTCGGTG ATGCCGAAGA TACCCTCATG CAACACCAAC AGCCCGACCA TCATGGTGGG CGAGAAAGGT GCGGACATGA TCCTCGGCCG GCAGCCCCTG GCGCCGGCGA TCCTTGACGA CCAGCGCAAC GATATCCCGC AGCACGCGCG GCGCGAGGTC GCCTGA
|
Protein sequence | MAYDYVIVGG GPAGCVLAAR LSEDPRVRVL LLEAGGSDRN MLYRIPAGFA KMTKGIGSUG WETVPQRHMQ GRVLRYTQAM VIGGGSSINA QIYTRGNAGD YDGWAREKGC EAWEYRRVLP YFKRAENNQR FLDDYHGAGG PLGVSMPAAP LPICEAYIKA AQELGIPYNH DFNGPRQAGI GFFQLTQRNH ERSSASRAYL GAARGRKNLT VRLNAQVLRV VVEKGRAIGV ELSFSGRTGF VRAEREVILC SGAIGSPKLL LQSGIGPADE LCALDIPVMH DLPGVGRNLQ DHLDLFVIAE CRGDFTYDGV ARPHRTLAAG LQYLIYRNGP AASSLFETGG FWYVDPRAAY PDLQFHLGLG SGIEAGVARL RNAGVTLNTA YLRPRSRGTV TLRSADPAAA PLIDPNYFSD PHDRTMSIEG LKIAREIILQ PAMQDFVLAE RLPGPAVRTD AELFDYACRN AKTDHHPVGT CRMGVGADAV VDPELRLHGI AGLRVCDASV MPKIPSCNTN SPTIMVGEKG ADMILGRQPL APAILDDQRN DIPQHARREV A
|
| |
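The sequence fields are printed in blocks of ten characters. The Python sketch below shows one way such a field might be handled: it strips the spacing, computes a GC fraction, and translates codons with a lookup table built from the standard amino acid assignments, which translation table 11 shares with the standard code. Only short excerpts of the gene sequence are used, so the GC fraction of the excerpt is not the record's 66%. All helper names are illustrative. Note that the record's protein sequence shows U at the in-frame TGA codon (GGC AGC TGA GGG), whereas a plain table lookup like this one renders TGA as a stop (*).

```python
from itertools import product

BASES = "TCAG"
# Amino acids listed in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def ungroup(field: str) -> str:
    """Remove the ten-character block spacing used in the record."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons by table lookup; trailing bases are ignored."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence, copied from the record.
start_excerpt = ungroup("ATGGCATATG ACTATGTGAT TGTGGGCGGA")
# The four codons around the in-frame TGA, copied from the record.
tga_excerpt = "GGCAGCTGAGGG"

print(len(start_excerpt), gc_fraction(start_excerpt))
print(translate(start_excerpt))
print(translate(tga_excerpt))
```

The translated start excerpt can be compared with the first block of the protein sequence field, and the second excerpt shows where a plain lookup differs from the record at the TGA codon.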