Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0128 |
Symbol | gcd |
ID | 5592492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 140319 |
End bp | 142709 |
Gene Length | 2391 bp |
Protein Length | 796 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640919315 |
Product | glucose dehydrogenase |
Protein accession | YP_001456910 |
Protein GI | 157159592 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4993] Glucose dehydrogenase |
TIGRFAM ID | [TIGR03074] membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.126967 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATTA ACAATACAGG CTCGCGACGA TTACTCGTCA CGCTAACAGC CCTTTTTGCA GCGCTTTGCG GGCTGTATCT ACTCATTGGC GGAGGCTGGC TGGTCGCGAT TGGCGGCTCC TGGTACTACC CTATCGCTGG TCTTGTGATG CTCGGCGTCG CCTGGATGCT GTGGCGCAGT AAACGTGCCG CGCTTTGGCT GTACGCAGCC CTGCTGCTCG GCACCATGAT TTGGGGCGTC TGGGAAGTTG GTTTCGACTT CTGGGCGCTG ACTCCGCGCA GCGACATTCT GGTCTTCTTC GGCATCTGGC TGATCCTGCC GTTTGTCTGG CGTCGCCTGG TCATTCCTGC CAGCGGCGCA GTTGCCGCAC TGGTGGTGGC ACTGCTGATT AGCGGTGGTA TCCTGACCTG GGCCGGATTT AACGATCCGC AGGAAATCAA CGGCACCTTA AGCGCCAATG CCACACCTGC GGAAGCTATC TCACCCGTTG CCGATCAGGA CTGGCCTGCC TATGGTCGCA ATCAGGAAGG TCAACGCTTT TCACCTCTGA AACAAATTAA CGCCGATAAC GTCCATAATC TGAAAGAAGC CTGGGTGTTC CGTACTGGCG ATGTGAAGCA GCCGAACGAT CCGGGCGAAA TCACCAATGA AGTGACGCCG ATTAAAGTGG GCGACACCCT TTACCTGTGT ACCGCTCACC AGCGCCTGTT TGCGCTCGAT GCCGCCAGCG GCAAAGAGAA ATGGCATTAC GATCCTGAGC TGAAAACTAA CGAGTCTTTC CAGCACGTAA CCTGCCGTGG CGTCTCTTAT CATGAAGCCA AAGCAGAAAC GGCTTCGCCG GAAGTGATGG CGGATTGCCC GCGTCGTATC ATTCTTCCGG TCAATGATGG TCGACTGATT GCGATTAACG CTGAAAACGG CAAACTGTGC GAAACCTTCG CCAATAAAGG CGTGCTCAAT CTGCAAAGCA ATATGCCAGA CACCAAACCG GGTCTGTATG AACCGACTTC GCCACCGATT ATCACCGATA AAACCATCGT GATGGCCGGT TCAGTTACCG ATAACTTCTC AACCCGCGAA ACGTCTGGCG TGATCCGTGG TTTTGATGTC AACACCGGGG AGCTGCTGTG GGCTTTTGAT CCCGGCGCGA AAGATCCGAA CGCAATCCCG TCTGACGAAC ACACCTTTAC CTTTAACTCG CCAAACTCCT GGGCACCAGC GGCCTATGAC GCGAAGCTGG ATCTGGTCTA TCTGCCGATG GGCGTGACCA CGCCGGATAT CTGGGGCGGT AACCGCACAC CGGAACAGGA ACGTTATGCC AGCTCGATTC TGGCGCTGAA TGCCACTACC GGGAAACTGG CGTGGAGCTA CCAGACCGTT CACCACGACC TGTGGGACAT GGATCTTCCG GCACAGCCGA CGCTGGCGGA CATCACCGTT AATGGTCAGA AAGTGCCAGT TATTTACGCT CCGGCGAAAA CCGGCAACAT TTTTGTGCTC GATCGTCGCA ATGGCGAACT GGTGGTTCCG GCCCCGGAAA AACCGGTGCC GCAAGGTGCT GCGAAAGGCG ATTACGTTAC TCCGACCCAG CCGTTCTCTG AACTGAGCTT CCGTCCGAAG AAAGATTTGA GCGGTGCGGA TATGTGGGGA GCCACCATGT TTGACCAACT GGTGTGCCGC GTGATGTTCC ACCAGATGCG CTATGAAGGC ATTTTCACGC CGCCATCTGA ACAGGGCACG CTGGTCTTCC CGGGTAACCT GGGAATGTTT GAATGGGGCG GGATTTCCGT TGATCCAAAT CGTGAAGTGG CAATTGCCAA CCCAATGGCA CTGCCGTTTG TTTCGAAACT GATCCCACGC GGTCCGGGTA ACCCGATGGA GCAGCCGAAA GATGCCAAAG GCACGGGTAC GGAATCCGGC ATTCAGCCAC AGTACGGTGT ACCGTATGGT GTCACGCTCA ACCCGTTCCT CTCACCGTTT GGTCTGCCGT GTAAACAGCC AGCATGGGGT TATATCTCGG CGCTGGATCT GAAAACCAAT GAAGTGGTGT GGAAGAAACG TATTGGTACG CCGCAAGACA GTATGCCGTT CCCGATGCCT GTACCGGTGC CGTTCAATAT GGGTATGCCT ATGCTGGGCG GGCCAATCTC CACGGCGGGT AACGTGCTGT TTATCGCTGC TACGGCAGAT AACTACCTGC GCGCTTACAA CATGAGCAAC GGTGAAAAAC TGTGGCAGGG CCGTTTACCA GCGGGTGGTC AGGCGACGCC AATGACCTAT GAAGTGAATG GCAAGCAGTA TGTGGTGATC TCCGCAGGCG GTCACGGTTC ATTTGGTACG AAGATGGGCG ACTATATTGT GGCTTATGCG CTGCCGGATG ATGTGAAATA A
|
Protein sequence | MAINNTGSRR LLVTLTALFA ALCGLYLLIG GGWLVAIGGS WYYPIAGLVM LGVAWMLWRS KRAALWLYAA LLLGTMIWGV WEVGFDFWAL TPRSDILVFF GIWLILPFVW RRLVIPASGA VAALVVALLI SGGILTWAGF NDPQEINGTL SANATPAEAI SPVADQDWPA YGRNQEGQRF SPLKQINADN VHNLKEAWVF RTGDVKQPND PGEITNEVTP IKVGDTLYLC TAHQRLFALD AASGKEKWHY DPELKTNESF QHVTCRGVSY HEAKAETASP EVMADCPRRI ILPVNDGRLI AINAENGKLC ETFANKGVLN LQSNMPDTKP GLYEPTSPPI ITDKTIVMAG SVTDNFSTRE TSGVIRGFDV NTGELLWAFD PGAKDPNAIP SDEHTFTFNS PNSWAPAAYD AKLDLVYLPM GVTTPDIWGG NRTPEQERYA SSILALNATT GKLAWSYQTV HHDLWDMDLP AQPTLADITV NGQKVPVIYA PAKTGNIFVL DRRNGELVVP APEKPVPQGA AKGDYVTPTQ PFSELSFRPK KDLSGADMWG ATMFDQLVCR VMFHQMRYEG IFTPPSEQGT LVFPGNLGMF EWGGISVDPN REVAIANPMA LPFVSKLIPR GPGNPMEQPK DAKGTGTESG IQPQYGVPYG VTLNPFLSPF GLPCKQPAWG YISALDLKTN EVVWKKRIGT PQDSMPFPMP VPVPFNMGMP MLGGPISTAG NVLFIAATAD NYLRAYNMSN GEKLWQGRLP AGGQATPMTY EVNGKQYVVI SAGGHGSFGT KMGDYIVAYA LPDDVK
|
| |