Gene ECH74115_0132 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0132 
Symbolgcd 
ID6967745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp143188 
End bp145578 
Gene Length2391 bp 
Protein Length796 aa 
Translation table11 
GC content55% 
IMG OID643384209 
Productquinoprotein glucose dehydrogenase 
Protein accessionYP_002268732 
Protein GI209400840 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4993] Glucose dehydrogenase 
TIGRFAM ID[TIGR03074] membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.556262 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATTA ACAATACAGG CTCGCGACGA TTACTCGTCA CGCTAACAGC CCTTTTTGCA 
GCGCTTTGCG GGCTGTATCT ACTCATTGGC GGAGGCTGGC TGGTCGCGAT TGGCGGCTCC
TGGTACTACC CTATCGCTGG CCTTGTGATG CTCGGCGTCG CCTGGATGCT GTGGCGCAGT
AAACGTGCCG CGCTTTGGCT ATACGCCGCC CTGCTGCTCG GCACCATGAT TTGGGGCGTC
TGGGAAGTTG GTTTCGACTT CTGGGCGCTG ACTCCGCGCA GCGATATTCT GGTCTTCTTC
GGCATCTGGC TGATCCTGCC GTTTGTCTGG CGTCGCCTGG TCATTCCTGC CAGCGGCGCA
GTTGCCGCAC TGGTGGTGGC ACTGCTGATT AGCGGTAGTA TCCTGACCTG GGCCGGATTT
AACGATCCGC AGGAAATCAA CGGCACCTTA AGCGCCGATG CCACACCTGC GGAAGCTATC
TCACCCGTTG CCGATCAGGA CTGGCCTGCC TATGGTCGCA ATCAGGAAGG TCAACGCTTT
TCACCTCTGA AACAAATTAA CGCCGATAAC GTCCATAATC TGAAAGAAGC CTGGGTGTTC
CGTACTGGCG ATGTGAAGCA GCCGAACGAT CCGGGCGAAA TCACCAATGA AGTGACGCCG
ATTAAAGTGG GCGACACCCT TTACCTGTGT ACCGCTCACC AGCGCCTGTT TGCGCTCGAT
GCCGCCAGCG GCAAAGAGAA ATGGCATTAC GATCCTGAGC TGAAAACTAA CGAGTCTTTC
CAGCACGTAA CCTGCCGTGG CGTCTCTTAT CATGAAGCCA AAGCAGAAAC GGCTTCGCCG
GAAGTGATGG CGGATTGCCC GCGTCGTATC ATTCTTCCGG TCAATGATGG TCGCCTGATT
GCGATTAACG CTGAAAACGG CAAGCTGTGC GAAACCTTCG CCAATAAAGG CGTGCTCAAT
CTGCAAAGCA ATATGCCAGA CACCAAACCG GGTCTGTATG AACCGACTTC GCCACCGATT
ATCACCGATA AAACCATCGT GATGGCAGGT TCAGTTACCG ATAACTTCTC AACCCGCGAA
ACGTCTGGCG TGATCCGTGG TTTTGATGTC AACACCGGGG AGCTGCTGTG GGCTTTTGAT
CCGGGCGCGA AAGATCCGAA CGCAATCCCG TCTGACGAAC ACACCTTTAC CTTTAACTCG
CCAAACTCGT GGGCACCAGC GGCCTATGAC GCGAAGCTGG ATCTGGTGTA TCTGCCGATG
GGCGTGACCA CGCCGGATAT CTGGGGCGGT AACCGCACAC CGGAACAGGA ACGTTATGCC
AGCTCGATTC TGGCGCTGAA TGCCACAACC GGGAAATTGG CGTGGAGCTA CCAGACCGTT
CACCACGACC TGTGGGATAT GGATCTTCCG GCACAGCCGA CGCTGGCGGA CATCACCGTT
AATGGTCAGA AAGTGCCAGT TATTTACGCT CCGGCGAAAA CCGGCAACAT TTTTGTGCTC
GATCGTCGTA ATGGCGAACT GGTGGTTCCG GCACCGGAAA AACCGGTTCC CCAAGGTGCT
GCCAAAGGCG ATTACGTTAC CCCTACTCAA CCATTCTCGG AACTGAGCTT CCGTCCGACG
AAAGATTTGA GCGGTGCGGA TATGTGGGGA GCCACCATGT TTGACCAACT GGTGTGCCGC
GTGATGTTCC ACCAGATGCG CTATGAAGGC ATTTTCACGC CGCCATCTGA ACAGGGCACG
CTGGTCTTCC CGGGTAACCT GGGAATGTTT GAATGGGGCG GGATTTCCGT TGATCCAAAT
CGTGAAGTGG CAATTGCCAA CCCAATGGCA CTGCCGTTTG TTTCGAAACT GATCCCACGC
GGTCCGGGTA ACCCGATGGA GCAGCCGAAA GATGCCAAAG GCACGGGTAC GGAATCCGGC
ATTCAGCCAC AGTACGGTGT ACCGTATGGT GTCACGCTCA ACCCGTTCCT CTCACCATTT
GGTCTGCCAT GTAAACAGCC AGCATGGGGT TATATCTCGG CGCTGGATCT GAAAACTAAT
GAAGTGGTGT GGAAGAAACG TATTGGTACG CCGCAGGACA GTATGCCGTT CCCGATGCCT
GTACCGGTGC CGTTCAATAT GGGTATGCCG ATGCTGGGCG GGCCAATCTC CACGGCGGGT
AACGTGCTGT TTATCGCCGC TACGGCAGAT AACTACCTGC GCGCTTACAA CATGAGCAAC
GGTGAAAAAC TGTGGCAGGG TCGTTTACCA GCGGGTGGTC AGGCGACGCC AATGACCTAT
GAAGTGAATG GTAAGCAGTA TGTGGTGATC TCCGCAGGCG GTCACGGTTC ATTTGGTACG
AAGATGGGTG ACTATATTGT GGCTTATGCG CTGCCGGATG ATGTGAAGTA A
 
Protein sequence
MAINNTGSRR LLVTLTALFA ALCGLYLLIG GGWLVAIGGS WYYPIAGLVM LGVAWMLWRS 
KRAALWLYAA LLLGTMIWGV WEVGFDFWAL TPRSDILVFF GIWLILPFVW RRLVIPASGA
VAALVVALLI SGSILTWAGF NDPQEINGTL SADATPAEAI SPVADQDWPA YGRNQEGQRF
SPLKQINADN VHNLKEAWVF RTGDVKQPND PGEITNEVTP IKVGDTLYLC TAHQRLFALD
AASGKEKWHY DPELKTNESF QHVTCRGVSY HEAKAETASP EVMADCPRRI ILPVNDGRLI
AINAENGKLC ETFANKGVLN LQSNMPDTKP GLYEPTSPPI ITDKTIVMAG SVTDNFSTRE
TSGVIRGFDV NTGELLWAFD PGAKDPNAIP SDEHTFTFNS PNSWAPAAYD AKLDLVYLPM
GVTTPDIWGG NRTPEQERYA SSILALNATT GKLAWSYQTV HHDLWDMDLP AQPTLADITV
NGQKVPVIYA PAKTGNIFVL DRRNGELVVP APEKPVPQGA AKGDYVTPTQ PFSELSFRPT
KDLSGADMWG ATMFDQLVCR VMFHQMRYEG IFTPPSEQGT LVFPGNLGMF EWGGISVDPN
REVAIANPMA LPFVSKLIPR GPGNPMEQPK DAKGTGTESG IQPQYGVPYG VTLNPFLSPF
GLPCKQPAWG YISALDLKTN EVVWKKRIGT PQDSMPFPMP VPVPFNMGMP MLGGPISTAG
NVLFIAATAD NYLRAYNMSN GEKLWQGRLP AGGQATPMTY EVNGKQYVVI SAGGHGSFGT
KMGDYIVAYA LPDDVK