Gene EcHS_A0128 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0128 
Symbolgcd 
ID5592492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp140319 
End bp142709 
Gene Length2391 bp 
Protein Length796 aa 
Translation table11 
GC content55% 
IMG OID640919315 
Productglucose dehydrogenase 
Protein accessionYP_001456910 
Protein GI157159592 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4993] Glucose dehydrogenase 
TIGRFAM ID[TIGR03074] membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.126967 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAATTA ACAATACAGG CTCGCGACGA TTACTCGTCA CGCTAACAGC CCTTTTTGCA 
GCGCTTTGCG GGCTGTATCT ACTCATTGGC GGAGGCTGGC TGGTCGCGAT TGGCGGCTCC
TGGTACTACC CTATCGCTGG TCTTGTGATG CTCGGCGTCG CCTGGATGCT GTGGCGCAGT
AAACGTGCCG CGCTTTGGCT GTACGCAGCC CTGCTGCTCG GCACCATGAT TTGGGGCGTC
TGGGAAGTTG GTTTCGACTT CTGGGCGCTG ACTCCGCGCA GCGACATTCT GGTCTTCTTC
GGCATCTGGC TGATCCTGCC GTTTGTCTGG CGTCGCCTGG TCATTCCTGC CAGCGGCGCA
GTTGCCGCAC TGGTGGTGGC ACTGCTGATT AGCGGTGGTA TCCTGACCTG GGCCGGATTT
AACGATCCGC AGGAAATCAA CGGCACCTTA AGCGCCAATG CCACACCTGC GGAAGCTATC
TCACCCGTTG CCGATCAGGA CTGGCCTGCC TATGGTCGCA ATCAGGAAGG TCAACGCTTT
TCACCTCTGA AACAAATTAA CGCCGATAAC GTCCATAATC TGAAAGAAGC CTGGGTGTTC
CGTACTGGCG ATGTGAAGCA GCCGAACGAT CCGGGCGAAA TCACCAATGA AGTGACGCCG
ATTAAAGTGG GCGACACCCT TTACCTGTGT ACCGCTCACC AGCGCCTGTT TGCGCTCGAT
GCCGCCAGCG GCAAAGAGAA ATGGCATTAC GATCCTGAGC TGAAAACTAA CGAGTCTTTC
CAGCACGTAA CCTGCCGTGG CGTCTCTTAT CATGAAGCCA AAGCAGAAAC GGCTTCGCCG
GAAGTGATGG CGGATTGCCC GCGTCGTATC ATTCTTCCGG TCAATGATGG TCGACTGATT
GCGATTAACG CTGAAAACGG CAAACTGTGC GAAACCTTCG CCAATAAAGG CGTGCTCAAT
CTGCAAAGCA ATATGCCAGA CACCAAACCG GGTCTGTATG AACCGACTTC GCCACCGATT
ATCACCGATA AAACCATCGT GATGGCCGGT TCAGTTACCG ATAACTTCTC AACCCGCGAA
ACGTCTGGCG TGATCCGTGG TTTTGATGTC AACACCGGGG AGCTGCTGTG GGCTTTTGAT
CCCGGCGCGA AAGATCCGAA CGCAATCCCG TCTGACGAAC ACACCTTTAC CTTTAACTCG
CCAAACTCCT GGGCACCAGC GGCCTATGAC GCGAAGCTGG ATCTGGTCTA TCTGCCGATG
GGCGTGACCA CGCCGGATAT CTGGGGCGGT AACCGCACAC CGGAACAGGA ACGTTATGCC
AGCTCGATTC TGGCGCTGAA TGCCACTACC GGGAAACTGG CGTGGAGCTA CCAGACCGTT
CACCACGACC TGTGGGACAT GGATCTTCCG GCACAGCCGA CGCTGGCGGA CATCACCGTT
AATGGTCAGA AAGTGCCAGT TATTTACGCT CCGGCGAAAA CCGGCAACAT TTTTGTGCTC
GATCGTCGCA ATGGCGAACT GGTGGTTCCG GCCCCGGAAA AACCGGTGCC GCAAGGTGCT
GCGAAAGGCG ATTACGTTAC TCCGACCCAG CCGTTCTCTG AACTGAGCTT CCGTCCGAAG
AAAGATTTGA GCGGTGCGGA TATGTGGGGA GCCACCATGT TTGACCAACT GGTGTGCCGC
GTGATGTTCC ACCAGATGCG CTATGAAGGC ATTTTCACGC CGCCATCTGA ACAGGGCACG
CTGGTCTTCC CGGGTAACCT GGGAATGTTT GAATGGGGCG GGATTTCCGT TGATCCAAAT
CGTGAAGTGG CAATTGCCAA CCCAATGGCA CTGCCGTTTG TTTCGAAACT GATCCCACGC
GGTCCGGGTA ACCCGATGGA GCAGCCGAAA GATGCCAAAG GCACGGGTAC GGAATCCGGC
ATTCAGCCAC AGTACGGTGT ACCGTATGGT GTCACGCTCA ACCCGTTCCT CTCACCGTTT
GGTCTGCCGT GTAAACAGCC AGCATGGGGT TATATCTCGG CGCTGGATCT GAAAACCAAT
GAAGTGGTGT GGAAGAAACG TATTGGTACG CCGCAAGACA GTATGCCGTT CCCGATGCCT
GTACCGGTGC CGTTCAATAT GGGTATGCCT ATGCTGGGCG GGCCAATCTC CACGGCGGGT
AACGTGCTGT TTATCGCTGC TACGGCAGAT AACTACCTGC GCGCTTACAA CATGAGCAAC
GGTGAAAAAC TGTGGCAGGG CCGTTTACCA GCGGGTGGTC AGGCGACGCC AATGACCTAT
GAAGTGAATG GCAAGCAGTA TGTGGTGATC TCCGCAGGCG GTCACGGTTC ATTTGGTACG
AAGATGGGCG ACTATATTGT GGCTTATGCG CTGCCGGATG ATGTGAAATA A
 
Protein sequence
MAINNTGSRR LLVTLTALFA ALCGLYLLIG GGWLVAIGGS WYYPIAGLVM LGVAWMLWRS 
KRAALWLYAA LLLGTMIWGV WEVGFDFWAL TPRSDILVFF GIWLILPFVW RRLVIPASGA
VAALVVALLI SGGILTWAGF NDPQEINGTL SANATPAEAI SPVADQDWPA YGRNQEGQRF
SPLKQINADN VHNLKEAWVF RTGDVKQPND PGEITNEVTP IKVGDTLYLC TAHQRLFALD
AASGKEKWHY DPELKTNESF QHVTCRGVSY HEAKAETASP EVMADCPRRI ILPVNDGRLI
AINAENGKLC ETFANKGVLN LQSNMPDTKP GLYEPTSPPI ITDKTIVMAG SVTDNFSTRE
TSGVIRGFDV NTGELLWAFD PGAKDPNAIP SDEHTFTFNS PNSWAPAAYD AKLDLVYLPM
GVTTPDIWGG NRTPEQERYA SSILALNATT GKLAWSYQTV HHDLWDMDLP AQPTLADITV
NGQKVPVIYA PAKTGNIFVL DRRNGELVVP APEKPVPQGA AKGDYVTPTQ PFSELSFRPK
KDLSGADMWG ATMFDQLVCR VMFHQMRYEG IFTPPSEQGT LVFPGNLGMF EWGGISVDPN
REVAIANPMA LPFVSKLIPR GPGNPMEQPK DAKGTGTESG IQPQYGVPYG VTLNPFLSPF
GLPCKQPAWG YISALDLKTN EVVWKKRIGT PQDSMPFPMP VPVPFNMGMP MLGGPISTAG
NVLFIAATAD NYLRAYNMSN GEKLWQGRLP AGGQATPMTY EVNGKQYVVI SAGGHGSFGT
KMGDYIVAYA LPDDVK