Gene EcSMS35_0134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0134 
Symbolgcd 
ID6146718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp148888 
End bp151278 
Gene Length2391 bp 
Protein Length796 aa 
Translation table11 
GC content55% 
IMG OID641615035 
Productquinoprotein glucose dehydrogenase 
Protein accessionYP_001742251 
Protein GI170683523 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4993] Glucose dehydrogenase 
TIGRFAM ID[TIGR03074] membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATTA ACAATACAGG CTCGCGACGA TTACTCGTCA CGCTAACAGC CCTTTTTGCA 
GCGCTTTGCG GGCTGTATCT ACTCATTGGC GGAGGCTGGC TGGTCGCGAT TGGCGGCTCC
TGGTACTACC CTATCGCTGG CCTTGTGATG CTCGGCGTCG CCTGGATGCT GTGGCGCAGT
AAACGCGCCG CGCTTTGGCT GTACGCCGCC CTGCTGCTCG GCACCATGAT TTGGGGCGTC
TGGGAAGTTG GTTTCGACTT CTGGGCGCTG ACTCCGCGCA GCGACATTCT GGTCTTCTTC
GGCATCTGGC TGATCCTGCC GTTTGTCTGG CGTCGCCTGG TCATTCCTGC CAGCGGCGCA
GTTGCCGCAC TGGTGGTGGC ATTGCTGATT AGTGGCGGAA TCCTCACCTG GGCGGGATTT
AACGATCCGC AGGAAATCAA CGGCACCTTA AGCGCCGATG CCACACCTGC CGAAGCTATC
TCACCGGTTG CCGATCAGGA CTGGCCTGCG TATGGTCGCA ATCAGGAAGG TCAACGCTTT
TCACCACTGA AGCAAATTAA CGCCGATAAC GTTCACAACC TGAAAGAAGC CTGGGTGTTC
CGTACAGGCG ATGTGAAACA GCCGAACGAT CCGGGCGAAA TCACCAATGA AGTGACGCCG
ATTAAAGTGG GCGATACGCT TTACTTATGT ACCGCTCACC AGCGCCTGTT TGCGCTCGAT
GCCGCCAGCG GCAAAGAGAA ATGGCATTAC GATCCTGAAC TGAAAACCAA CGAGTCCTTC
CAGCACGTAA CCTGTCGTGG CGTCTCTTAT CATGAAGCCA AAGCAGAAAC GGCCTCGCCA
GAAGTGATGG CGGATTGCCC GCGTCGTATC ATTCTTCCGG TCAATGATGG TCGCCTGATT
GCGATTAACG CTGAAAACGG CAAGCTGTGC GAAACCTTCG CTAATAAAGG CGTGCTCAAT
CTGCAAAGCA ATATGCCGGA CACCAAACCG GGTCTGTATG AGCCGACTTC GCCGCCAATT
ATCACCGATA AAACCATCGT GATGGCCGGT TCAGTCACCG ATAACTTCTC AACCCGCGAA
ACGTCTGGCG TGATCCGTGG TTTTGATGTC AACACCGGGG AGCTGCTGTG GGCTTTTGAT
CCGGGCGCGA AAGATCCGAA CGCAATCCCG TCTGACGAAC ACACCTTTAC CTTTAACTCA
CCAAACTCGT GGGCACCAGC GGCCTATGAC GCGAAGCTGG ATCTGGTGTA TCTGCCGATG
GGCGTGACCA CGCCGGATAT CTGGGGCGGT AACCGTACAC CGGAACAGGA ACGTTATGCC
AGCTCGATTC TGGCGCTGAA TGCCACTACC GGGAAACTGG CGTGGAGCTA CCAGACCGTT
CACCACGACC TGTGGGATAT GGATCTTCCG GCACAGCCGA CGCTGGCGGA CATCACCGTT
AATGGTCAGA AAGTGCCAGT CATTTATGCT CCGGCGAAAA CCGGCAACAT TTTTGTGCTC
GATCGTCGTA ATGGCGAACT GGTGGTTCCG GCACCGGAAA AACCGGTTCC CCAAGGTGCT
GCCAAAGGCG ATTACGTAAC CCCAACTCAA CCGTTTTCTG AACTGAGCTT CCGTCCGACG
AAAGATTTGA GCGGTGCGGA TATGTGGGGA GCCACCATGT TTGACCAGCT GGTGTGCCGC
GTGATGTTCC ACCAGATGCG CTATGAAGGC ATTTTCACCC CGCCATCTGA ACAGGGTACG
CTGGTCTTCC CGGGTAACCT GGGGATGTTC GAATGGGGCG GGATTTCCGT TGATCCGAAT
CGTGAAGTGG CGATTGCCAA CCCAATGGCA CTGCCGTTTG TTTCGAAACT GATCCCACGC
GGTCCTGGTA ACCCGATGGA ACAGCCGAAA GATGCCAAAG GCACGGGTAC GGAATCCGGC
ATTCAGCCAC AGTACGGTGT ACCGTATGGT GTCACGCTCA ACCCGTTCCT CTCACCGTTT
GGTCTGCCAT GTAAACAGCC AGCATGGGGC TATATCTCGG CATTGGATCT GAAAACCAAT
GAAGTGGTGT GGAAGAAACG TATTGGTACG CCGCAGGACA GTATGCCATT CCCGATGCCT
GTACCGGTGC CGTTCAATAT GGGTATGCCG ATGCTGGGCG GGCCAATCTC CACAGCGGGT
AACGTACTGT TTATCGCCGC TACGGCAGAT AACTACCTGC GCGCTTACAA CATGAGCAAC
GGTGAAAAAC TGTGGCAGGG CCGTTTACCA GCGGGTGGTC AGGCTACGCC AATGACCTAT
GAAGTGAATG GCAAGCAGTA TGTGGTGATC TCCGCAGGCG GTCACGGTTC ATTTGGTACG
AAGATGGGCG ACTATATTGT GGCTTATGCG CTGCCGGATG ATGTGAAGTA A
 
Protein sequence
MAINNTGSRR LLVTLTALFA ALCGLYLLIG GGWLVAIGGS WYYPIAGLVM LGVAWMLWRS 
KRAALWLYAA LLLGTMIWGV WEVGFDFWAL TPRSDILVFF GIWLILPFVW RRLVIPASGA
VAALVVALLI SGGILTWAGF NDPQEINGTL SADATPAEAI SPVADQDWPA YGRNQEGQRF
SPLKQINADN VHNLKEAWVF RTGDVKQPND PGEITNEVTP IKVGDTLYLC TAHQRLFALD
AASGKEKWHY DPELKTNESF QHVTCRGVSY HEAKAETASP EVMADCPRRI ILPVNDGRLI
AINAENGKLC ETFANKGVLN LQSNMPDTKP GLYEPTSPPI ITDKTIVMAG SVTDNFSTRE
TSGVIRGFDV NTGELLWAFD PGAKDPNAIP SDEHTFTFNS PNSWAPAAYD AKLDLVYLPM
GVTTPDIWGG NRTPEQERYA SSILALNATT GKLAWSYQTV HHDLWDMDLP AQPTLADITV
NGQKVPVIYA PAKTGNIFVL DRRNGELVVP APEKPVPQGA AKGDYVTPTQ PFSELSFRPT
KDLSGADMWG ATMFDQLVCR VMFHQMRYEG IFTPPSEQGT LVFPGNLGMF EWGGISVDPN
REVAIANPMA LPFVSKLIPR GPGNPMEQPK DAKGTGTESG IQPQYGVPYG VTLNPFLSPF
GLPCKQPAWG YISALDLKTN EVVWKKRIGT PQDSMPFPMP VPVPFNMGMP MLGGPISTAG
NVLFIAATAD NYLRAYNMSN GEKLWQGRLP AGGQATPMTY EVNGKQYVVI SAGGHGSFGT
KMGDYIVAYA LPDDVK