Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0134 |
Symbol | gcd |
ID | 6146718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 148888 |
End bp | 151278 |
Gene Length | 2391 bp |
Protein Length | 796 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641615035 |
Product | quinoprotein glucose dehydrogenase |
Protein accession | YP_001742251 |
Protein GI | 170683523 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4993] Glucose dehydrogenase |
TIGRFAM ID | [TIGR03074] membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAATTA ACAATACAGG CTCGCGACGA TTACTCGTCA CGCTAACAGC CCTTTTTGCA GCGCTTTGCG GGCTGTATCT ACTCATTGGC GGAGGCTGGC TGGTCGCGAT TGGCGGCTCC TGGTACTACC CTATCGCTGG CCTTGTGATG CTCGGCGTCG CCTGGATGCT GTGGCGCAGT AAACGCGCCG CGCTTTGGCT GTACGCCGCC CTGCTGCTCG GCACCATGAT TTGGGGCGTC TGGGAAGTTG GTTTCGACTT CTGGGCGCTG ACTCCGCGCA GCGACATTCT GGTCTTCTTC GGCATCTGGC TGATCCTGCC GTTTGTCTGG CGTCGCCTGG TCATTCCTGC CAGCGGCGCA GTTGCCGCAC TGGTGGTGGC ATTGCTGATT AGTGGCGGAA TCCTCACCTG GGCGGGATTT AACGATCCGC AGGAAATCAA CGGCACCTTA AGCGCCGATG CCACACCTGC CGAAGCTATC TCACCGGTTG CCGATCAGGA CTGGCCTGCG TATGGTCGCA ATCAGGAAGG TCAACGCTTT TCACCACTGA AGCAAATTAA CGCCGATAAC GTTCACAACC TGAAAGAAGC CTGGGTGTTC CGTACAGGCG ATGTGAAACA GCCGAACGAT CCGGGCGAAA TCACCAATGA AGTGACGCCG ATTAAAGTGG GCGATACGCT TTACTTATGT ACCGCTCACC AGCGCCTGTT TGCGCTCGAT GCCGCCAGCG GCAAAGAGAA ATGGCATTAC GATCCTGAAC TGAAAACCAA CGAGTCCTTC CAGCACGTAA CCTGTCGTGG CGTCTCTTAT CATGAAGCCA AAGCAGAAAC GGCCTCGCCA GAAGTGATGG CGGATTGCCC GCGTCGTATC ATTCTTCCGG TCAATGATGG TCGCCTGATT GCGATTAACG CTGAAAACGG CAAGCTGTGC GAAACCTTCG CTAATAAAGG CGTGCTCAAT CTGCAAAGCA ATATGCCGGA CACCAAACCG GGTCTGTATG AGCCGACTTC GCCGCCAATT ATCACCGATA AAACCATCGT GATGGCCGGT TCAGTCACCG ATAACTTCTC AACCCGCGAA ACGTCTGGCG TGATCCGTGG TTTTGATGTC AACACCGGGG AGCTGCTGTG GGCTTTTGAT CCGGGCGCGA AAGATCCGAA CGCAATCCCG TCTGACGAAC ACACCTTTAC CTTTAACTCA CCAAACTCGT GGGCACCAGC GGCCTATGAC GCGAAGCTGG ATCTGGTGTA TCTGCCGATG GGCGTGACCA CGCCGGATAT CTGGGGCGGT AACCGTACAC CGGAACAGGA ACGTTATGCC AGCTCGATTC TGGCGCTGAA TGCCACTACC GGGAAACTGG CGTGGAGCTA CCAGACCGTT CACCACGACC TGTGGGATAT GGATCTTCCG GCACAGCCGA CGCTGGCGGA CATCACCGTT AATGGTCAGA AAGTGCCAGT CATTTATGCT CCGGCGAAAA CCGGCAACAT TTTTGTGCTC GATCGTCGTA ATGGCGAACT GGTGGTTCCG GCACCGGAAA AACCGGTTCC CCAAGGTGCT GCCAAAGGCG ATTACGTAAC CCCAACTCAA CCGTTTTCTG AACTGAGCTT CCGTCCGACG AAAGATTTGA GCGGTGCGGA TATGTGGGGA GCCACCATGT TTGACCAGCT GGTGTGCCGC GTGATGTTCC ACCAGATGCG CTATGAAGGC ATTTTCACCC CGCCATCTGA ACAGGGTACG CTGGTCTTCC CGGGTAACCT GGGGATGTTC GAATGGGGCG GGATTTCCGT TGATCCGAAT CGTGAAGTGG CGATTGCCAA CCCAATGGCA CTGCCGTTTG TTTCGAAACT GATCCCACGC GGTCCTGGTA ACCCGATGGA ACAGCCGAAA GATGCCAAAG GCACGGGTAC GGAATCCGGC ATTCAGCCAC AGTACGGTGT ACCGTATGGT GTCACGCTCA ACCCGTTCCT CTCACCGTTT GGTCTGCCAT GTAAACAGCC AGCATGGGGC TATATCTCGG CATTGGATCT GAAAACCAAT GAAGTGGTGT GGAAGAAACG TATTGGTACG CCGCAGGACA GTATGCCATT CCCGATGCCT GTACCGGTGC CGTTCAATAT GGGTATGCCG ATGCTGGGCG GGCCAATCTC CACAGCGGGT AACGTACTGT TTATCGCCGC TACGGCAGAT AACTACCTGC GCGCTTACAA CATGAGCAAC GGTGAAAAAC TGTGGCAGGG CCGTTTACCA GCGGGTGGTC AGGCTACGCC AATGACCTAT GAAGTGAATG GCAAGCAGTA TGTGGTGATC TCCGCAGGCG GTCACGGTTC ATTTGGTACG AAGATGGGCG ACTATATTGT GGCTTATGCG CTGCCGGATG ATGTGAAGTA A
|
Protein sequence | MAINNTGSRR LLVTLTALFA ALCGLYLLIG GGWLVAIGGS WYYPIAGLVM LGVAWMLWRS KRAALWLYAA LLLGTMIWGV WEVGFDFWAL TPRSDILVFF GIWLILPFVW RRLVIPASGA VAALVVALLI SGGILTWAGF NDPQEINGTL SADATPAEAI SPVADQDWPA YGRNQEGQRF SPLKQINADN VHNLKEAWVF RTGDVKQPND PGEITNEVTP IKVGDTLYLC TAHQRLFALD AASGKEKWHY DPELKTNESF QHVTCRGVSY HEAKAETASP EVMADCPRRI ILPVNDGRLI AINAENGKLC ETFANKGVLN LQSNMPDTKP GLYEPTSPPI ITDKTIVMAG SVTDNFSTRE TSGVIRGFDV NTGELLWAFD PGAKDPNAIP SDEHTFTFNS PNSWAPAAYD AKLDLVYLPM GVTTPDIWGG NRTPEQERYA SSILALNATT GKLAWSYQTV HHDLWDMDLP AQPTLADITV NGQKVPVIYA PAKTGNIFVL DRRNGELVVP APEKPVPQGA AKGDYVTPTQ PFSELSFRPT KDLSGADMWG ATMFDQLVCR VMFHQMRYEG IFTPPSEQGT LVFPGNLGMF EWGGISVDPN REVAIANPMA LPFVSKLIPR GPGNPMEQPK DAKGTGTESG IQPQYGVPYG VTLNPFLSPF GLPCKQPAWG YISALDLKTN EVVWKKRIGT PQDSMPFPMP VPVPFNMGMP MLGGPISTAG NVLFIAATAD NYLRAYNMSN GEKLWQGRLP AGGQATPMTY EVNGKQYVVI SAGGHGSFGT KMGDYIVAYA LPDDVK
|
| |