Gene B21_00122 details


Gene Information

Locus tag: B21_00122
Symbol: gcd
ID: 8113360
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand
Start bp: 141640
End bp: 144030
Gene Length: 2391 bp
Protein Length: 796 aa
Translation table: 11
GC content: 55%
IMG OID: 644846415
Product: hypothetical protein
Protein accession: YP_002997988
Protein GI: 251783684
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4993] Glucose dehydrogenase
TIGRFAM ID: [TIGR03074] membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family
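
The length fields above can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes Start bp and End bp are 1-based inclusive positions (the listed Gene Length is consistent with that reading) and that the final codon is a stop codon that does not contribute an amino acid. The variable names are hypothetical and not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 141640
end_bp = 144030

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be a stop codon, which adds no residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 2391 bp
print(protein_length, "aa")  # 796 aa
```

Both values agree with the Gene Length and Protein Length fields listed in the record.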


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.793225
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAATTA ACAATACAGG CTCGCGACGA TTACTCGTCA CGCTAACAGC CCTTTTTGCA 
GCGCTTTGCG GGCTGTATCT ACTCATTGGC GGAGGCTGGC TGGTCGCGAT TGGCGGCTCC
TGGTACTACC CTATCGCTGG CCTTGTGATG CTCGGCGTCG CCTGGATGCT GTGGCGCAGT
AAACGTGCCG CGCTTTGGCT ATACGCCGCC CTGCTGCTCG GCACCATGAT TTGGGGCGTC
TGGGAAGTTG GTTTCGACTT CTGGGCGCTG ACTCCGCGCA GCGATATTCT GGTCTTCTTC
GGCATCTGGC TGATCCTGCC GTTTGTCTGG CGTCGCCTGG TCATTCCTGC CAGCGGCGCA
GTTGCCGCAC TGGTGGTGGC ACTGCTGATT AGCGGTAGTA TCCTGACCTG GGCCGGATTT
AACGATCCGC AGGAAATCAA CGGCACCTTA AGCGCCGATG CCACACCTGC GGAAGCTATC
TCACCCGTTG CCGATCAGGA CTGGCCTGCC TATGGTCGCA ATCAGGAAGG TCAACGCTTT
TCACCTCTGA AACAAATTAA CGCCGATAAC GTCCATAATC TGAAAGAAGC CTGGGTGTTC
CGTACTGGCG ATGTGAAGCA GCCGAACGAT CCGGGCGAAA TCACCAATGA AGTGACGCCG
ATTAAAGTGG GCGACACCCT TTACCTGTGT ACCGCTCACC AGCGCCTGTT TGCGCTCGAT
GCCGCCAGCG GCAAAGAGAA ATGGCATTAC GATCCTGAGC TGAAAACTAA CGAGTCTTTC
CAGCACGTAA CCTGCCGTGG CGTCTCTTAT CATGAAGCCA AAGCAGAAAC GGCTTCGCCG
GAAGTGATGG CGGATTGCCC GCGTCGTATC ATTCTTCCGG TCAATGATGG TCGACTGATT
GCGATTAACG CTGAAAACGG CAAACTGTGC GAAACCTTCG CCAATAAAGG CGTGCTCAAT
CTGCAAAGCA ATATGCCAGA CACCAAACCG GGTCTGTATG AACCGACTTC GCCACCGATT
ATCACCGATA AAACCATCGT GATGGCAGGT TCAGTTACCG ATAACTTCTC AACCCGCGAA
ACGTCTGGCG TGATCCGTGG TTTTGATGTC AACACCGGGG AGCTGCTGTG GGCTTTTGAT
CCCGGCGCGA AAGATCCGAA CGCAATCCCG TCTGACGAAC ACACCTTTAC CTTTAACTCG
CCAAACTCGT GGGCACCAGC GGCCTATGAC GCGAAGCTGG ATCTGGTGTA TCTGCCGATG
GGCGTGACCA CGCCGGATAT CTGGGGCGGT AACCGCACAC CGGAACAGGA ACGTTATGCC
AGCTCGATTC TGGCGCTGAA TGCCACTACC GGGAAACTGG CGTGGAGCTA CCAGACCGTT
CACCACGACC TGTGGGATAT GGATCTTCCG GCACAGCCGA CGCTGGCGGA CATCACCGTT
AATGGTCAGA AAGTGCCAGT TATTTACGCT CCGGCGAAAA CCGGCAACAT TTTTGTGCTC
GATCGTCGTA ATGGCGAACT GGTGGTTCCG GCACCGGAAA AACCGGTTCC CCAAGGTGCT
GCCAAAGGCG ATTACGTTAC CCCTACTCAA CCATTCTCGG AACTGAGCTT CCGTCCGACG
AAAGATTTGA GCGGTGCGGA TATGTGGGGA GCCACCATGT TTGACCAACT GGTGTGCCGC
GTGATGTTCC ACCAGATGCG CTATGAAGGC ATTTTCACCC CGCCATCTGA ACAGGGTACG
CTGGTCTTCC CGGGTAACCT GGGGATGTTC GAATGGGGCG GGATTTCCGT TGATCCAAAT
CGTGAAGTGG CGATTGCCAA CCCAATGGCA CTGCCGTTTG TTTCGAAACT GATCCCGCGT
GGTCCTGGCA ACCCGATGGA GCAGCCGAAA GATGCCAAAG GCACGGGTAC GGAATCCGGC
ATTCAGCCAC AGTACGGTGT ACCGTATGGT GTCACGCTCA ACCCGTTCCT CTCACCATTT
GGTCTGCCAT GTAAACAGCC AGCATGGGGT TATATCTCGG CGCTGGATCT GAAAACTAAT
GAAGTGGTGT GGAAGAAACG TATTGGTACG CCGCAGGACA GTATGCCGTT CCCGATGCCG
GTTCCGGTGC CGTTCAATAT GGGTATGCCG ATGCTGGGCG GGCCAATCTC CACGGCGGGT
AACGTGCTGT TTATCGCCGC TACGGCAGAT AACTACCTGC GCGCTTACAA CATGAGCAAC
GGTGAAAAAC TGTGGCAGGG TCGTTTACCA GCGGGTGGTC AGGCTACGCC AATGACCTAT
GAAGTGAATG GTAAGCAGTA TGTGGTGATC TCCGCAGGCG GTCACGGTTC ATTTGGTACG
AAGATGGGCG ACTATATTGT GGCTTATGCG CTGCCGGATG ATGTGAAGTA A
 
Protein sequence
MAINNTGSRR LLVTLTALFA ALCGLYLLIG GGWLVAIGGS WYYPIAGLVM LGVAWMLWRS 
KRAALWLYAA LLLGTMIWGV WEVGFDFWAL TPRSDILVFF GIWLILPFVW RRLVIPASGA
VAALVVALLI SGSILTWAGF NDPQEINGTL SADATPAEAI SPVADQDWPA YGRNQEGQRF
SPLKQINADN VHNLKEAWVF RTGDVKQPND PGEITNEVTP IKVGDTLYLC TAHQRLFALD
AASGKEKWHY DPELKTNESF QHVTCRGVSY HEAKAETASP EVMADCPRRI ILPVNDGRLI
AINAENGKLC ETFANKGVLN LQSNMPDTKP GLYEPTSPPI ITDKTIVMAG SVTDNFSTRE
TSGVIRGFDV NTGELLWAFD PGAKDPNAIP SDEHTFTFNS PNSWAPAAYD AKLDLVYLPM
GVTTPDIWGG NRTPEQERYA SSILALNATT GKLAWSYQTV HHDLWDMDLP AQPTLADITV
NGQKVPVIYA PAKTGNIFVL DRRNGELVVP APEKPVPQGA AKGDYVTPTQ PFSELSFRPT
KDLSGADMWG ATMFDQLVCR VMFHQMRYEG IFTPPSEQGT LVFPGNLGMF EWGGISVDPN
REVAIANPMA LPFVSKLIPR GPGNPMEQPK DAKGTGTESG IQPQYGVPYG VTLNPFLSPF
GLPCKQPAWG YISALDLKTN EVVWKKRIGT PQDSMPFPMP VPVPFNMGMP MLGGPISTAG
NVLFIAATAD NYLRAYNMSN GEKLWQGRLP AGGQATPMTY EVNGKQYVVI SAGGHGSFGT
KMGDYIVAYA LPDDVK
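
The relationship between the gene sequence and the protein sequence can be checked with a small translation routine. This is a minimal sketch, not the tool that produced the record: it assumes the sequence is given in the coding orientation starting at the first codon, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons). The names `CODON_TABLE`, `translate`, and `gc_fraction` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Assumption: table 11 shares these amino-acid assignments with table 1.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a sequence."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGCAATTA ACAATACAGG CTCGCGACGA"))  # MAINNTGSRR
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field.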