Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0374 |
Symbol | betB |
ID | 6970465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 378805 |
End bp | 380277 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643384429 |
Product | betaine aldehyde dehydrogenase |
Protein accession | YP_002268944 |
Protein GI | 209401007 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01804] glycine betaine aldehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.216169 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGAA TGGCAGAACA GCAGCTTTAT ATACATGGTG GTTATACCTC CGCCACCAGC GGTCGCACCT TCGAGACCAT TAACCCGGCC AACGGTAACG TGCTGGCGAC CGTGCAGGCC GCCGGGCGCG AGGATGTCGA TCGCGCCGTG AAGAGTGCCC AACAGGGGCA AAAAATCTGG GCGGCGATGA CCGCTATGGA ACGCTCGCGT ATTCTGCGTC GGGCCGTCGA TATTCTGCGC GAGCGCAATG ATGAACTCGC CAAACTGGAA ACCCTCGACA CCGGAAAAGC GTATTCGGAA ACCTCAACCG TCGATATCGT TACCGGTGCG GACGTGCTGG AGTACTACGC CGGGCTGATC CCGTCGCTGG AAGGCAGCCA GATCCCGTTG CGTGAGACAT CATTTGTCTA CACTCGCCGC GAACCGCTGG GCGTGGTGGC AGGGATTGGC GCATGGAACT ACCCGATCCA GATTGCCCTG TGGAAATCCG CCCCGGCGCT GGCGGCGGGT AATGCGATGA TTTTCAAACC GAGCGAAGTC ACTCCGCTTA CCGCGTTAAA GCTGGCTGAA ATTTACAGCG AAGCGGGCCT GCCGGACGGT GTATTTAACG TGTTGCCGGG CGTGGGCGCG GAGACCGGGC AATATCTGAC CGAGCATCCG GGCATTGCCA AAGTGTCGTT TACCGGCGGC GTCGCCAGCG GCAAAAAAGT GATGGCTAAC TCGGCAGCCT CTTCCCTGAA AGAAGTGACC ATGGAACTGG GCGGTAAATC ACCGCTGATC GTTTTCGACG ATGCGGATCT CGATCTCGCC GCCGATATCG CCATGATGGC AAACTTCTTC AGCTCCGGTC AGGTGTGTAC CAATGGCACC CGCGTCTTCG TTCCGGCGAA ATGCAACGCC GCATTTGAGC AGAAAATTCT GGCGCGCGTT GAGCGCATTC GCGCGGGCGA CGTTTTCGAT CCGCAAACTA ACTTCGGCCC GCTGGTCAGC TTCCCGCATC GCGATAACGT GCTGCGCTAT ATCGCCAAAG GCAAAGAGGA AGGCGCGCGC GTACTGTGCG GCGGCGATGT ACTGAAAGGC GATGGCCTCG ATAACGGCGC ATGGGTTGCA CCGACCGTGT TCACCGATTG CAGCGACGAG ATGACCATCG TGCGTGAAGA GATCTTCGGG CCGGTGATGT CCATTCTGAC CTACGAGTCG GAAGACGAAG TCATTCGCCG CGCCAATGAT ACCGACTACG GCCTGGCGGC GGGTATCGTG ACGGCGGACC TGAACCGCGC GCATCGCGTC ATTCATCAGC TGGAAGCGGG TATTTGCTGG ATCAACACCT GGGGCGAATC CCCGGCAGAG ATGCCCGTTG GCGGCTACAA ACACTCCGGT ATTGGCCGCG AGAACGGCGT GATGACGCTC CAGAGTTACA CCCAGGTGAA GTCCATCCAG GTTGAGATGG CTAAATTCCA GTCCATATTC TAA
|
Protein sequence | MSRMAEQQLY IHGGYTSATS GRTFETINPA NGNVLATVQA AGREDVDRAV KSAQQGQKIW AAMTAMERSR ILRRAVDILR ERNDELAKLE TLDTGKAYSE TSTVDIVTGA DVLEYYAGLI PSLEGSQIPL RETSFVYTRR EPLGVVAGIG AWNYPIQIAL WKSAPALAAG NAMIFKPSEV TPLTALKLAE IYSEAGLPDG VFNVLPGVGA ETGQYLTEHP GIAKVSFTGG VASGKKVMAN SAASSLKEVT MELGGKSPLI VFDDADLDLA ADIAMMANFF SSGQVCTNGT RVFVPAKCNA AFEQKILARV ERIRAGDVFD PQTNFGPLVS FPHRDNVLRY IAKGKEEGAR VLCGGDVLKG DGLDNGAWVA PTVFTDCSDE MTIVREEIFG PVMSILTYES EDEVIRRAND TDYGLAAGIV TADLNRAHRV IHQLEAGICW INTWGESPAE MPVGGYKHSG IGRENGVMTL QSYTQVKSIQ VEMAKFQSIF
|
| |