Gene Information |
Locus tag | EcSMS35_0343 |
Symbol | betB |
ID | 6145411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 352507 |
End bp | 353979 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641615239 |
Product | betaine aldehyde dehydrogenase |
Protein accession | YP_001742447 |
Protein GI | 170681572 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01804] glycine betaine aldehyde dehydrogenase |
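The coordinate and length fields above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 352507
end_bp = 353979

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1473
print(protein_length)  # 490
```

Strand is "-", so the gene is read from End bp towards Start bp on the reverse strand; the length arithmetic is the same in either direction.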
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGAA TGGCAGAACA GCAGCTTTAT ATACATGGTG GTTATACATC CGCCACCAGC GGTCGCACCT TCGAGACCAT TAACCCGGCC AACGGTAACG TGCTGGCGAC CGTGCAGGCC GCCGGGCGCG AGGATGTCGA TCGCGCCGTG AAGAGTGCCC AACAGGGGCA AAAAATCTGG GCGGCGATGA CCGCTATGGA ACGCTCGCGT ATTCTGCGTC GGGCCGTTGA TATTCTACGT GAACGCAATG ACGAACTCGC AAAACTGGAA ACCCTCGACA CCGGAAAAGC ATATTCGGAA ACCTCAACCG TCGATATCGT TACCGGTGCG GACGTGCTGG AGTACTACGC CGGGCTGATC CCGGCGCTGG AAGGCAGCCA GATCCCGTTG CGTGAGACGT CATTTGTTTA TACCCGCCGC GAACCGCTGG GCGTAGTGGC AGGGATTGGC GCATGGAACT ACCCGATTCA GATTGCCCTG TGGAAATCCG CCCCAGCGCT GGCGGCAGGC AATGCAATGA TTTTCAAACC GAGCGAAGTC ACCCCGCTTA CCGCGTTAAA GCTGGCCGAA ATTTACAGCG AAGCGGGCCT GCCGGACGGC GTATTTAACG TGTTGCCGGG CGTGGGCGCG GAGACCGGGC AGTATCTGAC CGAGCATCCG GGCATTGCCA AAGTGTCATT TACCGGCGGT GTCGCCAGCG GCAAAAAAGT GATGGCTAAC TCGGCGGCTT CCTCCCTGAA AGAAGTGACC ATGGAGCTGG GCGGTAAATC ACCGCTGATC GTTTTTGACG ATGCGGATCT CGATCTCGCC GCCGATATCG CCATGATGGC GAATTTCTTC AGCTCCGGTC AGGTGTGTAC CAACGGCACC CGCGTCTTCG TTCCGGCGAA ATGCAAAGCC GCATTTGAAC AAAAGATTCT GGCGCGCGTT GAGCGCATTC GCGCGGGCGA CGTTTTCGAT CCGCAAACTA ACTTTGGCCC GCTGGTCAGC TTCCCGCATC GCGATAACGT GCTGCGTTAC ATCGCCAAAG GCAAAGAGGA AGGCGCGCGC GTACTGTGCG GCGGCGATGT ACTGAAAGGC GATGGCTTCG ATAACGGCGC ATGGGTTGCA CCGACCGTGT TCACCGATTG CCGCGACGAA ATGACCATCG TGCGCGAAGA GATCTTCGGT CCGGTGATGT CGCTTCTCAC TTATGAATCT GAGGACGAAG TGATTCGCCG CGCTAACGAT ACCGACTACG GTCTGGCGGC GGGTATCGTG ACGGCGGACC TGAACCTCGC GCATCGCGTC ATTCATCAGC TGGAAGCGGG TATTTGCTGG ATCAACACCT GGGGCGAATC CCCGGCAGAG ATGCCCGTTG GCGGCTACAA ACACTCCGGC ATTGGTCGCG AGAACGGCGT GATGACGCTC CAGAGTTACA CCCAGGTGAA GTCCATCCAG GTTGAGATGG CTAAATTCCA GTCCATATTC TAA
|
Protein sequence | MSRMAEQQLY IHGGYTSATS GRTFETINPA NGNVLATVQA AGREDVDRAV KSAQQGQKIW AAMTAMERSR ILRRAVDILR ERNDELAKLE TLDTGKAYSE TSTVDIVTGA DVLEYYAGLI PALEGSQIPL RETSFVYTRR EPLGVVAGIG AWNYPIQIAL WKSAPALAAG NAMIFKPSEV TPLTALKLAE IYSEAGLPDG VFNVLPGVGA ETGQYLTEHP GIAKVSFTGG VASGKKVMAN SAASSLKEVT MELGGKSPLI VFDDADLDLA ADIAMMANFF SSGQVCTNGT RVFVPAKCKA AFEQKILARV ERIRAGDVFD PQTNFGPLVS FPHRDNVLRY IAKGKEEGAR VLCGGDVLKG DGFDNGAWVA PTVFTDCRDE MTIVREEIFG PVMSLLTYES EDEVIRRAND TDYGLAAGIV TADLNLAHRV IHQLEAGICW INTWGESPAE MPVGGYKHSG IGRENGVMTL QSYTQVKSIQ VEMAKFQSIF
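The relationship between the two sequence fields can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record. It assumes the sequence contains only A, C, G and T once the spaces between 10-base groups are removed. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the start codons it permits, so the standard assignments are used here. The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
# Codon table built from the standard compact layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace between 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G or T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence field, as grouped in the record.
prefix = "ATGTCCCGAA TGGCAGAACA GCAGCTTTAT"
print(translate(prefix))  # MSRMAEQQLY
```

The output matches the first ten residues of the protein sequence field. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein length fields to be checked in the same way.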