Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0373 |
Symbol | betA |
ID | 6967972 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 377103 |
End bp | 378791 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643384428 |
Product | choline dehydrogenase |
Protein accession | YP_002268943 |
Protein GI | 209399344 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | [TIGR01810] choline dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.868243 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.338212 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCAATTTG ACTACATCAT TATTGGTGCC GGCTCAGCCG GCAACGTTCT CGCTACCCGT CTGACTGAAG ATCCGAATAC CTCCGTGCTG CTGCTTGAAG CGGGCGGCCC GGACTATCGC TTTGACTTCC GCACCCAGAT GCCCGCTGCC CTGGCATTCC CGCTACAGGG TAAACGCTAC AACTGGGCTT ACGAGACGGA ACCTGAACCG TTTATGAATA ACCGTCGCAT GGAGTGCGGA CGCGGTAAAG GCCTGGGTGG ATCGTCGCTG ATCAACGGCA TGTGCTATAT CCGTGGCAAC GCGATGGATC TCGACAACTG GGCAAAAGAA CCCGGTCTGG AGAACTGGAG TTATCTCGAT TGCCTGCCCT ACTACCGCAA GGCCGAGACC CGCGACGTGG GCGAGAACGA CTACCACGGC GGCGACGGCC CGGTGAGCGT CACCACCTCC AAACCCGGCG TCAATCCGCT GTTTGAAGCG ATGATTGAAG CGGGCGTGCA GGCGGGCTAC CCGCGCACGG ACGATCTCAA CGGTTATCAG CAGGAAGGTT TCGGCCCGAT GGATCGCACC GTCACGCCGC AGGGCCGCCG CGCCAGCACC GCGCGCGGTT ATCTCGATCA GGCCAAATCG CGCCCAAACC TAACCATTCG TACTCACGCC ATGACCGATC ACATTATTTT TGACTGTAAA CGCGCGGTGG GCGTCGAGTG GCTGGAAGGC GACAGCACCA TTCCGACCCG CGCGACGGCG AACAAAGAAG TGCTGTTATG TGCAGGCGCG ATTGCCTCAC CGCAGATCCT GCAACGCTCT GGCGTCGGCA ACGCTGAACT GTTGGCCGAG TTTGATATTC CGCTGGTGCA TGATTTACCC GGCGTCGGTG AAAATCTTCA GGATCATCTG GAGATGTATC TGCAATATGA GTGCAAAGAA CCGGTTTCCC TCTACCCTGC CCTGCAGTGG TGGAATCAGC CGAAAATCGG TGCGGAGTGG CTGTTTGGCG GCACCGGCGT TGGTGCCAGC AACCACTTTG AAGCAGGCGG ATTTATTCGC AGCCGAGAGG AATTTGCGTG GCCGAATATT CAGTATCACT TCCTGCCGGT AGCGATTAAC TATAACGGCT CGAATGCAGT GAAAGAGCAC GGCTTCCAGT GCCACGTCGG CTCGATGCGC TCGCCAAGCC GTGGGCATGT GCGGATTAAA TCCCGCGACC CGCACCAGCA TCCGGCAATT CTGTTTAACT ACATGTCGCA CGAACAGGAC TGGCAGGAAT TCCGCGACGC AATTCGCATC ACCCGCGAGA TCATGCATCA ACCCGCGCTG GATCAGTATC GTGGCCGCGA AATCAGCCCC GGCACGGAAT GTCAGACGGA TGAGCAGCTC GATGAGTTTG TGCGTAACCA CGCCGAAACC GCCTTCCATC CGTGCGGTAC CTGCAAAATG GGCTACGACG AGATGTCCGT GGTTGACGGC GAAGGCCGCG TGCATGGGCT GGAAGGCCTG CGCGTGGTGG ATGCGTCAAT TATGCCGCAG ATTATCACCG GGAATTTGAA CGCCACGACG ATTATGATTG GCGAGAAAAT GGCGGATATG ATTCGCGGGA AGGAAGCGTT GCCGAGGAGC ACGGCGGGAT ATTTTGTGGC AAATGGGATG CCAGTAAGAG CGAAAAAAAT GAGTCGTGAT TTGAACTGA
|
Protein sequence | MQFDYIIIGA GSAGNVLATR LTEDPNTSVL LLEAGGPDYR FDFRTQMPAA LAFPLQGKRY NWAYETEPEP FMNNRRMECG RGKGLGGSSL INGMCYIRGN AMDLDNWAKE PGLENWSYLD CLPYYRKAET RDVGENDYHG GDGPVSVTTS KPGVNPLFEA MIEAGVQAGY PRTDDLNGYQ QEGFGPMDRT VTPQGRRAST ARGYLDQAKS RPNLTIRTHA MTDHIIFDCK RAVGVEWLEG DSTIPTRATA NKEVLLCAGA IASPQILQRS GVGNAELLAE FDIPLVHDLP GVGENLQDHL EMYLQYECKE PVSLYPALQW WNQPKIGAEW LFGGTGVGAS NHFEAGGFIR SREEFAWPNI QYHFLPVAIN YNGSNAVKEH GFQCHVGSMR SPSRGHVRIK SRDPHQHPAI LFNYMSHEQD WQEFRDAIRI TREIMHQPAL DQYRGREISP GTECQTDEQL DEFVRNHAET AFHPCGTCKM GYDEMSVVDG EGRVHGLEGL RVVDASIMPQ IITGNLNATT IMIGEKMADM IRGKEALPRS TAGYFVANGM PVRAKKMSRD LN
|
| |