Gene ECH74115_0373 details


Gene Information

Locus tag: ECH74115_0373
Symbol: betA
ID: 6967972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 377103
End bp: 378791
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 57%
IMG OID: 643384428
Product: choline dehydrogenase
Protein accession: YP_002268943
Protein GI: 209399344
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID: [TIGR01810] choline dehydrogenase
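
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(377103, 378791)
print(length_bp)                  # 1689, matching the Gene Length field
print(protein_length(length_bp))  # 562, matching the Protein Length field
```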


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.868243
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.338212
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCAATTTG ACTACATCAT TATTGGTGCC GGCTCAGCCG GCAACGTTCT CGCTACCCGT 
CTGACTGAAG ATCCGAATAC CTCCGTGCTG CTGCTTGAAG CGGGCGGCCC GGACTATCGC
TTTGACTTCC GCACCCAGAT GCCCGCTGCC CTGGCATTCC CGCTACAGGG TAAACGCTAC
AACTGGGCTT ACGAGACGGA ACCTGAACCG TTTATGAATA ACCGTCGCAT GGAGTGCGGA
CGCGGTAAAG GCCTGGGTGG ATCGTCGCTG ATCAACGGCA TGTGCTATAT CCGTGGCAAC
GCGATGGATC TCGACAACTG GGCAAAAGAA CCCGGTCTGG AGAACTGGAG TTATCTCGAT
TGCCTGCCCT ACTACCGCAA GGCCGAGACC CGCGACGTGG GCGAGAACGA CTACCACGGC
GGCGACGGCC CGGTGAGCGT CACCACCTCC AAACCCGGCG TCAATCCGCT GTTTGAAGCG
ATGATTGAAG CGGGCGTGCA GGCGGGCTAC CCGCGCACGG ACGATCTCAA CGGTTATCAG
CAGGAAGGTT TCGGCCCGAT GGATCGCACC GTCACGCCGC AGGGCCGCCG CGCCAGCACC
GCGCGCGGTT ATCTCGATCA GGCCAAATCG CGCCCAAACC TAACCATTCG TACTCACGCC
ATGACCGATC ACATTATTTT TGACTGTAAA CGCGCGGTGG GCGTCGAGTG GCTGGAAGGC
GACAGCACCA TTCCGACCCG CGCGACGGCG AACAAAGAAG TGCTGTTATG TGCAGGCGCG
ATTGCCTCAC CGCAGATCCT GCAACGCTCT GGCGTCGGCA ACGCTGAACT GTTGGCCGAG
TTTGATATTC CGCTGGTGCA TGATTTACCC GGCGTCGGTG AAAATCTTCA GGATCATCTG
GAGATGTATC TGCAATATGA GTGCAAAGAA CCGGTTTCCC TCTACCCTGC CCTGCAGTGG
TGGAATCAGC CGAAAATCGG TGCGGAGTGG CTGTTTGGCG GCACCGGCGT TGGTGCCAGC
AACCACTTTG AAGCAGGCGG ATTTATTCGC AGCCGAGAGG AATTTGCGTG GCCGAATATT
CAGTATCACT TCCTGCCGGT AGCGATTAAC TATAACGGCT CGAATGCAGT GAAAGAGCAC
GGCTTCCAGT GCCACGTCGG CTCGATGCGC TCGCCAAGCC GTGGGCATGT GCGGATTAAA
TCCCGCGACC CGCACCAGCA TCCGGCAATT CTGTTTAACT ACATGTCGCA CGAACAGGAC
TGGCAGGAAT TCCGCGACGC AATTCGCATC ACCCGCGAGA TCATGCATCA ACCCGCGCTG
GATCAGTATC GTGGCCGCGA AATCAGCCCC GGCACGGAAT GTCAGACGGA TGAGCAGCTC
GATGAGTTTG TGCGTAACCA CGCCGAAACC GCCTTCCATC CGTGCGGTAC CTGCAAAATG
GGCTACGACG AGATGTCCGT GGTTGACGGC GAAGGCCGCG TGCATGGGCT GGAAGGCCTG
CGCGTGGTGG ATGCGTCAAT TATGCCGCAG ATTATCACCG GGAATTTGAA CGCCACGACG
ATTATGATTG GCGAGAAAAT GGCGGATATG ATTCGCGGGA AGGAAGCGTT GCCGAGGAGC
ACGGCGGGAT ATTTTGTGGC AAATGGGATG CCAGTAAGAG CGAAAAAAAT GAGTCGTGAT
TTGAACTGA
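
A GC content figure like the one listed under Gene Information can be recomputed from a sequence printed in this 10-base block layout. The helper below is a minimal sketch: it strips the spacing, then reports the percentage of G and C bases. How the database rounds its value is not stated here, so the function returns an unrounded float, and `gc_percent` is a hypothetical name.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# The first two 10-base blocks of the gene sequence, spacing left as printed.
print(gc_percent("TTGCAATTTG ACTACATCAT"))  # 30.0
```

Passing the full gene sequence, with its spaces and line breaks left in, gives the value for the whole gene.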
 
Protein sequence
MQFDYIIIGA GSAGNVLATR LTEDPNTSVL LLEAGGPDYR FDFRTQMPAA LAFPLQGKRY 
NWAYETEPEP FMNNRRMECG RGKGLGGSSL INGMCYIRGN AMDLDNWAKE PGLENWSYLD
CLPYYRKAET RDVGENDYHG GDGPVSVTTS KPGVNPLFEA MIEAGVQAGY PRTDDLNGYQ
QEGFGPMDRT VTPQGRRAST ARGYLDQAKS RPNLTIRTHA MTDHIIFDCK RAVGVEWLEG
DSTIPTRATA NKEVLLCAGA IASPQILQRS GVGNAELLAE FDIPLVHDLP GVGENLQDHL
EMYLQYECKE PVSLYPALQW WNQPKIGAEW LFGGTGVGAS NHFEAGGFIR SREEFAWPNI
QYHFLPVAIN YNGSNAVKEH GFQCHVGSMR SPSRGHVRIK SRDPHQHPAI LFNYMSHEQD
WQEFRDAIRI TREIMHQPAL DQYRGREISP GTECQTDEQL DEFVRNHAET AFHPCGTCKM
GYDEMSVVDG EGRVHGLEGL RVVDASIMPQ IITGNLNATT IMIGEKMADM IRGKEALPRS
TAGYFVANGM PVRAKKMSRD LN
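
The gene sequence begins with TTG, yet the protein begins with M. Under translation table 11, TTG is one of the permitted alternative start codons, and an initiating codon is read as methionine, while an internal TTG still encodes leucine. The sketch below illustrates this with a hand-built codon table; the amino acid string and start codon set follow the NCBI definition of table 11 as best recalled here, so treat them as assumptions, and `translate` is a hypothetical helper rather than a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(cds: str, is_gene_start: bool = True) -> str:
    """Translate a CDS fragment, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and is_gene_start and codon in START_CODONS:
            residues.append("M")  # an initiating codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence: TTG at the start is read as M.
print(translate("TTGCAATTTG ACTACATCAT TATTGGTGCC"))       # MQFDYIIIGA
# Last five codons of the gene: the internal TTG is read as L.
print(translate("CGTGATTTGAACTGA", is_gene_start=False))   # RDLN
```

Both outputs agree with the first and last residues of the protein sequence above.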