Gene EcHS_A0370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0370 
SymbolbetA 
ID5592815 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp382917 
End bp384587 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content57% 
IMG OID640919555 
Productcholine dehydrogenase 
Protein accessionYP_001457141 
Protein GI157159823 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01810] choline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCAATTTG ACTACATCAT TATTGGTGCC GGCTCAGCCG GCAACGTTCT CGCTACCCGT 
CTGACTGAAG ATCCGAATAC CTCCGTGCTG CTGCTTGAAG CGGGCGGCCC GGACTATCGC
TTTGACTTCC GCACCCAGAT GCCCGCAGCG CTGGCGTTCC CACTACAGGG CAAACGCTAC
AACTGGGCCT ATGAAACAGA GCCTGAACCG TTTATGAATA ACCGCCGCAT GGAGTGCGGA
CGCGGTAAAG GCCTGGGTGG ATCGTCGCTG ATCAACGGCA TGTGCTACAT CCGTGGCAAT
GCGCTGGATC TCGATAACTG GGCGCAAGAA CCCGGTCTGG AGAACTGGAG TTATCTCGAC
TGCCTGCCCT ACTACCGCAA GGCCGAGACT CGCGATGTGG GCGAGAACGA CTACCACGGT
GGCGATGGCC CGGTGAGCGT CACCACCTCC AAACCCGGCG TCAATCCGCT GTTTGAAGCG
ATGATTGAAG CGGGCGTGCA GGCAGGCTAC CCGCGCACGG ACGATCTCAA CGGCTATCAG
CAGGAAGGTT TTGGCCCGAT GGATCGCACC GTCACGCCGC AGGGCCGTCG CGCCAGTACC
GCGCGCGGTT ATCTCGATCA GGCCAAATCG CGCCCTAACC TGACCATTCG TACTCACGCT
ATGACCGATC ACATCATTTT TGACGGCAAA CGCGCGGTGG GCGTCGAATG GCTGGAAGGC
GACAGCACCA TCCCAACCCG CGCAACGGCC AACAAAGAAG TGCTGTTATG TGCAGGCGCG
ATTGCCTCAC CGCAGATCCT GCAACGCTCC GGCGTCGGCA ACGCTGAACT GCTGGCGGAG
TTTGATATTC CGCTGGTGCA TGAATTACCC GGCGTCGGCG AAAATCTTCA GGATCATCTG
GAGATGTATC TGCAATATGA GTGCAAAGAA CCGGTTTCCC TCTACCCTGC CCTGCAGTGG
TGGAACCAAC CGAAAATCGG TGCGGAGTGG CTGTTTGGCG GCACTGGCGT TGGTGCCAGC
AACCACTTTG AAGCAGGTGG ATTTATTCGC AGCCGTGAGG AATTTGCGTG GCCGAATATT
CAGTACCATT TCCTGCCAGT AGCGATTAAC TATAACGGCT CGAATGCAGT GAAAGAGCAC
GGTTTCCAGT GCCACGTCGG CTCAATGCGC TCGCCAAGCC GTGGGCATGT GCGGATTAAA
TCCCGCGACC CGCACCAGCA TCCGGCGATT CTGTTTAACT ACATGTCGCA CGAGCAGGAC
TGGCAGGAGT TCCGCGACGC AATTCGCATC ACCCGGGAGA TCATGCATCA ACCGGCGCTG
GATCAGTATC GTGGCCGCGA AATCAGCCCC GGCACGGAAT GTCAGACGGA TGAACAGCTC
GATGAGTTCG TGCGTAATCA CGCCGAAACC GCCTTCCATC CGTGCGGTAC CTGCAAAATG
GGCTACGACG AGATGTCCGT GGTTGACGGC GAAGGCCGCG TACACGGGTT AGAAGGCCTG
CGTGTGGTGG ATGCGTCGAT TATGCCGCAG ATTATCACCG GGAATTTGAA CGCCACGACA
ATTATGATTG GCGAGAAAAT AGCGGATATG ATTCGTGGAC AGGAAGCGCT GCCGAGGAGC
ACGGCGGGAT ATTTTGTGGC AAATGGGATG CCGGTGAGAG CGAAAAAATG A
 
Protein sequence
MQFDYIIIGA GSAGNVLATR LTEDPNTSVL LLEAGGPDYR FDFRTQMPAA LAFPLQGKRY 
NWAYETEPEP FMNNRRMECG RGKGLGGSSL INGMCYIRGN ALDLDNWAQE PGLENWSYLD
CLPYYRKAET RDVGENDYHG GDGPVSVTTS KPGVNPLFEA MIEAGVQAGY PRTDDLNGYQ
QEGFGPMDRT VTPQGRRAST ARGYLDQAKS RPNLTIRTHA MTDHIIFDGK RAVGVEWLEG
DSTIPTRATA NKEVLLCAGA IASPQILQRS GVGNAELLAE FDIPLVHELP GVGENLQDHL
EMYLQYECKE PVSLYPALQW WNQPKIGAEW LFGGTGVGAS NHFEAGGFIR SREEFAWPNI
QYHFLPVAIN YNGSNAVKEH GFQCHVGSMR SPSRGHVRIK SRDPHQHPAI LFNYMSHEQD
WQEFRDAIRI TREIMHQPAL DQYRGREISP GTECQTDEQL DEFVRNHAET AFHPCGTCKM
GYDEMSVVDG EGRVHGLEGL RVVDASIMPQ IITGNLNATT IMIGEKIADM IRGQEALPRS
TAGYFVANGM PVRAKK