Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0370 |
Symbol | betA |
ID | 5592815 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 382917 |
End bp | 384587 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640919555 |
Product | choline dehydrogenase |
Protein accession | YP_001457141 |
Protein GI | 157159823 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | [TIGR01810] choline dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCAATTTG ACTACATCAT TATTGGTGCC GGCTCAGCCG GCAACGTTCT CGCTACCCGT CTGACTGAAG ATCCGAATAC CTCCGTGCTG CTGCTTGAAG CGGGCGGCCC GGACTATCGC TTTGACTTCC GCACCCAGAT GCCCGCAGCG CTGGCGTTCC CACTACAGGG CAAACGCTAC AACTGGGCCT ATGAAACAGA GCCTGAACCG TTTATGAATA ACCGCCGCAT GGAGTGCGGA CGCGGTAAAG GCCTGGGTGG ATCGTCGCTG ATCAACGGCA TGTGCTACAT CCGTGGCAAT GCGCTGGATC TCGATAACTG GGCGCAAGAA CCCGGTCTGG AGAACTGGAG TTATCTCGAC TGCCTGCCCT ACTACCGCAA GGCCGAGACT CGCGATGTGG GCGAGAACGA CTACCACGGT GGCGATGGCC CGGTGAGCGT CACCACCTCC AAACCCGGCG TCAATCCGCT GTTTGAAGCG ATGATTGAAG CGGGCGTGCA GGCAGGCTAC CCGCGCACGG ACGATCTCAA CGGCTATCAG CAGGAAGGTT TTGGCCCGAT GGATCGCACC GTCACGCCGC AGGGCCGTCG CGCCAGTACC GCGCGCGGTT ATCTCGATCA GGCCAAATCG CGCCCTAACC TGACCATTCG TACTCACGCT ATGACCGATC ACATCATTTT TGACGGCAAA CGCGCGGTGG GCGTCGAATG GCTGGAAGGC GACAGCACCA TCCCAACCCG CGCAACGGCC AACAAAGAAG TGCTGTTATG TGCAGGCGCG ATTGCCTCAC CGCAGATCCT GCAACGCTCC GGCGTCGGCA ACGCTGAACT GCTGGCGGAG TTTGATATTC CGCTGGTGCA TGAATTACCC GGCGTCGGCG AAAATCTTCA GGATCATCTG GAGATGTATC TGCAATATGA GTGCAAAGAA CCGGTTTCCC TCTACCCTGC CCTGCAGTGG TGGAACCAAC CGAAAATCGG TGCGGAGTGG CTGTTTGGCG GCACTGGCGT TGGTGCCAGC AACCACTTTG AAGCAGGTGG ATTTATTCGC AGCCGTGAGG AATTTGCGTG GCCGAATATT CAGTACCATT TCCTGCCAGT AGCGATTAAC TATAACGGCT CGAATGCAGT GAAAGAGCAC GGTTTCCAGT GCCACGTCGG CTCAATGCGC TCGCCAAGCC GTGGGCATGT GCGGATTAAA TCCCGCGACC CGCACCAGCA TCCGGCGATT CTGTTTAACT ACATGTCGCA CGAGCAGGAC TGGCAGGAGT TCCGCGACGC AATTCGCATC ACCCGGGAGA TCATGCATCA ACCGGCGCTG GATCAGTATC GTGGCCGCGA AATCAGCCCC GGCACGGAAT GTCAGACGGA TGAACAGCTC GATGAGTTCG TGCGTAATCA CGCCGAAACC GCCTTCCATC CGTGCGGTAC CTGCAAAATG GGCTACGACG AGATGTCCGT GGTTGACGGC GAAGGCCGCG TACACGGGTT AGAAGGCCTG CGTGTGGTGG ATGCGTCGAT TATGCCGCAG ATTATCACCG GGAATTTGAA CGCCACGACA ATTATGATTG GCGAGAAAAT AGCGGATATG ATTCGTGGAC AGGAAGCGCT GCCGAGGAGC ACGGCGGGAT ATTTTGTGGC AAATGGGATG CCGGTGAGAG CGAAAAAATG A
|
Protein sequence | MQFDYIIIGA GSAGNVLATR LTEDPNTSVL LLEAGGPDYR FDFRTQMPAA LAFPLQGKRY NWAYETEPEP FMNNRRMECG RGKGLGGSSL INGMCYIRGN ALDLDNWAQE PGLENWSYLD CLPYYRKAET RDVGENDYHG GDGPVSVTTS KPGVNPLFEA MIEAGVQAGY PRTDDLNGYQ QEGFGPMDRT VTPQGRRAST ARGYLDQAKS RPNLTIRTHA MTDHIIFDGK RAVGVEWLEG DSTIPTRATA NKEVLLCAGA IASPQILQRS GVGNAELLAE FDIPLVHELP GVGENLQDHL EMYLQYECKE PVSLYPALQW WNQPKIGAEW LFGGTGVGAS NHFEAGGFIR SREEFAWPNI QYHFLPVAIN YNGSNAVKEH GFQCHVGSMR SPSRGHVRIK SRDPHQHPAI LFNYMSHEQD WQEFRDAIRI TREIMHQPAL DQYRGREISP GTECQTDEQL DEFVRNHAET AFHPCGTCKM GYDEMSVVDG EGRVHGLEGL RVVDASIMPQ IITGNLNATT IMIGEKIADM IRGQEALPRS TAGYFVANGM PVRAKK
|
| |