Gene Information |
Locus tag | B21_00270 |
Symbol | betA |
ID | 8115023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 295198 |
End bp | 296868 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644846559 |
Product | hypothetical protein |
Protein accession | YP_002998132 |
Protein GI | 251783828 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | [TIGR01810] choline dehydrogenase |
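The length fields above follow from the coordinates. A minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the function names are illustrative, not part of the record):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues in the product, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(295198, 296868)
print(length)                     # 1671
print(protein_length_aa(length))  # 556
```

Both values agree with the Gene Length and Protein Length fields of this record.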
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCAATTTG ACTACATCAT TATTGGTGCC GGCTCAGCCG GCAACGTTCT CGCTACCCGT CTGACTGAAG ATCCGAATAC CTCCGTGCTG CTGCTTGAAG CGGGCGGCCC GGACTATCGC TTTGACTTCC GCACCCAGAT GCCCGCTGCC CTGGCATTCC CGCTACAGGG TAAACGCTAC AACTGGGCCT ATGAAACGGA ACCTGAACCG TTTATGAATA ACCGCCGCAT GGAGTGCGGA CGCGGTAAAG GTCTGGGTGG ATCGTCGCTG ATCAACGGCA TGTGCTACAT CCGTGGCAAT GCGCTGGATC TCGACAACTG GGCGAAAGAA CCCGGCCTGG AGAACTGGAG CTATCTCGAT TGCCTGCCCT ACTATCGCAA GGCCGAGACG CGCGATATTG GTGATAACGA CTATCACGGC GGTGATGGCC CGGTGAGCGT CACCACCTCC AAACCCGGCG TCAATCCGCT GTTTGAAGCG ATGATTGAAG CGGGCGTGCA GGCGGGCTAC CCGCGCACGG ACGATCTCAA CGGCTATCAG CAAGAAGGTT TTGGCCCGAT GGATCGCACC GTGACGCCGC AGGGCCGTCG CGCCAGCACC GCGCGTGGCT ATCTCGATCA GGCCAAATCG CGTCCTAACC TGACCATTCG TACTCACGCT ATGACCGATC ACATCATTTT TGACGGCAAA CGCGCGGTGG GCGTCGAATG GCTGGAAGGC GACAGCACCA TCCCAACCCG CGCAACGGCC AACAAAGAAG TGCTGTTATG TGCAGGCGCG ATTGCCTCAC CGCAGATCCT GCAACGCTCC GGCGTCGGCA ACGCTGAACT GCTGGCGGAG TTTGATATTC CGCTGGTGCA TGAATTACCC GGCGTCGGCG AAAATCTTCA GGATCATCTG GAGATGTATC TGCAATATGA GTGCAAAGAA CCGGTTTCCC TCTACCCTGC CCTGCAGTGG TGGAACCAGC CGAAAATCGG TGCGGAGTGG CTGTTTGGCG GCACTGGCGT TGGTGCCAGC AACCACTTTG AAGCAGGTGG ATTTATTCGC AGCCGTGAGG AATTTGCGTG GCCGAATATT CAGTACCATT TCCTGCCAGT AGCGATTAAC TATAACGGCT CGAATGCAGT GAAAGAGCAC GGTTTCCAGT GCCACGTCGG CTCAATGCGC TCGCCAAGCC GTGGGCATGT GCGGATTAAA TCCCGCGACC CGCACCAGCA TCCGGCGATT CTGTTTAACT ACATGTCGCA CGAGCAGGAC TGGCAGGAGT TCCGCGACGC AATTCGCATC ACCCGCGAGA TCATGCATCA ACCCGCGCTG GATCAGTATC GTGGCCGCGA AATCAGCCCC GGTGTCGAAT GCCAGACGGA TGAACAGCTC GATGAGTTCG TGCGTAACCA CGCCGAAACC GCCTTCCATC CGTGCGGTAC CTGCAAAATG GGTTACGACG AGATGTCCGT GGTTGACGGC GAAGGCCGCG TACACGGGTT AGAAGGCCTG CGTGTGGTGG ATGCGTCGAT TATGCCGCAG ATTATCACCG GGAATTTGAA CGCCACGACA ATTATGATTG GCGAGAAAAT AGCGGATATG ATTCGTGGAC AGGAAGCGCT GCCGAGGAGC ACGGCGGGAT ATTTTGTGGC AAATGGGATG CCGGTGAGAG CGAAAAAATG A
|
Protein sequence | MQFDYIIIGA GSAGNVLATR LTEDPNTSVL LLEAGGPDYR FDFRTQMPAA LAFPLQGKRY NWAYETEPEP FMNNRRMECG RGKGLGGSSL INGMCYIRGN ALDLDNWAKE PGLENWSYLD CLPYYRKAET RDIGDNDYHG GDGPVSVTTS KPGVNPLFEA MIEAGVQAGY PRTDDLNGYQ QEGFGPMDRT VTPQGRRAST ARGYLDQAKS RPNLTIRTHA MTDHIIFDGK RAVGVEWLEG DSTIPTRATA NKEVLLCAGA IASPQILQRS GVGNAELLAE FDIPLVHELP GVGENLQDHL EMYLQYECKE PVSLYPALQW WNQPKIGAEW LFGGTGVGAS NHFEAGGFIR SREEFAWPNI QYHFLPVAIN YNGSNAVKEH GFQCHVGSMR SPSRGHVRIK SRDPHQHPAI LFNYMSHEQD WQEFRDAIRI TREIMHQPAL DQYRGREISP GVECQTDEQL DEFVRNHAET AFHPCGTCKM GYDEMSVVDG EGRVHGLEGL RVVDASIMPQ IITGNLNATT IMIGEKIADM IRGQEALPRS TAGYFVANGM PVRAKK
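The gene sequence begins with TTG, while the protein sequence begins with M. Under translation table 11 (bacterial), TTG can act as an initiation codon, and an initiation codon is read as methionine. The sketch below illustrates that rule on the first ten codons of this record. The codon table is the standard amino acid assignment, and the set of start codons is an assumption based on the NCBI definition of table 11; the helper name `translate_cds` is hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignment, codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiation codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring the spaces between 10-base groups."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate_cds("TTGCAATTTG ACTACATCAT TATTGGTGCC"))  # MQFDYIIIGA
```

The result matches the first block of the protein sequence. The same function stops at the terminal TGA, so the last codons of the gene (GCG AAA AAA TGA) give AKK, matching the end of the protein sequence.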