Gene B21_00270 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00270 
SymbolbetA 
ID8115023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp295198 
End bp296868 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content57% 
IMG OID644846559 
Producthypothetical protein 
Protein accessionYP_002998132 
Protein GI251783828 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01810] choline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCAATTTG ACTACATCAT TATTGGTGCC GGCTCAGCCG GCAACGTTCT CGCTACCCGT 
CTGACTGAAG ATCCGAATAC CTCCGTGCTG CTGCTTGAAG CGGGCGGCCC GGACTATCGC
TTTGACTTCC GCACCCAGAT GCCCGCTGCC CTGGCATTCC CGCTACAGGG TAAACGCTAC
AACTGGGCCT ATGAAACGGA ACCTGAACCG TTTATGAATA ACCGCCGCAT GGAGTGCGGA
CGCGGTAAAG GTCTGGGTGG ATCGTCGCTG ATCAACGGCA TGTGCTACAT CCGTGGCAAT
GCGCTGGATC TCGACAACTG GGCGAAAGAA CCCGGCCTGG AGAACTGGAG CTATCTCGAT
TGCCTGCCCT ACTATCGCAA GGCCGAGACG CGCGATATTG GTGATAACGA CTATCACGGC
GGTGATGGCC CGGTGAGCGT CACCACCTCC AAACCCGGCG TCAATCCGCT GTTTGAAGCG
ATGATTGAAG CGGGCGTGCA GGCGGGCTAC CCGCGCACGG ACGATCTCAA CGGCTATCAG
CAAGAAGGTT TTGGCCCGAT GGATCGCACC GTGACGCCGC AGGGCCGTCG CGCCAGCACC
GCGCGTGGCT ATCTCGATCA GGCCAAATCG CGTCCTAACC TGACCATTCG TACTCACGCT
ATGACCGATC ACATCATTTT TGACGGCAAA CGCGCGGTGG GCGTCGAATG GCTGGAAGGC
GACAGCACCA TCCCAACCCG CGCAACGGCC AACAAAGAAG TGCTGTTATG TGCAGGCGCG
ATTGCCTCAC CGCAGATCCT GCAACGCTCC GGCGTCGGCA ACGCTGAACT GCTGGCGGAG
TTTGATATTC CGCTGGTGCA TGAATTACCC GGCGTCGGCG AAAATCTTCA GGATCATCTG
GAGATGTATC TGCAATATGA GTGCAAAGAA CCGGTTTCCC TCTACCCTGC CCTGCAGTGG
TGGAACCAGC CGAAAATCGG TGCGGAGTGG CTGTTTGGCG GCACTGGCGT TGGTGCCAGC
AACCACTTTG AAGCAGGTGG ATTTATTCGC AGCCGTGAGG AATTTGCGTG GCCGAATATT
CAGTACCATT TCCTGCCAGT AGCGATTAAC TATAACGGCT CGAATGCAGT GAAAGAGCAC
GGTTTCCAGT GCCACGTCGG CTCAATGCGC TCGCCAAGCC GTGGGCATGT GCGGATTAAA
TCCCGCGACC CGCACCAGCA TCCGGCGATT CTGTTTAACT ACATGTCGCA CGAGCAGGAC
TGGCAGGAGT TCCGCGACGC AATTCGCATC ACCCGCGAGA TCATGCATCA ACCCGCGCTG
GATCAGTATC GTGGCCGCGA AATCAGCCCC GGTGTCGAAT GCCAGACGGA TGAACAGCTC
GATGAGTTCG TGCGTAACCA CGCCGAAACC GCCTTCCATC CGTGCGGTAC CTGCAAAATG
GGTTACGACG AGATGTCCGT GGTTGACGGC GAAGGCCGCG TACACGGGTT AGAAGGCCTG
CGTGTGGTGG ATGCGTCGAT TATGCCGCAG ATTATCACCG GGAATTTGAA CGCCACGACA
ATTATGATTG GCGAGAAAAT AGCGGATATG ATTCGTGGAC AGGAAGCGCT GCCGAGGAGC
ACGGCGGGAT ATTTTGTGGC AAATGGGATG CCGGTGAGAG CGAAAAAATG A
 
Protein sequence
MQFDYIIIGA GSAGNVLATR LTEDPNTSVL LLEAGGPDYR FDFRTQMPAA LAFPLQGKRY 
NWAYETEPEP FMNNRRMECG RGKGLGGSSL INGMCYIRGN ALDLDNWAKE PGLENWSYLD
CLPYYRKAET RDIGDNDYHG GDGPVSVTTS KPGVNPLFEA MIEAGVQAGY PRTDDLNGYQ
QEGFGPMDRT VTPQGRRAST ARGYLDQAKS RPNLTIRTHA MTDHIIFDGK RAVGVEWLEG
DSTIPTRATA NKEVLLCAGA IASPQILQRS GVGNAELLAE FDIPLVHELP GVGENLQDHL
EMYLQYECKE PVSLYPALQW WNQPKIGAEW LFGGTGVGAS NHFEAGGFIR SREEFAWPNI
QYHFLPVAIN YNGSNAVKEH GFQCHVGSMR SPSRGHVRIK SRDPHQHPAI LFNYMSHEQD
WQEFRDAIRI TREIMHQPAL DQYRGREISP GVECQTDEQL DEFVRNHAET AFHPCGTCKM
GYDEMSVVDG EGRVHGLEGL RVVDASIMPQ IITGNLNATT IMIGEKIADM IRGQEALPRS
TAGYFVANGM PVRAKK