Gene EcSMS35_0342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0342 
SymbolbetA 
ID6146020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp350805 
End bp352493 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content58% 
IMG OID641615238 
Productcholine dehydrogenase 
Protein accessionYP_001742446 
Protein GI170681227 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01810] choline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCAATTTG ACTACATCAT TATTGGTGCC GGCTCAGCCG GCAACGTCCT CGCTACCCGT 
CTGACTGAAG ATCCGAACAC AACCGTGCTG CTGCTTGAAG CAGGCGGCCC GGACTATCGC
TTTGACTTCC GCACCCAGAT GCCCGCCGCG CTGGCGTTCC CGCTACAGGG CAAACGCTAC
AACTGGGCCT ATGAGACGGA ACCCGAACCG TTTATGAACA ACCGCCGCAT GGAGTGCGGA
CGCGGCAAAG GTCTGGGCGG TTCATCGCTG ATCAACGGCA TGTGTTACAT CCGTGGTAAC
GCGCTGGATC TCGATAACTG GGCGCAAGAA CCCGGTCTGG AGAACTGGAG CTACCTCGAC
TGTCTGCCCT ACTACCGCAA GGCCGAGACT CGCGACGTGG GCGAGAACGA CTACCACGGC
GGCGACGGCC CGGTGAGCGT CACCACCTCC AAACCCGGCG TCAATCCGCT GTTTGAAGCG
ATGATTGAAG CGGGCGTGCA GGCGGGCTAC CCGCGCACGG ACGATCTCAA CGGCTATCAG
CAAGAAGGTT TTGGCCCGAT GGATCGCACC GTCACGCCGC ATGGCCGTCG CGCCAGCACC
GCGCGTGGCT ATCTCGATCA GGCCAAATCG CGCTCTAACC TGACCATTCG TACTCACGCC
ATGACCGATC ACATCATTTT TGACGGCAAA CGCGCGGTGG GCGTCGAGTG GCTGGAAGGC
GACAGCACCA TTCCGACCCG CGCGGCGGCG AATAAAGAAG TGCTGTTATG TGCAGGCGCG
ATTGCTTCAC CGCAGATCCT GCAACGCTCC GGCGTCGGCA ACGCTGGACT GCTGGCGGAG
TTTGATATTC CGCTGGTGCA TGAATTACCT GGCGTCGGCG AAAATCTTCA GGATCACCTG
GAGATGTATC TGCAATATGA GTGCAAAGAA CCGGTTTCCC TCTACCCTGC CCTGCAGTGG
TGGAATCAGC CGAAAATCGG TGCGGAGTGG CTGTTTGGCG GCACCGGCAT TGGTGCCAGC
AACCACTTTG AAGCAGGCGG ATTTATTCGC AGCCGTGAGG AATTTGCGTG GCCGAATATT
CAGTACCATT TCCTGCCAGT GGCGATTAAC TATAACGGCT CGAACGCAGT GAAAGAGCAC
GGCTTCCAGT GCCACGTCGG ATCGATGCGC TCGCCAAGCC GTGGGCATGT GCGGATTAAA
TCCCGCGCCC CGCACCAGCA TCCAGCGATT CTGTTTAACT ACATGTCGCA CGAGCAGGAC
TGGCAGGAGT TCCGCGACGC AATTCGCATC ACCCGGGAGA TCATGCATCA ACCGGCGCTG
GATCAGTATC ATGGCCGCGA AATCAGCCCC GGTGTCGAAT GCCAGACGGA TGAGCAGCTC
GATGAGTTTG TGCGTAATCA CGCCGAAACC GCCTTCCACC CATGCGGTAC GTGCAAAATG
GGGTACGACG AGATGTCCGT TGTGGATGGC GAAGGCCGCG TGCACGGGCT GGAAGGATTA
CGCGTAGTCG ATGCGTCAAT TATGCCGCAG ATTATCACCG GCAATCTGAA CGCCACGACG
ATTATGATTG GCGAGAAAAT GGCGGATATG ATTCGCGGGA AGGATGCGTT GCCGAGGAGC
ACGGCGGGAT ATTATGTGGC AAATGGGATG CCGGTGAGAG CGAAAAAAAT GAGTCGTGAT
GTGAACTGA
 
Protein sequence
MQFDYIIIGA GSAGNVLATR LTEDPNTTVL LLEAGGPDYR FDFRTQMPAA LAFPLQGKRY 
NWAYETEPEP FMNNRRMECG RGKGLGGSSL INGMCYIRGN ALDLDNWAQE PGLENWSYLD
CLPYYRKAET RDVGENDYHG GDGPVSVTTS KPGVNPLFEA MIEAGVQAGY PRTDDLNGYQ
QEGFGPMDRT VTPHGRRAST ARGYLDQAKS RSNLTIRTHA MTDHIIFDGK RAVGVEWLEG
DSTIPTRAAA NKEVLLCAGA IASPQILQRS GVGNAGLLAE FDIPLVHELP GVGENLQDHL
EMYLQYECKE PVSLYPALQW WNQPKIGAEW LFGGTGIGAS NHFEAGGFIR SREEFAWPNI
QYHFLPVAIN YNGSNAVKEH GFQCHVGSMR SPSRGHVRIK SRAPHQHPAI LFNYMSHEQD
WQEFRDAIRI TREIMHQPAL DQYHGREISP GVECQTDEQL DEFVRNHAET AFHPCGTCKM
GYDEMSVVDG EGRVHGLEGL RVVDASIMPQ IITGNLNATT IMIGEKMADM IRGKDALPRS
TAGYYVANGM PVRAKKMSRD VN