Gene Information |
Locus tag | EcolC_3311 |
Symbol | |
ID | 6067157 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3628308 |
End bp | 3629780 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641602727 |
Product | betaine aldehyde dehydrogenase |
Protein accession | YP_001726260 |
Protein GI | 170021306 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01804] glycine betaine aldehyde dehydrogenase |
|
|
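The coordinates and lengths listed above are mutually consistent, which a short sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(3628308, 3629780)
print(length)                     # 1473
print(protein_length_aa(length))  # 490
```

Both results agree with the Gene Length (1473 bp) and Protein Length (490 aa) fields.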
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGAA TGGCAGAACA GCAGCTTTAT ATACATGGTG GTTATACCTC CGCCACCAGC GGTCGCACCT TCGAGACCAT TAACCCGGCC AACGGTAACG TGCTGGCGAC CGTGCAGGCC GCCGGGCGCG AGGATGTCGA TCGCGCCGTG AAAAGCGCCC AGCAGGGGCA AAAAATCTGG GCGTCGATGA CCGCCATGGA GCGCTCGCGT ATTCTGCGTC GGGCCGTTGA TATTCTGCGT GAACGCAATG ACGAACTCGC AAAACTGGAA ACCCTCGACA CCGGAAAAGC ATATTCGGAA ACCTCAACCG TCGATATCGT TACCGGTGCG GACGTGCTGG AGTACTACGC CGGGCTGATC CCGGCGTTGG AAGGCAGCCA GATCCCGTTG CGTGAAACGT CCTTTGTGTA TACCCGCCGC GAACCGCTGG GCGTAGTGGC AGGGATTGGC GCATGGAACT ACCCGATCCA GATTGCCCTG TGGAAATCCG CCCCGGCGCT GGCGGCAGGC AACGCAATGA TTTTCAAACC GAGCGAAGTT ACCCCGCTTA CCGCGTTAAA GCTGGCTGAA ATTTACAGCG AAGCGGGCCT GCCGAACGGC GTATTTAACG TGTTGCCGGG CGTGGGCGCG GAGACCGGGC AATATCTGAC CGAGCATCCG GGCATTGCCA AAGTGTCATT TACCGGCGGT GTCGCCAGCG GCAAAAAAGT GATGGCTAAC TCGGCGGCCT CTTCCCTGAA AGAAGTGACC ATGGAACTGG GCGGTAAATC ACCGCTGATC GTTTTCGATG ATGCGGATCT CGATCTCGCC GCCGATATCG CCATGATGGC AAACTTCTTC AGCTCCGGTC AGGTGTGTAC CAATGGCACC CGCGTCTTCG TTCCGGCGAA ATGCAAAGCC GCATTTGAGC AGAAAATTCT GGCGCGCGTT GAGCGCATTC GCGCGGGCGA CGTTTTCGAT CCGCAAACTA ACTTCGGCCC GCTGGTCAGC TTCCCGCATC GCGATAACGT GCTGCGCTAT ATCGCCAAAG GCAAAGAGGA AGGCGCGCGC GTACTGTGCG GCGGCGATGT ACTGAAAGGC GATGGCTTCG ATAACGGCGC ATGGGTTGCA CCGACAGTGT TCACCGATTG CAGCGACGAT ATGACCATCG TGCGTGAAGA GATCTTCGGG CCAGTGATGT CCATTCTGAC CTACGAGTCG GAAGACGAAG TCATTCGCCG CGCTAACGAT ACCGACTACG GCCTGGCAGC GGGCATCGTG ACAGCGGACC TGAACCGCGC GCATCGCGTC ATTCATCAGC TGGAAGCGGG TATTTGCTGG ATCAACACCT GGGGCGAATC CCCGGCAGAG ATGCCCGTTG GCGGCTACAA ACACTCCGGC ATTGGTCGCG AGAACGGCAT GATGACGCTC CAGAGTTACA CCCAGGTGAA GTCCATCCAG GTTGAGATGG CTAAATTCCA GTCCATATTC TAA
|
Protein sequence | MSRMAEQQLY IHGGYTSATS GRTFETINPA NGNVLATVQA AGREDVDRAV KSAQQGQKIW ASMTAMERSR ILRRAVDILR ERNDELAKLE TLDTGKAYSE TSTVDIVTGA DVLEYYAGLI PALEGSQIPL RETSFVYTRR EPLGVVAGIG AWNYPIQIAL WKSAPALAAG NAMIFKPSEV TPLTALKLAE IYSEAGLPNG VFNVLPGVGA ETGQYLTEHP GIAKVSFTGG VASGKKVMAN SAASSLKEVT MELGGKSPLI VFDDADLDLA ADIAMMANFF SSGQVCTNGT RVFVPAKCKA AFEQKILARV ERIRAGDVFD PQTNFGPLVS FPHRDNVLRY IAKGKEEGAR VLCGGDVLKG DGFDNGAWVA PTVFTDCSDD MTIVREEIFG PVMSILTYES EDEVIRRAND TDYGLAAGIV TADLNRAHRV IHQLEAGICW INTWGESPAE MPVGGYKHSG IGRENGMMTL QSYTQVKSIQ VEMAKFQSIF
|
| |
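The gene and protein sequences above can be cross-checked with a minimal sketch that translates the nucleotide sequence and computes its GC percentage. It assumes the sequence is pasted as shown, in space-separated blocks of ten bases, and it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in which codons may act as starts; this gene begins with ATG, so no special start handling is needed here). The names `translate`, `gc_content`, and `clean` are hypothetical helpers, not part of the source record.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "ATGTCCCGAA TGGCAGAACA GCAGCTTTAT"
print(translate(prefix))  # MSRMAEQQLY
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` gives values to compare against the Protein sequence and GC content fields.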