Gene Information |
Locus tag | TM1040_0847 |
Symbol | |
ID | 4076022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 898671 |
End bp | 900155 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638006145 |
Product | betaine-aldehyde dehydrogenase |
Protein accession | YP_612842 |
Protein GI | 99080688 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
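The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(898671, 900155)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length (1485 bp) and Protein Length (494 aa) fields.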
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00938434 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTTG ACGCCTTAAT CTCTACCTAT TTCGCCAACG GGCGCCTGTC GGATCTGCCG CGCGATCACT TTATCGACGG TCGCCTTGTT CCGGCTGAAA CCGGCAACCG GATGGAGAGC TTTGATCCCG GCAGAGCCAC CGCCTTTGGG GATTTTGCCG CTGGCACCGC TGTGGATGTG GACCGAGCGG TCAGCAGCGC TGTGGCGGGG TTTGCAATCT GGCGCGACAC CGCGCCGGCG ACGCGCTGTG CGGTGTTGAT GGAGGCCGCA CGCCTGATGC GCGCCGAAGC TGACTGGCTG GCTGTAATCG AGAGCCTTGA TAGCGGCAAA ACGCTGGCCG AAGCCTACGG CGACGTGCAG GGCTCGGCGC GGCTCTTTGA GTATTATGCC GGCGCGGCAG ACAAGCTCGA TGGGCGCAGC GTGAACCTCG GCAATGACAA CGCGGCCTTC ACCCTGCGCG AACCAGTCGG TGTCACGGCG CATATCGTGC CGTGGAACTA TCCCACCTCG ACGCTGGTGC GCGGGATCGC GCCGGCGCTT GCAGCGGGGT GTTCGGCGGT GGTGAAACCG GCCGAAACCA CGCCCTATAC GGCACTGATG ATCGCAGAGC TGTTGATCCG CGCCGGGCTG CCTGCGGGCG TGGTGAATGT TGTGACCGGC ACCGGACCGG AAGCAGGCGC GCCCTTGGTG CGCGATCCCC GGGTGCGCCA TGTGACCTTT ACCGGTTCGG TGCAGACCGG CGTTGGCGTG ATGCAGGCCG TCGCCCCCAA TGTCACCGGA CTGACGCTGG AACTCGGCGG CAAATCCCCG TTGGTGGCCT TTGCAGATGC AGATGCCGAG AAAGTGGCCG AGGGCGCGCT CTGGGCGATC TTTTCCAACG CGGGGCAGAT CTGTTCGGCG GGTTCACGGC TGGTGGTTCA CCGCGACCTC CATGCGCAGG TGCTCGAACG TCTGGTGAAG AAGGCCACGA CCCTGCGGTT CGGTCATGGG CTGCACAACC CGGATATGGG GGCACTCAAC TCCGAACGCC ATCAGGCGGC AATCAGCGGC CACGTTGAGC GCGCGCGCGC CCGAGGCGTC GAAATCCTTT GCGGCGGTCA GGCAACCACA GACCCTGCCA CGGGCAAGGG TTGGTTCTTT GAACCGACAA TTCTGGACGC GCTCTCTGCG GATGACCCGG CCATCCAGCA GGAAATCTTC GGTCCCGTGC TCGGCGTGCA GGTTTTCGAC GACGAAGACG AAGCACTTGC TCTGGCCAAT GGTACAGAGT TTGCGCTGGC GGCGGGGGTC TACACCAAGG ACACTTCAAC CGCCCTGCGC ATGGCGCGCC GCATTGATGC GGGGCAGGTG ACGGTGAACG ACTATTGGGC AGGGGGCATC GAACTGCCCT TCGGCGGCAA TCGCAAGTCT GGCTTTGGTC GCGAAAAAGG GCTGGAAGGT GTTGATGCCT ACACCCGCGC CAAGGCAATC ACCCTCGCGG TATAA
|
Protein sequence | MSVDALISTY FANGRLSDLP RDHFIDGRLV PAETGNRMES FDPGRATAFG DFAAGTAVDV DRAVSSAVAG FAIWRDTAPA TRCAVLMEAA RLMRAEADWL AVIESLDSGK TLAEAYGDVQ GSARLFEYYA GAADKLDGRS VNLGNDNAAF TLREPVGVTA HIVPWNYPTS TLVRGIAPAL AAGCSAVVKP AETTPYTALM IAELLIRAGL PAGVVNVVTG TGPEAGAPLV RDPRVRHVTF TGSVQTGVGV MQAVAPNVTG LTLELGGKSP LVAFADADAE KVAEGALWAI FSNAGQICSA GSRLVVHRDL HAQVLERLVK KATTLRFGHG LHNPDMGALN SERHQAAISG HVERARARGV EILCGGQATT DPATGKGWFF EPTILDALSA DDPAIQQEIF GPVLGVQVFD DEDEALALAN GTEFALAAGV YTKDTSTALR MARRIDAGQV TVNDYWAGGI ELPFGGNRKS GFGREKGLEG VDAYTRAKAI TLAV
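The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the gene sequence relates to the protein sequence and the GC content field, the code below strips the block spacing, computes the G+C fraction, and translates codons until the first stop. It is not the database's own method. It assumes the amino acid assignments of translation table 11 match the standard genetic code (the tables differ in their permitted start codons), and all names here are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
PREFIX = "ATGAGCGTTG ACGCCTTAAT CTCTACCTAT"
print(translate(PREFIX))  # MSVDALISTY, the first block of the protein sequence
```

Passing the full gene sequence to `gc_fraction` and `translate` should allow a comparison with the GC content and Protein sequence fields.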