Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0515 |
Symbol | |
ID | 4077221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 541096 |
End bp | 542631 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638005811 |
Product | aldehyde dehydrogenase |
Protein accession | YP_612510 |
Protein GI | 99080356 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.423959 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.408285 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAAAG AAATGATCGA CATCACGGGC CGTGTGGCGA CCCTGTTGCG CGACCTCACA GGCACGTCGG AGGTCCGCTC ATATGTCGGG GGGGAACTGG TTCAGGGGAG CGGCGCTGAA CTGGAGCTTA CCGATCCGGC CAGCGGAGTG GTCTTTGCCC GGTATCGCGA TGCAGGTCCC GATGTGGTTG CGCGTGCGGC AGAGGCCGCG CAGGTGGCAC AGAAAATCTG GTATGCCAAA ACCGCGTCAG AGCGTGGGCG CATCCTCTTT GAGATCGGGC GCCAGATCCG CCAGCATGCC GCCGCGCTCT CGGAGCTTGA GGCGTTGTCG TCAGGCCGTC CCATGCGAGA CACAGGCGGT GAGCCTGCCC GAATGGCGGA GATGTTTGAA TACTATGCAG GCTGGTGTGA CAAGATCACC GGTGATGTGA TCCCGGTGCC TTCGAGCCAT TTGAACTACA CTCGCCAGGA ACCGCTCGGC GTGGTGGCGC AGATCACGCC CTGGAACGCT CCACTCTTTA CCTGTTGCTG GCAGGTGGCG CCCGCGATCT GCGCGGGCAA TGCGGTCATG ATCAAGCCGT CGGAACTGAC GCCGCTTACT TCGGTGGTGA TCGGGATCTT GTGTGAAAAA GCGGGCGCTC CCAAGGGATT GGTCAATGTG ATCGCGGGGG ATGGGCCCGG GTCGGGGCAA GCGATGATCG CGCACCCGGA GACCGCGCTT GTGGTCTTTG TTGGGTCGGC AGAAGCAGGC AGCAAGATCG CCGCCGCGGC TGCTAAACGC CTGATCCCTT CAGTGTTGGA ACTTGGTGGC AAGTCCGCCA ATATCGTTTT TGACGACGCC GATATTGACC GCGCCGTGGT GGGCGCGCAG GCAGCGATCT TTGCCTCTTG TGGCCAAAGT TGCGTGGCGG GGTCACGCCT CTTGGTGCAT CGCTCCGTAC AGACTGAGGT GGTGGAGAAA CTCTCTGCTG CTGCAGCGCG TATCCCCGTG GGTGACCCGA TGGATCCTGC GACACAGGTC GGGCCCGTCA ACAACCTGCG CCAATGGAAC AAGATAGACA CAATGGTCCA AGCCGCGACC CGTGCCGGGG CCAGTGTCGC CAGCGGTGGT GGCAAGCCTG CAGCGCTCGC GGCATCGGGC GGGTTCTTTT ACGCGCCTAC GGTGCTAGAT GCAGTCACGC CGCAGATGGA GATCGCCAAT GAGGAAGTCT TTGGCCCGGT GGTTTCGGTG CTGCCCTTCG ACGATGAGGA GGAGGCCATT CAGCTTGCGA ATGCGACACC CTATGGCTTG GCCGGGGCGG TCTGGACCCG AGACGTGGGG CGGGCACACC GGGTCGCGGG CGCTGTACGC GCGGGCACCT TCTGGATCAA CAGCTACAAG ACCATCAATG TGATGTCGCC TTTTGGCGGG TTCGGGCGCT CTGGCTATGG ACGTTCCTCC GGGCGCGAGG CGCTCTCGGC TTACACGCAG ACCAAATCCG TATGGGTCGA AACAGCTGAA AACCCGGCCC AAGGCTTTGG CTACGCGCCG GGCTGA
|
Protein sequence | MTKEMIDITG RVATLLRDLT GTSEVRSYVG GELVQGSGAE LELTDPASGV VFARYRDAGP DVVARAAEAA QVAQKIWYAK TASERGRILF EIGRQIRQHA AALSELEALS SGRPMRDTGG EPARMAEMFE YYAGWCDKIT GDVIPVPSSH LNYTRQEPLG VVAQITPWNA PLFTCCWQVA PAICAGNAVM IKPSELTPLT SVVIGILCEK AGAPKGLVNV IAGDGPGSGQ AMIAHPETAL VVFVGSAEAG SKIAAAAAKR LIPSVLELGG KSANIVFDDA DIDRAVVGAQ AAIFASCGQS CVAGSRLLVH RSVQTEVVEK LSAAAARIPV GDPMDPATQV GPVNNLRQWN KIDTMVQAAT RAGASVASGG GKPAALAASG GFFYAPTVLD AVTPQMEIAN EEVFGPVVSV LPFDDEEEAI QLANATPYGL AGAVWTRDVG RAHRVAGAVR AGTFWINSYK TINVMSPFGG FGRSGYGRSS GREALSAYTQ TKSVWVETAE NPAQGFGYAP G
|
| |