Gene Information |
Locus tag | TM1040_0096 |
Symbol | |
ID | 4078762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 100018 |
End bp | 102360 |
Gene Length | 2343 bp |
Protein Length | 780 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638005383 |
Product | aldehyde dehydrogenase |
Protein accession | YP_612091 |
Protein GI | 99079937 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
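The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the CDS includes its terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
gene_len = gene_length_bp(100018, 102360)
protein_len = protein_length_aa(gene_len)
print(gene_len, protein_len)  # 2343 780
```

Under those assumptions the computed values agree with the Gene Length (2343 bp) and Protein Length (780 aa) fields.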
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.493554 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAATCA AAGAGATCTT CGAGACCATG GACTATGGCC CCGCCCCGGA AAATGCCGCA GAAGCCCTTG CTTGGTTGGT CGACCAGGGC AGCCAGTTTG GTCACTTCAT CAACGGTGAT TTCACCGCCC CCGGCACAGT GTTCGAGAGC AAGAACCCCG CAACCGGCGA GGTTCTGGCA GAGCTGACAC AAGCCACGCA AAAGGACATT GATGCGGCTG TCAAAGCCGC CCGCGCCGCG CAAGAAGGCT GGGCCGCGAT GGGCGGTTCC GGTCGCGCCA CATACCTCTA TGCCATCGCC CGCCTCCTGC AGAAACACAG CCGCCTCTTT GCGGTGCTAG AGACGCTCGA CAACGGCAAA CCGATCCGCG AGAGCCGCGA TATCGACGTG CCGCTGGTCC AGCGCCATTT CTACCACCAC GCGGGCCACG CGCAGCTGAT GGATGACGAG ATGAGCGACC GCGCCGCACT TGGCGTTTGC GGTCAGATCA TCCCGTGGAA TTTCCCGCTC TTGATGCTCG CGTGGAAGGT CGCGCCTGCA CTCGCCATGG GTAACACGGT GGTGCTGAAA CCCGCCGAAT ATACCTCGCT CACCGCGCTG CTCTTTGCCG ACATCTGCCG TCAGGCGGGC CTGCCCAAAG GGGTCGTGAA TATCGTAACC GGCGACGGGG CCGTGGGCGA GATGATCGTC AATGCAGAGG TCGACAAGAT CGCCTTTACC GGCTCCACTT CGGTGGGGCG CAAGATCCGC GAAGCCACTG CAGGGTCCGG CAAGGCGCTG ACGCTCGAGC TTGGCGGCAA AAGCCCCTAC ATCGTCTTTG ACGACGCTGA TCTTGATAGC GCCATCGAAG GCCTCGTGGA TGCGATCTGG TTCAATCAGG GGCAGGTCTG CTGCGCGGGC TCCCGGCTCT TGGTGCAAGA GGGCGTCTCT GAGCGGTTCC ACCAGAAACT GCGCGCACGG ATGAAAACCC TGCGTCTGGG CGATCCGCTC GACAAATGCA TCGACATCGG CGCTGTGGTC GACCCCGTTC AACACGCCGA GATCAGCCGT CTGGTGGCCT CGGCCACAAA CTGCACCGTG CATCAATCGG CAGTCAACAT GCCCGCAAAG GGCTGTTTCT TCCCGCCGAC CCTGATCGAG GGCCTCTCGC CCTCTGATCC CTTGATGCAG GAAGAGATCT TTGGCCCGGT TCTGGTCTCT ACCACCTTCC GCACCCCAGC AGAGGCTGTG GAGCTCGCAA ACAACACCCG CTACGGGCTT GCAGCGACGC TCTGGACAGA GAATGTGAAC CTCGCGCTGG ATGTCGCACC CAAGCTCGTT GCCGGCGTGG TCTGGGTCAA TGGAACCAAT ATGTTTGATG CCGCTGCCCC CTTTGGCGGC GTGCGCGAGA GCGGCTTTGG CCGCGAGGGC GGAGTTGAGG GCCTGATGGC CTATACCAAG CCCAAGGCAC AGTCTGAGGC GCTCCAGCCG GTTGTGGCCT TTGAAGGCAA CGCCAAAAGC GCGACCCCGG AAGGGATTGA CCGCACCGCG AAGATGTTTG TCGGCGGCAA GCAAGCGCGC CCCGACAGCG GCTATTCCAA ACCCGTGTAT GGCCCCAAAG GCGACCTATT GGGCCACGTC GGCCTTGGCA GCCGCAAGGA TGTCAGAAAT GCCGTCGAGG CCGCTAATGC GGCCAAGGGC TGGGCCAAGA CCACCGGCCA TCTACGCGCG CAGATCCTTT ATTATCTGGC CGAGAACCTC GCCGCTCGCG CGGGGGAGTT TGCGGCCCGC ATTGACGCAA TGACCGGCAA AGGAGAGGGC 
GCCGCAGAGG TCGAGGCGTC GCTGCAACGC CTCTTCTCTG CCGCAGCCTG GGCCGACAAA TACGACGGTC TCGCCCATGG CGTGCCGATC CGTGGCGTGG CTCTTGGCAT GAAAGAACCC GTGGGCACCA TTGGCGTCCT CTGCGCCGAT GAGGCGCCGC TCCTGGGGCT CGTCTCGGCC ATGGCCCCCG CCATTGCCAT GGGCAACCGC GTGGTGCTCG CGGCCTCAGA GGCCTTTCCT CTGGCGGCTA CGGATCTTTA TCAGGTGCTC GAAACCTCTG ATGTGCCCGC TGGCGTGGTC AATATCCTCA CCGGCCCGCA CAAGGACCTC GGTGACACAA TGGCCAAGCA CCTCGATATC GACGCGGTCT GGAGTTTCTC TTCCAGCGAC CTCTCCAAGA TGATCGAGGC CGCTTCTGCC GGGAACCTCA AGCGCACCTG GGTCAACAAC GGCCACGCCT TCGATTGGTC GCGCGATCAG TCAAAGCGCT TCTTGCAGGC CGCGACAGAG GTCAAGACCG TCTGGATCCC CTACGGCGAG TGA
|
Protein sequence | MTIKEIFETM DYGPAPENAA EALAWLVDQG SQFGHFINGD FTAPGTVFES KNPATGEVLA ELTQATQKDI DAAVKAARAA QEGWAAMGGS GRATYLYAIA RLLQKHSRLF AVLETLDNGK PIRESRDIDV PLVQRHFYHH AGHAQLMDDE MSDRAALGVC GQIIPWNFPL LMLAWKVAPA LAMGNTVVLK PAEYTSLTAL LFADICRQAG LPKGVVNIVT GDGAVGEMIV NAEVDKIAFT GSTSVGRKIR EATAGSGKAL TLELGGKSPY IVFDDADLDS AIEGLVDAIW FNQGQVCCAG SRLLVQEGVS ERFHQKLRAR MKTLRLGDPL DKCIDIGAVV DPVQHAEISR LVASATNCTV HQSAVNMPAK GCFFPPTLIE GLSPSDPLMQ EEIFGPVLVS TTFRTPAEAV ELANNTRYGL AATLWTENVN LALDVAPKLV AGVVWVNGTN MFDAAAPFGG VRESGFGREG GVEGLMAYTK PKAQSEALQP VVAFEGNAKS ATPEGIDRTA KMFVGGKQAR PDSGYSKPVY GPKGDLLGHV GLGSRKDVRN AVEAANAAKG WAKTTGHLRA QILYYLAENL AARAGEFAAR IDAMTGKGEG AAEVEASLQR LFSAAAWADK YDGLAHGVPI RGVALGMKEP VGTIGVLCAD EAPLLGLVSA MAPAIAMGNR VVLAASEAFP LAATDLYQVL ETSDVPAGVV NILTGPHKDL GDTMAKHLDI DAVWSFSSSD LSKMIEAASA GNLKRTWVNN GHAFDWSRDQ SKRFLQAATE VKTVWIPYGE
|
| |
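The gene and protein sequences above are related through translation table 11. The following is a minimal sketch of that relationship, not the database's own pipeline. It assumes the gene sequence is already shown in coding orientation (it starts with ATG and ends with TGA even though the strand is "-"), so no reverse complement is applied. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Table 11 assigns the same amino acids to sense codons as the standard code and differs in the start codons it permits, which this sketch does not handle.

```python
# Codon table in NCBI order: each base position cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, dropping one trailing stop codon.

    Alternative start codons are not remapped to M; internal stops appear as '*'.
    """
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if residues and residues[-1] == "*":
        residues.pop()
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
prefix = "ATGACAATCA AAGAGATCTT CGAGACCATG"
print(translate(prefix))  # MTIKEIFETM
```

Passing the full gene sequence to `translate` should give a string that can be compared with `clean()` of the protein sequence, and `gc_percent` can be compared with the GC content field after rounding.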