Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3321 |
Symbol | |
ID | 4075726 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 329654 |
End bp | 331141 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638004829 |
Product | aldehyde dehydrogenase (acceptor) |
Protein accession | YP_611555 |
Protein GI | 99078297 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.566743 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.20798 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGATT GGAAAGCGGC GGCTTCGGCT CTGTATGATG GCGGGTTTCG CCCGATGTTC ATCAACGGAA AATGGTGTGA ATCGCAATCC GGCGAGGTGA TCGAGGCCCG TAATCCTGCG AGCGGCGCTT TGCTGGCAAC GGTGCCCAAA GGCGGCGCAG CGGATGTTGA TGCGGCTGTC GCGGCGGCGC GCGCGGCTTT TGAGGGGCCG TGGTCGAAAT GGACCCCCTT TGAGCGTCAG GCCCTGCTGC TCCGGATCGC GGATCGCTTT GAGGCAGAGT GGGAAACGCT TTGTCTGTCC GACACGCTTG ATATGGGGAT GCCGATTCAG CGCACGCTCG CCAACAGCCG TCGTGTTCTG GGGATGCTGC GCTTTTATGC GGGGCAGGCA GTCACCATCC ATGGCCACAC GATCCCCAAC TCCTTTCCGG GGGAGATCCA TTCCTCGACC GTGCGCGAAC CGGTGGGCGT GGTGGGCGCG ATCATTCCGT GGAACGCGCC GATCGCGGGT TCGATCTGGA AGATCGCGCC AGCCATCGCA ACCGGCTGCA CGGTGGTGCT GAAACCTTCC GAGGAGGCCT CTTTGACGGT GCTGATGATT GCCCGGATCA TGCAGGAAGC GGGCCTGCCC GATGGGGTTT TGAATATCGT CACCGGGTAC GGCGCCGCGG CGGGTGCGGC GCTGGCGGCG CATTCTGGTG TCGACAAGAT CGTCTTTACC GGCTCCACCG CGACAGGGCA GGCCATCGCC CGCGCGGCAA CCGGAAACCT CAAACGGGTT TCGCTGGAGC TTGGCGGCAA ATCCCCGGTG ATTGTCTGCC GGGATGCAGA CATTGAAAAA GCGGTGCCCG TCGCAGCCAT GGCGGTGTTT GCAAACTCGG GCCAGATCTG CATCGCCGGG TCGCGGCTGT TTGTGGCGCG CGAGATCCAC GACGAATTTG TGCGACGTGT TGCGGAATTC GCTGCCAATC TGCGTATTGG TCACGGCATC GAAGAGAGCA CGGACGTAGG CCCGATCATC TCTGCGCGGC AGGCAGAGCG TATTGCGGGC TATCTCGCTG CCGGCCCAAG CGAAGGTGCG GAGATCCTGA CCGGTGGCGC ACGGGTGAAA GGCGCGGGTT TTGAAGGCGG ACACTTCATC GAGCCAACTG TGTTTGGTGG CGTCACGGAC GAGATGTCCA TCGCACGCGA GGAGATCTTT GGTCCGGTGA TCTCGGCGCT GCCGTTTGAC AGTCTCGATG AGGTGGTCGA GCGGGCCAAC GCGACACCTT ATGGGTTGGC TGCTGGTGTG TTCTCGACCC ACCTCGGGAC CGCGCACAAA TTGGCACATC GCCTGAAGGC GGGATCAGTC TGGGTCAATA TGTACCACGC GATCGACCCT GCGGTGCCCT TCGGAGGCGT CAAGATGTCA GGCTACGGGC GCGAAGGCGG CACCGAGCAC ATGGAAGAAT ACCTCGATAC CAAGGCGATC TGGATCAACA CGGACTGA
|
Protein sequence | MTDWKAAASA LYDGGFRPMF INGKWCESQS GEVIEARNPA SGALLATVPK GGAADVDAAV AAARAAFEGP WSKWTPFERQ ALLLRIADRF EAEWETLCLS DTLDMGMPIQ RTLANSRRVL GMLRFYAGQA VTIHGHTIPN SFPGEIHSST VREPVGVVGA IIPWNAPIAG SIWKIAPAIA TGCTVVLKPS EEASLTVLMI ARIMQEAGLP DGVLNIVTGY GAAAGAALAA HSGVDKIVFT GSTATGQAIA RAATGNLKRV SLELGGKSPV IVCRDADIEK AVPVAAMAVF ANSGQICIAG SRLFVAREIH DEFVRRVAEF AANLRIGHGI EESTDVGPII SARQAERIAG YLAAGPSEGA EILTGGARVK GAGFEGGHFI EPTVFGGVTD EMSIAREEIF GPVISALPFD SLDEVVERAN ATPYGLAAGV FSTHLGTAHK LAHRLKAGSV WVNMYHAIDP AVPFGGVKMS GYGREGGTEH MEEYLDTKAI WINTD
|
| |