Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2442 |
Symbol | aldH1 |
ID | 5714098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2584873 |
End bp | 2586378 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641268365 |
Product | NADP-dependent fatty aldehyde dehydrogenase |
Protein accession | YP_001533777 |
Protein GI | 159044983 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTCA CCCCCCACGG AAAACACCTG ATCGCCGGCG CCTGGGTCGG GAGCGATCAG ACCTTCGCCT CCGATCCCGC CCACGGGCCC GCCCACGAAT TCTCCGTCGG CACGCCCGCG CTGGTCGATC AGGCCTGCGC TGCCGCGGAG GACGCCTTTG CCAGCTATGG CTACAGCGAC GCGGCCACCC GCGCGGCCTT CCTGAACGCC ATCGCCGACG AGATCGACGC CCGCGCCCAG ATCATCACTG GGATCGGCAC CCAGGAAACC GGCCTGCCCG AAGCCCGCCT GCAAGGGGAG CGGGGCCGAA CCACCGGGCA GCTGCGCCTC TTTGCCGAAC ATATCCTCAA GGGCGATTGC CTCGACCGGC GGCATGATCC GGCTCTGCCG GACCGCGCGC CTCTGCCGCG CCCGGACCTG AAGCTGGTCC AACGCCCCAT CGGCCCGGTC GCGGTGTTCG GCGCCTCGAA CTTCCCGCTC GCCTTCTCGG TGGCGGGCGG CGACACCGCG GCGGCGCTGG CCGCGGGCTG CCCCGTGGTG GTCAAGGGCC ACTCGGCCCA TCCCGGCACC GGGGAGATCG TGGCCGAAGC GATCCACGCG GCCATCGCCA GGACCGGGAT GCCCGCGGGG GTTTTCAGCC TGATCCAGGG CGGCAAGCGC GACGTGGGCA CAGCCCTCGT CCAACACCCG CTGATCCGCG CGGTGGGCTT CACCGGCTCG CTCGCCGGGG GGCGGGCGCT CTTCGATCTC TGCGCCGCAC GGCCCGAGCC GATCCCGTTC TTCGGCGAGC TGGGCTCGGT CAACCCGATG TTCCTGCTGC CCGAGGCGAT CGCGGCGCGT GGGGCCGAGA TCGGCGCGGG CTGGGCGGGC TCGCTGGCCA TGGGGGCGGG GCAGTTCTGC ACCAATCCCG GCATCGCGGT GGTGCTGCCG GGCGCCGACG CTTTCGTCGC CGCCGCCGAG GCCGCCCTGC GCGAGACCGC CGCGCAAACC ATGCTGACGG AGGGGATCGC CGCGGCCTAT CGCGACGGCG TCGCCCGTCT GGCCGCGCAC CCGCAAACCT CGGAACTGCT GGGCGCCCCC TGCGATGGGC GCGAGGCGCA CCCCTGTCTC TACCGTGTCG CGGCCAGGGA CTGGCTGGCC GATCACACCC TGCAAGAGGA GGTCTTCGGG CCGCTCGGCC TGGTGGTGGA GGCGCAGGAT GCGGCCGAAA TGGCCCGGAT CGCCAGGTCC CTCCAGGGCC AGCTCACCTG TACGCTCCAC ATGGAGGACG GCGACACCGA CCATGCGAGA TCCCTGGTGC CGCTGCTCGA ACGCAAGGCG GGCAGGATGC TTGTCAACGG CTTCCCCACG GGCGTCGAGG TTGCCGACAG CATGGTGCAT GGCGGGCCCT ATCCGGCCTC CACGAATTTC GGTGCGACCT CGGTCGGGAC ACTCTCGATC CGGCGATTCC TGCGGCCCGT GTGCTATCAG AACATGCCGG ACGCCCTTTT GCCTGCCGAT TACTGA
|
Protein sequence | MSFTPHGKHL IAGAWVGSDQ TFASDPAHGP AHEFSVGTPA LVDQACAAAE DAFASYGYSD AATRAAFLNA IADEIDARAQ IITGIGTQET GLPEARLQGE RGRTTGQLRL FAEHILKGDC LDRRHDPALP DRAPLPRPDL KLVQRPIGPV AVFGASNFPL AFSVAGGDTA AALAAGCPVV VKGHSAHPGT GEIVAEAIHA AIARTGMPAG VFSLIQGGKR DVGTALVQHP LIRAVGFTGS LAGGRALFDL CAARPEPIPF FGELGSVNPM FLLPEAIAAR GAEIGAGWAG SLAMGAGQFC TNPGIAVVLP GADAFVAAAE AALRETAAQT MLTEGIAAAY RDGVARLAAH PQTSELLGAP CDGREAHPCL YRVAARDWLA DHTLQEEVFG PLGLVVEAQD AAEMARIARS LQGQLTCTLH MEDGDTDHAR SLVPLLERKA GRMLVNGFPT GVEVADSMVH GGPYPASTNF GATSVGTLSI RRFLRPVCYQ NMPDALLPAD Y
|
| |