Gene Information |
Locus tag | Dshi_1095 |
Symbol | |
ID | 5711063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 1122494 |
End bp | 1124014 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641267006 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001532438 |
Protein GI | 159043644 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
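Note: the Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent. A minimal sketch of the arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length (the function names are hypothetical, not part of any database API):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one stop codon at the end."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(1122494, 1124014)
print(length)                     # 1521
print(protein_length_aa(length))  # 506
```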
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.298073 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGAGA TGACACAGGT CGACGACACC CTCGTCTCGC CGTTCAAGGA CCGCTACGAC AACTTCATCG GCGGTAAGTT CGTGCCCCCG GTCGAGGGAC GCTATTTCGA CAATGTCACA CCGATCACCG GCGAAGTCGT GGGTCAGATC GCGCGCTCTT CGGCGGCGGA TGTGGAACTG GCGCTGGACG CGGCCCACGC GGCCAAGGAT GCCTGGGGCA AGACCTCGGT CACCGAGCGC GCCAATATCG TTCTGAAGAT CGCCGACCGG ATCGAGGAAA ACCTCGACAT CATCGCCAAG GCGGAAACCT GGGACAACGG CAAGCCGATC CGTGAGACGA CCCTGGCTGA CATCCCCCTC GCCGTGGATC ACTTCCGCTA TTTCGCGGGT GTGCTGCGGG GCCAGGAAGG CTCCATGTCC GAGATCGACA ACGATACAGT GGCTTATCAC TTCCACGAGC CGCTCGGCGT CGTGGGCCAG ATCATCCCCT GGAACTTCTC GATCCTGATG GCGGCCTGGA AACTGGCGCC CGCGATTGCC GCCGGCAACT GCATCGTGCT GAAGCCGGCC GAGCAGACCC CCGCCGCGAT CATGGTTCTG GTAGAACTGA TTTCCGACCT GCTGCCGGCG GGTGTTCTGA ACATCGTCAA TGGCTATGGC GGTGAAGTGG GCGCGGCGCT GGCGACCTCC GACCGGATTG CCAAGATCGC CTTTACCGGG TCCACCGCCA CGGGCCGCAA GATCATGGAG GCCGCCACGG TCAACCTGAT CCCAGTCACG CTGGAGCTGG GTGGCAAGTC GCCGAACATC TTCTTCAAGG ACGTGATGGC CGAGGATGAC GCCTTCCTCG ACAAGGCGGT CGAAGGCTTC GTTCTGTTCG CGTTCAACCA GGGCGAGGTC TGCACCTGCC CGAGCCGGGC GCTGATCCAC GAGGACATCT ATGAAGAGTT CATCGCGCGC GCCATCGCTC GGGTGAAGGC CATCGTGCAG GGCGACCCGC GCAAGATGGA AACCATGGTC GGGGCCCAGG CGTCGAAAGA GCAGAAGGAC AAGATCCTGT CCTACTTCCA GATCGGTGTC GAAGAGGGGG CCGAAGTGCT CACCGGCGGC AAGGTCGCGG ATGTGTCCGA CGATCTGAAG GATGGCTTCT ACATCGAGCC GACCATCCTG AAGGGCCACA ACAAGATGCG TGTCTTCCAG GAAGAGATCT TCGGTCCGGT CGTGTCCGTG ACGACCTTCA AGACCGAAGA GGAAGCGTTG GAGCTGGCCA ATGACACCAT GTACGGCCTC GGCGCCGGGG TGTGGTCGCG GGATCAGAAC ACCTGCTACC GCTTCGGGCG CGGTGTCCAG GCGGGGCGCG TCTGGGTCAA CAACTACCAC GCCTATCCGG CTCACGCGGC CTTTGGCGGG TACAAGCAGT CCGGCATCGG GCGTGAGAAC CACAAGATGA TGCTCGACCA TTACCAGCAG ACCAAGAACA TGCTGGTCAG CTACAATCCC AACAAGCTCG GCTTCTTCTG A
|
Protein sequence | MNEMTQVDDT LVSPFKDRYD NFIGGKFVPP VEGRYFDNVT PITGEVVGQI ARSSAADVEL ALDAAHAAKD AWGKTSVTER ANIVLKIADR IEENLDIIAK AETWDNGKPI RETTLADIPL AVDHFRYFAG VLRGQEGSMS EIDNDTVAYH FHEPLGVVGQ IIPWNFSILM AAWKLAPAIA AGNCIVLKPA EQTPAAIMVL VELISDLLPA GVLNIVNGYG GEVGAALATS DRIAKIAFTG STATGRKIME AATVNLIPVT LELGGKSPNI FFKDVMAEDD AFLDKAVEGF VLFAFNQGEV CTCPSRALIH EDIYEEFIAR AIARVKAIVQ GDPRKMETMV GAQASKEQKD KILSYFQIGV EEGAEVLTGG KVADVSDDLK DGFYIEPTIL KGHNKMRVFQ EEIFGPVVSV TTFKTEEEAL ELANDTMYGL GAGVWSRDQN TCYRFGRGVQ AGRVWVNNYH AYPAHAAFGG YKQSGIGREN HKMMLDHYQQ TKNMLVSYNP NKLGFF
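Note: both sequences above are printed in space-separated blocks of ten characters. The following is a rough sketch of how the gene sequence could be cleaned and sanity-checked against the GC content and CDS fields; the helper names are hypothetical, and the start-codon check is a simplification that accepts only ATG, although translation table 11 also permits alternative start codons such as GTG and TTG.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups residues into blocks of ten."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_cds(seq: str, stop_codons=("TAA", "TAG", "TGA")) -> bool:
    """Simplified check: whole codons, ATG start, recognised stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stop_codons


# Toy input for illustration; paste the full Gene sequence field to check the record.
toy = clean_sequence("ATGGCC TGA")
print(toy)                  # ATGGCCTGA
print(looks_like_cds(toy))  # True
print(gc_fraction("ATGC"))  # 0.5
```

Applied to the full gene sequence, `round(gc_fraction(seq) * 100)` can be compared with the GC content field, and `len(seq)` with the Gene Length field.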