Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3903 |
Symbol | |
ID | 5704976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4443215 |
End bp | 4444696 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641273328 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001538685 |
Protein GI | 159039432 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR03216] 2-hydroxymuconic semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGGCC GGGCACCCGA CGGGCCGGGC CTGTTGCGCA ACTTCGTCTC CGGCGAGTTC GTCGACTCAG GTCGCCGTTT CACCAAGCGC AGCCCGGTGA CCGGCGAGCC GGTCTTCGAG GTCGTCGAAG CAAGCGAATC CACTGTAGAT GATGCCGTGT CCGCTGCCCG TTCCGCGCTG CGCGGCCCGT GGGGTCGGCT CGGCGAGCGG GGCCGCGCCG AGGTACTGCG CCGCGTCGCC GACGAACTGG AGCGTCGATT CGACGACCTG GTCACCGCCG AGGTGGCCGA CACCGGCAAG GCCATCTCGC AGGCCCGGAC ACTGGACATC CCGCGCGGCG CGGCCAACTT CCGTGCGTTC GCCGAGATCG CAGCGACCGC ACCGGCCGAA TCGTTCACCA CGGTCACCCC GACAGGTGGC CACGCGCTGA ACTACGCGAT CCGCAAGCCG GTCGGTGTGG TCGCGGTGAT CGTGCCGTGG AACCTGCCGC TGTTGCTGCT CACCTGGAAG GTCGCGCCCG CACTGGCGTG CGGCAACGCG GTGGTGGTCA AGCCCAGTGA GGAGACGCCC GCCTCGGCGA CCCTGCTCGC CGAGGTGATG GCCGCCGCCG GTGTCCCCGC GGGTGTCTTC AACGTCGTGC ACGGCTACGG GCCCGGATCG GCGGGTGAGT TCCTCACCCG GCACCCCGAC GTGGACGCGA TCACCTTCAC CGGTGAGTCG GCCACCGGCG GCGCCATCAT GCGGGCCGCC GCCGACGGGG TGAAGGCGGT GTCCTTCGAA CTTGGTGGCA AGAACGCCGG GCTGGTCTTC GCCGACGCCG ACCTGCCGGC GGCGGTGGCC GGATCGGTGC GATCCAGCTT CACCAACGGC GGCCAGGTCT GCCTCTGTAC TGAACGCATC TACGTGCAGC GTCCGGTCTT CGCGGAGTTC ACCGCCCGGC TGGCCGAGCG GGCCGCCGAA CTGCCGTACG GATGGCCCGC GGACGAAGCC ACGGTGAACA TGCCCCTGAT CTCGACCGCC CACCGGGAAA AGGTGCTCGG CCACTACCAG CTGGCCCGTG CCGAGGGAGC GGAGGTGCGC GCCGGCGGTG GCACGCCGCA CTTCGGCGAC GCTCGTGACG GTGGCGCGTA CATCCAGCCG ACGGTGCTCA CCGGGCTCGG TTCGGACGCC CGGACCAACC GGGAGGAGAT CTTCGGGCCG GTGGTGCACG TGGCGCCATT CGACGACGAG GACGAGGCGT ACGCGCTGGC CAACGGCACC GAGTACGGGC TGGCGGCGGC CGTGTGGACC CGGGACGTGG GCCGGGCCCA CCGGGCGGGC GCTCAGCTGG ACGCGGGGAT CGTCTGGGTC AATACCTGGT ACCTGCGCGA CCTGCGTACC CCGTTCGGCG GGGTGAAGGC GTCCGGCATC GGCCGCGAGG GCGGTGTTCA CTCGCTGGGT TTCTACTCCG AGCTCACCAA CGTCTGTGTG GATCTGTCAT GA
|
Protein sequence | MAGRAPDGPG LLRNFVSGEF VDSGRRFTKR SPVTGEPVFE VVEASESTVD DAVSAARSAL RGPWGRLGER GRAEVLRRVA DELERRFDDL VTAEVADTGK AISQARTLDI PRGAANFRAF AEIAATAPAE SFTTVTPTGG HALNYAIRKP VGVVAVIVPW NLPLLLLTWK VAPALACGNA VVVKPSEETP ASATLLAEVM AAAGVPAGVF NVVHGYGPGS AGEFLTRHPD VDAITFTGES ATGGAIMRAA ADGVKAVSFE LGGKNAGLVF ADADLPAAVA GSVRSSFTNG GQVCLCTERI YVQRPVFAEF TARLAERAAE LPYGWPADEA TVNMPLISTA HREKVLGHYQ LARAEGAEVR AGGGTPHFGD ARDGGAYIQP TVLTGLGSDA RTNREEIFGP VVHVAPFDDE DEAYALANGT EYGLAAAVWT RDVGRAHRAG AQLDAGIVWV NTWYLRDLRT PFGGVKASGI GREGGVHSLG FYSELTNVCV DLS
|
| |