Gene Information |
Locus tag | Sare_0238 |
Symbol | |
ID | 5705966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 268080 |
End bp | 269504 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641269768 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001535164 |
Protein GI | 159035911 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
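The coordinates and lengths reported above are mutually consistent, which is easy to check. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on NC_009953 and that the reported protein length excludes the stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(268080, 269504)
    print(length)                     # 1425
    print(protein_length_aa(length))  # 474
```

Both values agree with the Gene Length (1425 bp) and Protein Length (474 aa) fields.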
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.851453 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000681524 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACTCTCA CCCAGAGTAG TGGGATTCCC GATGCCGTAG CCGCTGCCCG GGGCCGCTTC GAGCGCGGCC TCACCCGCTC GCTGTCCGCC CGCCGCCGTC AGCTGCGGGC GCTGGGCGCC ATGCTCACCG AAAACGAGTC CGCCTTCGAG GCCGCGCTCT GGAGTGACCT GCGCAAGAAC CGGGCGGAGG CGCAGTTGAC CGAGATCAGC CTCGTCCTCG CCGAGATCAA CCACGCGTTA CGCCACCTGC GGCGATGGGT TCGACCAACC CGCGTACCGG TCCCGCTGGT GCTGATGCCC GCACAGGCGC GGCTGGTGCC CGAGCCACTC GGCGTCGTGC TGGTCATCGC CCCCTGGAAC TACCCAGTGC TGCTACTGCT CAGCCCGCTG GTCGGGGCAC TGGCCGCTGG CAACACGGCT GTCCTCAAAC CGAGTGAACT CGCCCCCGCC ACCTCGTCGC TGATCGCTCG GCTGGTGCCG AGCTACTTCC CCGACGGCGC CGTCCACACG GTTGAGGGTG CTGTGCCCGA GACCACCGAA CTACTGGCCC AGCGGTTCGA TCACATCGTC TTCACGGGCA GCGGAACAGT CGGGCGGATC GTGATGCGCG CCGCCGCCGA GCAGCTCACC CCGGTCACGT TGGAGCTGGG TGGCAAGTCA CCCGCCTGGT TCGACGACAG CGCCGACATC GCGGTGGCCG CCCGACGGTT GGCCTGGGCG AAGTTCACCA ATGCCGGGCA GACCTGCATC GCTCCGGACT ACGTCATGAC CACCCCGGAT CGGGTGCCCG CACTCGTGGA CGCGCTCGGC ACGGCGATCG AGGACATGTG GGGGCGCGAC CCTCACGAGA GCAGCACGTA CGGCCGCATC GTGAACGACC GCCAGTTCGA CCGCCTCGTC GGCTACCTGA CCGGCATCGA CCCGGCCATC GGCGGGACGT ACGACGGCAC TGAGCGCTAC TTCTCGCCCA CGGTGGTGAC CTTCCCGGCC ACCGACCATC CCGCAGTCGG CCCGGACGCC GCGCACCCCG TCCTACAGGA GGAGATCTTC GGCCCGATCC TGCCGATCCT TCCCGTGGCC AGCGCCGAGC AGGCCGTCCA GGTCATCAAT GGCTGGGACA AGCCGCTCGC TTTGTACGTG TTCTCGTCCT CCCCTGGCAT CCGGCGGCTG TTCGAGGAGG AAACCTCCTC CGGTGCCGTG GTCTACAACG CCGCGCTCAT CCACGCCGCC GCGACCGGGC TGCCCTTCGG GGGAGTAGGG GCGAGCGGAA TGGGCGCCTA CCACGGCAGC TACTCCTGGC GCACGTTCAG CCACTTCAAA CCGGTCCTCG AGAAGCCACT CAAGCCGGAT AGCCTGCGCC TGATACAGCC GCCGTTCGGC AAGCTGGGGA CGGCCCTCGC CCAGCACCTC ATGCGTCGGG CCTGA
|
Protein sequence | MTLTQSSGIP DAVAAARGRF ERGLTRSLSA RRRQLRALGA MLTENESAFE AALWSDLRKN RAEAQLTEIS LVLAEINHAL RHLRRWVRPT RVPVPLVLMP AQARLVPEPL GVVLVIAPWN YPVLLLLSPL VGALAAGNTA VLKPSELAPA TSSLIARLVP SYFPDGAVHT VEGAVPETTE LLAQRFDHIV FTGSGTVGRI VMRAAAEQLT PVTLELGGKS PAWFDDSADI AVAARRLAWA KFTNAGQTCI APDYVMTTPD RVPALVDALG TAIEDMWGRD PHESSTYGRI VNDRQFDRLV GYLTGIDPAI GGTYDGTERY FSPTVVTFPA TDHPAVGPDA AHPVLQEEIF GPILPILPVA SAEQAVQVIN GWDKPLALYV FSSSPGIRRL FEEETSSGAV VYNAALIHAA ATGLPFGGVG ASGMGAYHGS YSWRTFSHFK PVLEKPLKPD SLRLIQPPFG KLGTALAQHL MRRA
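The gene sequence is listed in 5' to 3' coding orientation (it begins with ATG and ends with TGA), so it can be translated directly even though the gene lies on the minus strand. The sketch below spot-checks the first 30 and last 15 bases against the protein sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). The helper names are illustrative and not part of any database tool.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGACTCTCA CCCAGAGTAG TGGGATTCCC"))  # MTLTQSSGIP
    # Last 15 bases of the gene sequence above, including the stop codon.
    print(translate("ATGCGTCGGG CCTGA"))                  # MRRA*
```

The two fragments match the start (MTLTQSSGIP) and end (MRRA) of the listed protein sequence.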