Gene Information |
Locus tag | Sare_2802 |
Symbol | |
ID | 5706158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3181995 |
End bp | 3183503 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641272258 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001537628 |
Protein GI | 159038375 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
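The coordinate and length fields above are mutually consistent, and the check can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3181995
END_BP = 3183503
GENE_LENGTH_BP = 1509
PROTEIN_LENGTH_AA = 502


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1509
print(expected_protein_length(GENE_LENGTH_BP))  # 502
```

Both results agree with the listed gene length (1509 bp) and protein length (502 aa).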
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.163668 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000224903 |
Fosmid hitchhiking | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGTTCTCT CGATCAAGTC ACGACGGGAC GGGGTGGGAC TGGGCCGGGG GCACCTGCTT GTCGCGGGTG CGTGGCGACC GTCGCGCGAT GGTGGGACGT GGGTGCACCT GCACCCGGCG ACGGGTGAGG AGGTGGGGGA GTTCGCGATC GCGGACCCCG CCGACGTCGA CGCTGCTGTC CGGGCTGCCC GACAGGCGTT CGACGAAGGG CCCTGGCCCC GCAGCCGGGC GAGAGAACGC ATCCGGGTCC TGCGACGCGC CGCAGACCTG ATCCGCGAGC ACTCCGACGA ACTGCTCGCG CTCCAGGCGC TCGACAACAG CGTCCCGTTG AGCTTCAGCG GTGCCTACGT GATGTCGGCC GAGTGCGCGG CCGACGTCTT CGACCACCAC GCGGGCTGGA TCGACAAGCT TGGTGGTGAG ACGCTACCGC CCTACCAAGG GGGTGACCAC CTGGTATTCA CCCTGCGCGA GCCGATTGGG GTGGTGGCGG CGGTCATTCC GTGGAACGCC CCGCTCTTGT TGGCGGCGCA GAAGCTCGCG CCGGCGCTGG CCTCCGGGTG CACGGTCGTG CTGAAGCCGT CAGAGTACGC CACCTTCGCG GTACTGCGGT TGGTGCAGAT TCTCGACGAG GCGGGAGTGC CACCGGGTGT GCTCAACGTG GTGACCGGGC CCGGCGAATC GACCGGTGAG GCGTTGATCA CCCATCCGAT GGTAGACAAG ATCACCTTCA CCGGCAGTCG TGCTGTGGGT CGCCGTATCC TGCACGCCGC AGCCGACGGA ATCACCAAGG TGAGTCTGGA ACTCGGTGGG AAGAGCCCAT CGATCGTATT CGCAGACGCC GATGTCTACG CGGCGGCGGC GATGACCATG GGCACCGTCA CCGTAGGACT GTCTGGTCAG GTGTGTGTGG CCCACAGTCG GGCACTGGTC CAGCGCGAGG TTTACGACGA GTTCGTGTCG ATCGCCACCG GGGCGACCGC GCTCGCGTGC TACGGGGATC CGTTCGACGC CGAGACCACC GCCTCACCGC TGATCAACGG ACGACAGCTC GACCGGGTGC TCGGCTATGT CGCACAAGGC CAGGCGGAGG GCGCTCGCCT GGTGTGCGGG GGCGAACGGG TTGGGGGAGA GCTGGCTGCG GGCAACTTCG TGACCCCGGC GCTCTTCGCC GACGTGGCCA GCGACATGAC CATCGCCCGT GAGGAGATTT TCGGTCCCGT GCTGGGTGTG ACTCCGTTCA CCGACGAGCA GGAGGCGATA CGCCTGGCGA ACGACACCGA GTATGGACTC GCCGCCATGG TGTGGACCGC GGATGTGAAG CGGGCCATGC GCCTGACCCG AGCCGTGCGG GCGGGAACCA TCGGCGTCAA CGGCTACCAG GTGGAGCCAC ACGCGGCCTT CGGTGGATTC GGTCAGTCCG GGCTCGGACG CGAGGGCGGG CGAGGCTCGG CAGAGGCTTT CACCGAGGTG AAGACCGTGC TGGTGCCGAC CACCGAGGAG CTCATGTAG
|
Protein sequence | MVLSIKSRRD GVGLGRGHLL VAGAWRPSRD GGTWVHLHPA TGEEVGEFAI ADPADVDAAV RAARQAFDEG PWPRSRARER IRVLRRAADL IREHSDELLA LQALDNSVPL SFSGAYVMSA ECAADVFDHH AGWIDKLGGE TLPPYQGGDH LVFTLREPIG VVAAVIPWNA PLLLAAQKLA PALASGCTVV LKPSEYATFA VLRLVQILDE AGVPPGVLNV VTGPGESTGE ALITHPMVDK ITFTGSRAVG RRILHAAADG ITKVSLELGG KSPSIVFADA DVYAAAAMTM GTVTVGLSGQ VCVAHSRALV QREVYDEFVS IATGATALAC YGDPFDAETT ASPLINGRQL DRVLGYVAQG QAEGARLVCG GERVGGELAA GNFVTPALFA DVASDMTIAR EEIFGPVLGV TPFTDEQEAI RLANDTEYGL AAMVWTADVK RAMRLTRAVR AGTIGVNGYQ VEPHAAFGGF GQSGLGREGG RGSAEAFTEV KTVLVPTTEE LM
|
| |
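The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is an illustrative sketch only, under stated assumptions: the listed gene sequence is already in coding orientation (it begins with ATG and ends with TAG, although the gene lies on the minus strand), and translation table 11 assigns the same amino acids to codons as the standard code, differing only in its permitted start codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and alternative start codons are not handled.

```python
# Codon table built from the standard compact layout (base order T, C, A, G).
# Translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGGTTCTCT CGATCAAGTC ACGACGGGAC"
print(translate(first_blocks))  # MVLSIKSRRD
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.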