Gene Information |
Locus tag | Rcas_3504 |
Symbol | |
ID | 5541003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 4571490 |
End bp | 4572914 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640895622 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001433572 |
Protein GI | 156743443 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCTTG TTTCTGTTCG TAATCCGCGC ACCGGGCAGT ACGATTATCA GTTTCTCCCT CCCAGGCGCG ACGAGTTAGC AGATGTGTGT CGCCGCCTCC GTGATGCGCA ACCGGCATGG GAAGCGCTGG GGATTGATGC CCGTGTTGCC GTTCTTGATG ATTGGCGCCG CGCGCTGGCA GCACATCGCA GCGACATTAT CACGGCGTTG ATCGCCGATA CGGGACGCTA TTACGAAAGT GTGCTCGAGT TCGAGTCGGT CGTGTCGAGT ATCGAACGCT GGCGACGGCT GGCTCCCGAT CTCCTGCGCT ATGGAGAAAG TCGCTCCAGC GCTCTTCCGT TTATTCGTCT GGAAGGCCGC CTTGTGCCGT ACCCGCTTGT GGGAGTGATC AGCCCGTGGA ACTTTCCTCT CTTACTGAGC CTGATCGATG CTTTACCCGC GCTCTTGACC GGCTGCGCGG CGCTGATTAA GCCCAGTGAA ATCGCCCCTC GTTTTATCGA ACCGTTGCAG CGCACCATCG CTGATGTGCC TGCGTTAAGC GACGTTTTGC AGTATGTCGC CGGCGATGGC GCTACCGGCG CTGCGATGAT CGATCTGGTT GATCTGGTCT GCTTTACCGG CAGCGTACCA ACCGGTCGGC GAGTGGCAGA AGCGGCGGCG CAGCGGTTTA TCCCGGCTTT CCTCGAACTT GGCGGCAAGG ACCCGGCCAT TGTCCTGGCT GATGCCGACA TCGAACGGGC CGCTGCTGCT ATCTTGTGGG GAGGCATGGT TAATGCCGGT CAGTCGTGTC TCTCGATAGA GCGGGTGTAT GTCGAAGCGC CGGTTTTCGC ATCGTTTGTG GAAGCGCTTA CCGACCAAGC ACGGCGACTG CGCCTCGCAT TTCCCGAACC GCAAAGCGGC GAGATTGGTC CAATCATTTC GGCGCGACAG GCTGATGTCA TTGCCGATCA TCTTGCCGAT GCGTTTGCGC ACGGCGCCGT TGCGCCGTGC GGCGGTGCGC TGGTTGAGTA TGGTGGCGGC ATCTATTGCT TGCCGACAGT GCTCACGAAC GTCAATCATA CGATGAAGGT GATGCGCGAA GAGACCTTTG CTCCGATCTT GCCGGTCATG CCGGTCGCCG ATGCTGATGA GGCAGTCGCG TTGGCGAATG ACAGCCATTT TGGTCTGAGC GCCGCAGTTT TCTCCGGCAA CCTCGCCATG GCTCGCGCCA TTGCCGCCCG TTTGCATGCC GGTGCGATCA GCATTAACGA TGCGGCGCTC ACTGCACTTA TTCACGACGG TGAAAAACAG TCGTTCAAGT TCTCCGGACT TGGCGGCTCG CGCATGGGTC CAGCAGCACT GCACCGGTTT GCCCGCAAAC AGGCGCTGCT GGTGAATACC AATTCAGGAT ACGATCCATG GTGGTTTCAA AGGGAGGCGC AGTAA
Protein sequence | MALVSVRNPR TGQYDYQFLP PRRDELADVC RRLRDAQPAW EALGIDARVA VLDDWRRALA AHRSDIITAL IADTGRYYES VLEFESVVSS IERWRRLAPD LLRYGESRSS ALPFIRLEGR LVPYPLVGVI SPWNFPLLLS LIDALPALLT GCAALIKPSE IAPRFIEPLQ RTIADVPALS DVLQYVAGDG ATGAAMIDLV DLVCFTGSVP TGRRVAEAAA QRFIPAFLEL GGKDPAIVLA DADIERAAAA ILWGGMVNAG QSCLSIERVY VEAPVFASFV EALTDQARRL RLAFPEPQSG EIGPIISARQ ADVIADHLAD AFAHGAVAPC GGALVEYGGG IYCLPTVLTN VNHTMKVMRE ETFAPILPVM PVADADEAVA LANDSHFGLS AAVFSGNLAM ARAIAARLHA GAISINDAAL TALIHDGEKQ SFKFSGLGGS RMGPAALHRF ARKQALLVNT NSGYDPWWFQ REAQ
| |
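The length fields in this record follow from the coordinates and the sequence, and can be cross-checked with a short script. The sketch below is illustrative and not part of the database record: the helper names (`gene_length`, `protein_length`, `translate`) are hypothetical, and it assumes that translation table 11 assigns the same amino acids to codons as the standard table (the two differ in permitted start codons). It also assumes the gene sequence is listed in coding orientation, which is consistent with it beginning with ATG despite the minus strand.

```python
from itertools import product

# Amino-acid assignments in NCBI order (first, second, third base each cycle T, C, A, G).
# Assumption: table 11 shares these assignments with the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def gene_length(start_bp, end_bp):
    """Length in bp of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Number of residues for an unspliced CDS that ends in one stop codon."""
    return gene_len_bp // 3 - 1


def translate(dna):
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    length = gene_length(4571490, 4572914)
    print(length)                                       # 1425
    print(protein_length(length))                       # 474
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGGCGCTTG TTTCTGTTCG TAATCCGCGC"))  # MALVSVRNPR
```

The same `translate` call applied to the full gene sequence should reproduce the listed protein sequence; only the first ten codons are shown here to keep the example short.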