Gene Rcas_3504 details


Gene Information

Locus tag: Rcas_3504
Symbol:
ID: 5541003
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4571490
End bp: 4572914
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 59%
IMG OID: 640895622
Product: aldehyde dehydrogenase
Protein accession: YP_001433572
Protein GI: 156743443
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
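
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4571490, 4572914)
print(length)                  # 1425
print(protein_length(length))  # 474
```

Both values agree with the Gene Length and Protein Length fields of the record.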


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCTTG TTTCTGTTCG TAATCCGCGC ACCGGGCAGT ACGATTATCA GTTTCTCCCT 
CCCAGGCGCG ACGAGTTAGC AGATGTGTGT CGCCGCCTCC GTGATGCGCA ACCGGCATGG
GAAGCGCTGG GGATTGATGC CCGTGTTGCC GTTCTTGATG ATTGGCGCCG CGCGCTGGCA
GCACATCGCA GCGACATTAT CACGGCGTTG ATCGCCGATA CGGGACGCTA TTACGAAAGT
GTGCTCGAGT TCGAGTCGGT CGTGTCGAGT ATCGAACGCT GGCGACGGCT GGCTCCCGAT
CTCCTGCGCT ATGGAGAAAG TCGCTCCAGC GCTCTTCCGT TTATTCGTCT GGAAGGCCGC
CTTGTGCCGT ACCCGCTTGT GGGAGTGATC AGCCCGTGGA ACTTTCCTCT CTTACTGAGC
CTGATCGATG CTTTACCCGC GCTCTTGACC GGCTGCGCGG CGCTGATTAA GCCCAGTGAA
ATCGCCCCTC GTTTTATCGA ACCGTTGCAG CGCACCATCG CTGATGTGCC TGCGTTAAGC
GACGTTTTGC AGTATGTCGC CGGCGATGGC GCTACCGGCG CTGCGATGAT CGATCTGGTT
GATCTGGTCT GCTTTACCGG CAGCGTACCA ACCGGTCGGC GAGTGGCAGA AGCGGCGGCG
CAGCGGTTTA TCCCGGCTTT CCTCGAACTT GGCGGCAAGG ACCCGGCCAT TGTCCTGGCT
GATGCCGACA TCGAACGGGC CGCTGCTGCT ATCTTGTGGG GAGGCATGGT TAATGCCGGT
CAGTCGTGTC TCTCGATAGA GCGGGTGTAT GTCGAAGCGC CGGTTTTCGC ATCGTTTGTG
GAAGCGCTTA CCGACCAAGC ACGGCGACTG CGCCTCGCAT TTCCCGAACC GCAAAGCGGC
GAGATTGGTC CAATCATTTC GGCGCGACAG GCTGATGTCA TTGCCGATCA TCTTGCCGAT
GCGTTTGCGC ACGGCGCCGT TGCGCCGTGC GGCGGTGCGC TGGTTGAGTA TGGTGGCGGC
ATCTATTGCT TGCCGACAGT GCTCACGAAC GTCAATCATA CGATGAAGGT GATGCGCGAA
GAGACCTTTG CTCCGATCTT GCCGGTCATG CCGGTCGCCG ATGCTGATGA GGCAGTCGCG
TTGGCGAATG ACAGCCATTT TGGTCTGAGC GCCGCAGTTT TCTCCGGCAA CCTCGCCATG
GCTCGCGCCA TTGCCGCCCG TTTGCATGCC GGTGCGATCA GCATTAACGA TGCGGCGCTC
ACTGCACTTA TTCACGACGG TGAAAAACAG TCGTTCAAGT TCTCCGGACT TGGCGGCTCG
CGCATGGGTC CAGCAGCACT GCACCGGTTT GCCCGCAAAC AGGCGCTGCT GGTGAATACC
AATTCAGGAT ACGATCCATG GTGGTTTCAA AGGGAGGCGC AGTAA
 
Protein sequence
MALVSVRNPR TGQYDYQFLP PRRDELADVC RRLRDAQPAW EALGIDARVA VLDDWRRALA 
AHRSDIITAL IADTGRYYES VLEFESVVSS IERWRRLAPD LLRYGESRSS ALPFIRLEGR
LVPYPLVGVI SPWNFPLLLS LIDALPALLT GCAALIKPSE IAPRFIEPLQ RTIADVPALS
DVLQYVAGDG ATGAAMIDLV DLVCFTGSVP TGRRVAEAAA QRFIPAFLEL GGKDPAIVLA
DADIERAAAA ILWGGMVNAG QSCLSIERVY VEAPVFASFV EALTDQARRL RLAFPEPQSG
EIGPIISARQ ADVIADHLAD AFAHGAVAPC GGALVEYGGG IYCLPTVLTN VNHTMKVMRE
ETFAPILPVM PVADADEAVA LANDSHFGLS AAVFSGNLAM ARAIAARLHA GAISINDAAL
TALIHDGEKQ SFKFSGLGGS RMGPAALHRF ARKQALLVNT NSGYDPWWFQ REAQ
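
The protein sequence is the translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, not the database's own tooling. It assumes the sequence is given 5' to 3' on the coding strand and in frame. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical. Table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons, which this sketch does not handle (the gene here starts with ATG).

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G; last base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence listed above.
print(translate("ATGGCGCTTG TTTCTGTTCG TAATCCGCGC"))  # MALVSVRNPR
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence is one way to check that the two listings agree.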