Gene Rcas_0272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0272 
Symbol 
ID5537734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp338405 
End bp339958 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content62% 
IMG OID640892436 
Productaldehyde dehydrogenase 
Protein accessionYP_001430423 
Protein GI156740294 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGACC ACATCACAGA TGATCTCCCG TGCTATGGGT TGTATATCAA TGGCAGCTGG 
GTCGATGCCG AAGGCGGCCT GATGCCGGTC AACGAACCGG CGCGTGGGCG GACGATGGCG
TATGTAGCGC GCGGCTCGGC AGCCGATGTG GATCGCGCCG TGGCAGCAGC ACGTGAAGCG
TTTGATCGCG GTCCCTGGCC CCACACTCCC GGTCACGAAC GCGCGCGCAT CCTCAATGCG
ATTGCCGACC TGATCGAAGA GCACACCGCC GAATTTGCCG AACTCGAGTC GCGCAACCTT
GGCGCCCCGC TGCGCAAGAC GACCTTTGTC GATATTCCGT GGTCGGTCGA GCATCTGCGC
GTGTTCGCCG AACTGGCGAC GATCCATCCG TATGAGGCGC TGCCGTGGAC GGATATTCCC
TCGGTGAGTT GGAATTTCGT CTGGCGTGAG CCGATCGGCG TTTGCGGGCA GATTGTGCCG
TGGAACTATC CGCTTCTCAT GACGATCTGG AAGATCGCAC CGGCGCTGGC TGCGGGGAAC
ACTGTGGTGC TCAAACCAGC GTCCTACACG CCGCTGACAG CGCTCATGCT GATGAGACTG
ATTCACGAAG CCGGGTTGCT GCCACGCGGC GTGCTGAATA TCGTGACCGG TCCGGGGGCG
GAGGTTGGTG ACGCACTGGC GCGGCATCCC GGCGTGGACA AGGTGTCGTT CACCGGATCG
ACCGAAACCG GGCGTCATAT CATGCGGTTG GCGAGCGATA CGATCAAGCG CCTGACCCTC
GAACTCGGCG GCAAGTCGCC GAGCCTGGTG ATGCCCGACG CCGATCTCGA ACTCGCCACC
GATGGCGTGC TATTCGGCGT CTTTTTCAAC GGCGGGCAGA GTTGTGAAGC AGGAACGCGC
TGCCTGGTGC CAGAAAGCCT GCACGACGAG TTTCTCGAGC GTCTGGTGAC GCGCGCCCGC
TCGCTGCGGA TCGGCGACCC GCTCGATCTG GAAACCGACC TGGGTCCGCT CGTTTCCGAA
GCGCAGTGCC GGATCGTCGA GGAGTACATC GATGTCGGCA AGCACGAAGG CGCGCGCCTG
GTGACCGGCG GCAAACGCGC ACGGATCCCC GGTTTTGAGT ATGGTCCATT CATTGAACCA
ACGATCTTCA CCGGTGTGCA GAACGGCACA CGCCTGGCGC AGGAAGAAAT CTTTGGTCCG
GTGCTATCGG TCATTCCGTA CCGGACCGTG ACGGAGGCGA TTGAACTGGC GAATGCCAGT
CGCTATGGGC TTGGCGCCGC CGTCTGGTCA CGTGATCTTC AAGGCGCGAT TGAGGTCGCC
AAGCGCATCC GCACCGGTAC TGTCTGGATC AACGACCATC ACATCATTCT GCCGCGTGCG
CCATTTGGCG GCTACAAGCA GAGCGGCATC GGGCGCGAAC ACGGTATCTA CGGCTTGATG
GCGTACACCG AACTGAAGCA CATCCATGTC GATCTGATGC AGAAGCGCAG CGGGCGCGTC
TGGTGGGACG TGCTCATCCC GCAACGCGAT CAGAGTGAGG AAGCGGGGTT GTAG
 
Protein sequence
MDDHITDDLP CYGLYINGSW VDAEGGLMPV NEPARGRTMA YVARGSAADV DRAVAAAREA 
FDRGPWPHTP GHERARILNA IADLIEEHTA EFAELESRNL GAPLRKTTFV DIPWSVEHLR
VFAELATIHP YEALPWTDIP SVSWNFVWRE PIGVCGQIVP WNYPLLMTIW KIAPALAAGN
TVVLKPASYT PLTALMLMRL IHEAGLLPRG VLNIVTGPGA EVGDALARHP GVDKVSFTGS
TETGRHIMRL ASDTIKRLTL ELGGKSPSLV MPDADLELAT DGVLFGVFFN GGQSCEAGTR
CLVPESLHDE FLERLVTRAR SLRIGDPLDL ETDLGPLVSE AQCRIVEEYI DVGKHEGARL
VTGGKRARIP GFEYGPFIEP TIFTGVQNGT RLAQEEIFGP VLSVIPYRTV TEAIELANAS
RYGLGAAVWS RDLQGAIEVA KRIRTGTVWI NDHHIILPRA PFGGYKQSGI GREHGIYGLM
AYTELKHIHV DLMQKRSGRV WWDVLIPQRD QSEEAGL