Gene Rcas_0041 details


Gene Information

Locus tag: Rcas_0041
Symbol:
ID: 5537499
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 54325
End bp: 55779
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 66%
IMG OID: 640892206
Product: aldehyde dehydrogenase
Protein accession: YP_001430197
Protein GI: 156740068
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
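
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon. Both are assumptions about this record's conventions rather than something the page states, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 54325
end_bp = 55779
gene_length_bp = 1455
protein_length_aa = 484

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop match protein length:", expected_protein_aa == protein_length_aa)
```

Under these assumptions, the three checks agree with the reported gene length of 1455 bp and protein length of 484 aa.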


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0209129
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACCTA TGCACAAAGA CATCAAAATC TACCGCAACT ATATCGGCGG CGCCTGGATG 
GAATCGCCGG CACGCCGGCA TGCACCCAAC ATCAACCCCG CCGACGCCAG CGATTTAATC
GGCGAAGCGC CGCTTTCCCT CAATGACGAG GCGATGGCCG CCATCGAAGT GGCGGTGCAT
GCCTTGCGGT CCTGGCGCCG GACGCCGGCG CCCGAACGTG GGGCGCTGGC GCTGCGCGCA
GCGCACTTGC TGGCGGAACG CGCGGACAAT GTCGCGCGCG CGGTGGTGCG CGAGCAGGGC
AAAACCCTTG CCGAGGCGCG TGCCGAGGTA CAGCGCGCCA TCGCATACGC GGAGTTCTGC
GGCGCTGCGG CGTTCGCCAC TGAAGGCGTC ACCGTGCCGC TGCGCGCCCC GGCGCTTGGC
TACACGCGCC GCCGTCCACT CGGCGTCGTA GCGTTGCTGA CGCCGGAATG GTCGCCGCTG
GCGCTGCCAT TTGAGCGCCT GGTGCAGGCG TTGGTGTGTG GCAATACCGT CGTGCTGAAA
CCGGCGCTGG CAACGCCGGA AACTGCGGAA TGGCTGGTGC GCTGCTTTGC CGACGCCGGC
GCGCCATCCG GCGTCGTCAA TCTGGTGCAC GGCGCCACCG ATGAGACAGG CGCTGCTCTG
ATCGATCATC CGATGGTTCG CGCGGTCTGG ATCGGCGGGT CACATGCCGA TGTCGTCGGC
GCGCGGCGGC AGGCGGAGAC GCGCACGCTG CGCTTCATGA GTGAGCAGAT GGATGTCAAC
CCGGTGATCG TGCTCGAAGA TGCCGATCTG GACCTGGCGC TGGCGGGAGT GCTCACCGGC
GCGTTCGGCA ATGCCGGTCA ATCGTACACG GCGACCAAAC GGGTGATTCT TGTGCATCCG
GTGGCGGATG CGTTCCTCGA AGAACTGGTC GCCCGCGTTT GCGCACTAAA CCTGGGCAAT
GGTCTCGATG AAGCGGTTGG GATAGGACCA TGCACCGACG AAGCGCAGAT CGAGCAGGCG
CTCGATCTGG TGCACCAGGC GGAGGCAGAG GGCGCCGAAG TGCTCTGCGG CGGCGCGCGC
GCGGAAGACG AGGCGCTGGC GCATGGCTAC TTCCTGCGCC CAACAATCGT GGATCGGGTG
CGCCCAGAGA TGCGGATCGC CCGTGAACCA GCCCTGGGAC CGGTGCTGGC AGTGACGCGC
GTCGAAAGTT TTGCCGAAGC GCTGGCACAC ACGATCCGAT CCCATGCGGT GCGCGCCGCC
GGAATCTACA CCCGCGACGG CGCGCGTATG CTGCGCTTCG TCGAGGAAAT GAACGCCCAA
TCGATCCACA TTAATGCGCC AACTACCGGC GATGAGCCGC AGATGCCGGT CAATCACGAC
TCCCTGATCG ACTTCTTCAG CGACACGAGC GCCGTTTATG TGCAGTACGG CGCCGGGAAT
GGAGGCGTTG TGTGA
 
Protein sequence
MKPMHKDIKI YRNYIGGAWM ESPARRHAPN INPADASDLI GEAPLSLNDE AMAAIEVAVH 
ALRSWRRTPA PERGALALRA AHLLAERADN VARAVVREQG KTLAEARAEV QRAIAYAEFC
GAAAFATEGV TVPLRAPALG YTRRRPLGVV ALLTPEWSPL ALPFERLVQA LVCGNTVVLK
PALATPETAE WLVRCFADAG APSGVVNLVH GATDETGAAL IDHPMVRAVW IGGSHADVVG
ARRQAETRTL RFMSEQMDVN PVIVLEDADL DLALAGVLTG AFGNAGQSYT ATKRVILVHP
VADAFLEELV ARVCALNLGN GLDEAVGIGP CTDEAQIEQA LDLVHQAEAE GAEVLCGGAR
AEDEALAHGY FLRPTIVDRV RPEMRIAREP ALGPVLAVTR VESFAEALAH TIRSHAVRAA
GIYTRDGARM LRFVEEMNAQ SIHINAPTTG DEPQMPVNHD SLIDFFSDTS AVYVQYGAGN
GGVV
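
The sequence blocks above are laid out in groups of ten characters with line breaks, so they need to be joined before any computation. The following is a minimal Python sketch of that cleanup plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample uses only the first and last lines of the gene sequence rather than the full record.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-formatted sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence above, used as a small sample.
first_line = "ATGAAACCTA TGCACAAAGA CATCAAAATC TACCGCAACT ATATCGGCGG CGCCTGGATG"
last_line = "GGAGGCGTTG TGTGA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print("bases in first line:", len(head))
print("bases in last line:", len(tail))
print("starts with ATG:", head.startswith("ATG"))
print("final codon:", tail[-3:])
print("GC fraction of first line:", round(gc_fraction(head), 2))
```

Passing the complete gene sequence through `clean_sequence` and `gc_fraction` would give a value that could be compared against the GC content field of the record.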