Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0041 |
Symbol | |
ID | 5537499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 54325 |
End bp | 55779 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640892206 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001430197 |
Protein GI | 156740068 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0209129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCTA TGCACAAAGA CATCAAAATC TACCGCAACT ATATCGGCGG CGCCTGGATG GAATCGCCGG CACGCCGGCA TGCACCCAAC ATCAACCCCG CCGACGCCAG CGATTTAATC GGCGAAGCGC CGCTTTCCCT CAATGACGAG GCGATGGCCG CCATCGAAGT GGCGGTGCAT GCCTTGCGGT CCTGGCGCCG GACGCCGGCG CCCGAACGTG GGGCGCTGGC GCTGCGCGCA GCGCACTTGC TGGCGGAACG CGCGGACAAT GTCGCGCGCG CGGTGGTGCG CGAGCAGGGC AAAACCCTTG CCGAGGCGCG TGCCGAGGTA CAGCGCGCCA TCGCATACGC GGAGTTCTGC GGCGCTGCGG CGTTCGCCAC TGAAGGCGTC ACCGTGCCGC TGCGCGCCCC GGCGCTTGGC TACACGCGCC GCCGTCCACT CGGCGTCGTA GCGTTGCTGA CGCCGGAATG GTCGCCGCTG GCGCTGCCAT TTGAGCGCCT GGTGCAGGCG TTGGTGTGTG GCAATACCGT CGTGCTGAAA CCGGCGCTGG CAACGCCGGA AACTGCGGAA TGGCTGGTGC GCTGCTTTGC CGACGCCGGC GCGCCATCCG GCGTCGTCAA TCTGGTGCAC GGCGCCACCG ATGAGACAGG CGCTGCTCTG ATCGATCATC CGATGGTTCG CGCGGTCTGG ATCGGCGGGT CACATGCCGA TGTCGTCGGC GCGCGGCGGC AGGCGGAGAC GCGCACGCTG CGCTTCATGA GTGAGCAGAT GGATGTCAAC CCGGTGATCG TGCTCGAAGA TGCCGATCTG GACCTGGCGC TGGCGGGAGT GCTCACCGGC GCGTTCGGCA ATGCCGGTCA ATCGTACACG GCGACCAAAC GGGTGATTCT TGTGCATCCG GTGGCGGATG CGTTCCTCGA AGAACTGGTC GCCCGCGTTT GCGCACTAAA CCTGGGCAAT GGTCTCGATG AAGCGGTTGG GATAGGACCA TGCACCGACG AAGCGCAGAT CGAGCAGGCG CTCGATCTGG TGCACCAGGC GGAGGCAGAG GGCGCCGAAG TGCTCTGCGG CGGCGCGCGC GCGGAAGACG AGGCGCTGGC GCATGGCTAC TTCCTGCGCC CAACAATCGT GGATCGGGTG CGCCCAGAGA TGCGGATCGC CCGTGAACCA GCCCTGGGAC CGGTGCTGGC AGTGACGCGC GTCGAAAGTT TTGCCGAAGC GCTGGCACAC ACGATCCGAT CCCATGCGGT GCGCGCCGCC GGAATCTACA CCCGCGACGG CGCGCGTATG CTGCGCTTCG TCGAGGAAAT GAACGCCCAA TCGATCCACA TTAATGCGCC AACTACCGGC GATGAGCCGC AGATGCCGGT CAATCACGAC TCCCTGATCG ACTTCTTCAG CGACACGAGC GCCGTTTATG TGCAGTACGG CGCCGGGAAT GGAGGCGTTG TGTGA
|
Protein sequence | MKPMHKDIKI YRNYIGGAWM ESPARRHAPN INPADASDLI GEAPLSLNDE AMAAIEVAVH ALRSWRRTPA PERGALALRA AHLLAERADN VARAVVREQG KTLAEARAEV QRAIAYAEFC GAAAFATEGV TVPLRAPALG YTRRRPLGVV ALLTPEWSPL ALPFERLVQA LVCGNTVVLK PALATPETAE WLVRCFADAG APSGVVNLVH GATDETGAAL IDHPMVRAVW IGGSHADVVG ARRQAETRTL RFMSEQMDVN PVIVLEDADL DLALAGVLTG AFGNAGQSYT ATKRVILVHP VADAFLEELV ARVCALNLGN GLDEAVGIGP CTDEAQIEQA LDLVHQAEAE GAEVLCGGAR AEDEALAHGY FLRPTIVDRV RPEMRIAREP ALGPVLAVTR VESFAEALAH TIRSHAVRAA GIYTRDGARM LRFVEEMNAQ SIHINAPTTG DEPQMPVNHD SLIDFFSDTS AVYVQYGAGN GGVV
|
| |