Gene Information |
Locus tag | RoseRS_3204 |
Symbol | |
ID | 5210175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 4038224 |
End bp | 4039771 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640596796 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001277515 |
Protein GI | 148657310 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
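The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(4038224, 4039771)
print(length, protein_length(length))
```

Under these assumptions the values for this record come out to 1548 bp and 515 aa, matching the Gene Length and Protein Length fields.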
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00521927 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCAACA AGATCATCGC CCCCGGCGTG GAATGGGCAC ACCTGCTGGA ACAGGCACAG ACCCTCGTTC CTGAAGCATT CAGCCACGAA GGACATCCGC TCAACCTGAT CGGCGGCGAG TGGTGCTGTC CTGGTCATCC CAAACTGTTC CTCTCACCCG TTGATGGCAC GGCTCTCGGC TACTATCCGA TGATCGACCT CGACACCGCC CGTCATGCCG TTGCCACATC TGCCGCCGAA TTCGGCGCCT GGTCGACAAC TGACCTTGCT GAACGCCAGC AGCGCGTCGA TGCATGCGTC CACCTGCTGC GTCAACATCG TGAACTCATC GGACGACTGC TGATGTGGGA GATCGGGAAA CCCTATGCTC AGGCAATGAC CGATGTTGAT CGCTGTATCA GCGGCGTCGA GTGGTATGTC GAGCAGATTC CATCCATGCT GGAAGGACGC ACGCCGCTTG GATTGATCTC AAACATCGCT TCCTGGAACT ACCCGCTGTC GGTGCTGGTG CACGCCATGC TGGTGCAGAC GCTGGCAGGC AATCCGGTGA TCGCCAAAAC GCCCAGCGAT GGCGGATTGT TTGCACTGAC CGTATCACTG GCGCTGGCGC GACGCTGCGG ACTGCCGGTC TCGCTGGTGA GCGGCTCCGG CGGGAAGCTT TCCGAAGCGC TGGTGCGCGG ACCGGAGATC GCCTGCCTGG CATTCGTCGG CGGCAAAACC AACGGACGCG ACATCGCCGC CAGCCTGTAC GACCGCGAGA AGCGCTACAT GCTCGAGATG GAAGGCATCA ACGCCTACGG CATCTGGCAG TTCACGCAGT GGGATCTGCT GGCGAAACAG TTGCGCCGCG GCTTCGATTA CGGTAAACAG CGTTGCACTG CTTATGTTCG TTTTGTGGTT CAGCGTCAAC TCTTCCCGCA GTTCCTCGAT ATGTACCTGC CGGTGCTCAA ATCGTTGCGC ATCGGCAATC CAACCCTGGT CGATCATCCA GAGGCGCCGC TGCCCACGCT CGACTTTGGA CCGCTGATCA ACAGCCGCAA GGTTGATGAA CTTCAGGTGC TGATCAGCGA AGCGATCGGC GGCGGTGCGA TCAGTCTGTA TCAGGGAACC CTCTGTGCAG ACGATTTCCT GCCGAACCAG GATATTTCCG CCTATATGGC GCCTGTTTCG CTGCTGAATG TTCCGCGCAG CGCGCGTCTG TACCACAATG AGCCGTTCGG TCCGGTCGAC ACGATTGTGG TGGTTGATAG CGTGGAAGAG TTGATCAACG AGATGAACGT CTCCAACGGC TGCCTGGTGG CGTCGGTCGC CTGCGACAAT CAGCGTCTGG CACAGCAGAT CGCTGCCGAA GTGCGCGCCT TCAAGGTCGG CATCAACACC ATCCGCTCAC GCGGCGATCG GGACGAAGTG TTCGGCGGCA TCGGGCAAAG CTGGAAGGGA TGCTTCGTCG GCGGTAAGTA TCTGGTGCAG GCAGTCACTG TCGGTCCTCC TGGCGAGCGG TTGTACGGCA ACTTCCCCGA TTACACGCTG CTGCCGGAGA AGCGGTGA
|
Protein sequence | MSNKIIAPGV EWAHLLEQAQ TLVPEAFSHE GHPLNLIGGE WCCPGHPKLF LSPVDGTALG YYPMIDLDTA RHAVATSAAE FGAWSTTDLA ERQQRVDACV HLLRQHRELI GRLLMWEIGK PYAQAMTDVD RCISGVEWYV EQIPSMLEGR TPLGLISNIA SWNYPLSVLV HAMLVQTLAG NPVIAKTPSD GGLFALTVSL ALARRCGLPV SLVSGSGGKL SEALVRGPEI ACLAFVGGKT NGRDIAASLY DREKRYMLEM EGINAYGIWQ FTQWDLLAKQ LRRGFDYGKQ RCTAYVRFVV QRQLFPQFLD MYLPVLKSLR IGNPTLVDHP EAPLPTLDFG PLINSRKVDE LQVLISEAIG GGAISLYQGT LCADDFLPNQ DISAYMAPVS LLNVPRSARL YHNEPFGPVD TIVVVDSVEE LINEMNVSNG CLVASVACDN QRLAQQIAAE VRAFKVGINT IRSRGDRDEV FGGIGQSWKG CFVGGKYLVQ AVTVGPPGER LYGNFPDYTL LPEKR
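The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the helpers below strip that spacing, compute GC content, and translate the nucleotide sequence with the codon assignments of translation table 11 (which are the same as the standard code; only the set of permitted start codons differs). The names `clean`, `gc_percent`, and `translate` are hypothetical, and the translation assumes a plain ATG start with no alternative start codon handling.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, in the usual TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Trailing bases that do not fill a codon are ignored. Alternative start
    codons (for example GTG or TTG read as Met) are not handled here.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence from the record.
print(translate("ATGAGCAACA AGATCATCGC"))
```

Passing the full Gene sequence field to `translate` and comparing the result with `clean` applied to the Protein sequence field is one way to confirm that the two fields agree.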