Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3176 |
Symbol | |
ID | 4899158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 203759 |
End bp | 205246 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640113778 |
Product | betaine-aldehyde dehydrogenase |
Protein accession | YP_001045048 |
Protein GI | 126463935 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.762089 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCAAG AGATCGACTG GCACGCCCGC GCCAAAGCCG TGGCCCTTCC GAACCGTCCC TTCATCGACG GCCAGTATGT CGACAGCCTG TCGGGCGAGA CCTTCGCCTG CATCTATCCC GGCGATGGGC GGCCGCTGAC CGATGTGGCC TCGTGCAATG CGGCGGATGT GGATGTCGCG GTGCGGTCGG CCCGCCGGGC CTTCGAGGCG GGGGTCTGGT CGCGGATGGC CCCGGCCGAC CGGCGCCGCA TCCTGCTGCG CTTCTCCGAG CTGATCCTCG CGAACCGCGA GGAGCTGGCG CTCCTCGAGA CGCTGAACGT GGGCAAGCCC ATCGCCAATG CCTACAACGG CGACGTGGTG AGCGCTGCCG CCTGCATCGC CTGGTATGCC GAGGCGATCG ACAAGGTCTA TGGCGAGGTG GCGGCCACGG CGCACGACAT GACGACGCTC GTCATGCGCG AGCCCTTGGG GATCGTGGCG GCCGTGGTGC CGTGGAACTA TCCGATGTCG ATGGCCGCCT GGAAGCTCGG GCCGGCGCTC GCCACCGGCA ACTCGGTGAT CCTGAAGCCC GCCGAACAAT CGCCCTTCAC CGCGCTGAAG TTCGGCGAAC TCGCCATCGA GGCGGGGATG CCTCCGGGGG TGCTGAACGT GGTACCGGGC CTCGGCCATA TCGCGGGCAA GGCGCTCGGG CTGCATATGG ATGTCGATTG TGTGGGCTTC ACCGGCTCGA CCGAAGTCGG CAAGTACTTC ATGCAATATT CGGGCCAGTC GAACATCAAG CGCATCGGGC TCGAACTCGG CGGCAAGTCG CCCCAGGTCG TGCTCGCGGA TTGCGACGAT CTCGATGCGG CGGCGGCCGG GATTGCGGCC GGCATCTTCG CCAACACCGG GCAGGTCTGC AACGCGGGCT CGCGCCTGAT CGTGGACGAG AAGATCCACG ACCAGCTCCT CGAGAAGATC GCGGCTCAGG CCAAGGTCTT CGCGCCCGGC GATCCGCTCG ACACCGCCAC CCGCATGGGC TCGATGGTGA GCGAAGAGCA GATGGACCGC GTGCTGGGCT ACATCGACGC CGGCCGCGCG GACGGTGCAC GGCCGGTCAT CGGCGGCGGC CGGGTGAAGA CCGAAACCGG CGGCTTCTAC ATCGAACCCA CGATCTTCGA GGGCGTGCGC AACGACATGA AGATCGCGCA GGAAGAGATC TTCGGGCCGG TGCTGTCGGC CATCCCGGTG AAGGGCTTCG ACGAGGCGAT GGCGGTGGCC AATGACACGG TCTACGGGCT TGCGGGCGCG GTCTGGACCG GGTCGGTGAA GAACGCGCAC CGGGCGGCCA AGTCGCTCCG CGCGGGCGTG GTCTGGGTCA ACTGCTTCGA CCGCGGCTCG CTCGCCGTGC CCTTCGGCGG CTTCAAGCAA TCGGGCTTCG GCCGGGACAA GTCGCTGCAT GCGATGGACA AATACACCGA CCTCAAGGCC GTCTGGTTCG CTCACTGA
|
Protein sequence | MLQEIDWHAR AKAVALPNRP FIDGQYVDSL SGETFACIYP GDGRPLTDVA SCNAADVDVA VRSARRAFEA GVWSRMAPAD RRRILLRFSE LILANREELA LLETLNVGKP IANAYNGDVV SAAACIAWYA EAIDKVYGEV AATAHDMTTL VMREPLGIVA AVVPWNYPMS MAAWKLGPAL ATGNSVILKP AEQSPFTALK FGELAIEAGM PPGVLNVVPG LGHIAGKALG LHMDVDCVGF TGSTEVGKYF MQYSGQSNIK RIGLELGGKS PQVVLADCDD LDAAAAGIAA GIFANTGQVC NAGSRLIVDE KIHDQLLEKI AAQAKVFAPG DPLDTATRMG SMVSEEQMDR VLGYIDAGRA DGARPVIGGG RVKTETGGFY IEPTIFEGVR NDMKIAQEEI FGPVLSAIPV KGFDEAMAVA NDTVYGLAGA VWTGSVKNAH RAAKSLRAGV VWVNCFDRGS LAVPFGGFKQ SGFGRDKSLH AMDKYTDLKA VWFAH
|
| |