Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_2910 |
Symbol | |
ID | 5084510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | - |
Start bp | 2966362 |
End bp | 2967822 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640484480 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001169101 |
Protein GI | 146278942 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.922612 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.319448 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGAGA AACGCGACTT CTACATCGGC GGCCGGTGGG TGGCGCCGGC CACGCGGAGG GATTGCGAGG TCGTGGACCC TTCCACCGAA GAGGTCTGCG CGGTGATCTC GCTCGGCGAT CAGGCCGACA CGGACGCGGC CGTATCCGCG GCCAAGGCCG CCTTCGAAGG CTGGGCCGCC ACGCCCCCGG CCGAGCGGCT GCGGCTCGTG AAGGGGATCC TCGCGCAATA CGAGGCGCGC AAGGAAGAGA TGGCGCAGGC GATCAGCCTC GAGATGGGCG CACCCATCGA CCTTGCGCGC AACAGCCAGG CGCCCTGCCT GCCCTGGCAT CTGTCGAACT TCCTCAAGGC GTTCGAGGAG ATCGAGTGGG TCCGGCCGCT CGGCCCGCAC GCGCCGAACG ACCGCATCGC GCTGGAACCG ATCGGCGTCG TCGGGCTCAT CACGCCCTGG AACTGGCCGA TGAACCAGGT GACGCTGAAG GTGATCCCGG CGCTGCTGGC GGGCTGCACC TGCGTGCTGA AACCGTCCGA GGAGGCGCCG CTTTCCTCGA TGCTCTTTGC CGAGTTCGTC CATGATGCGG GCATTCCGGC GGGCGTCTTC AACCTCGTGA ACGGCGACGG CGCGGGCGTG GGCTCACAGC TCTCCTCGCA CCCCGATATC GAGATGATCT CCTTCACCGG ATCGACCCGC GCGGGCCGGG CGATCTCGAA GGCCGCGGCC GAGTCGCTCA AGCGCGTCAC GCTCGAACTT GGCGGCAAGG GTGCGAACCT GGTCTTCGCC GATGCCGACG AACGCGCGGT CGAGCGGGGC GTGAAGCATT GCTTCAACAA CTCGGGCCAG AGCTGCAACG CGCCGACCCG GATGCTGGTC GAGCGTCCGC TTTACGACCG GGCGGTCGAG ATCGCGGCCG AGGTGGCCTC GAAGACCCGC GTGGCCTCGG CGCATGAGGA GGGGCCGCAT ATCGGGCCGG TGGTGAACAA GCGCCAGTTC GAGCAGATCC AGTCCTACAT CCAGAAGGGC ATCGACGAGG GCGCGCGGCT GGTGGCGGGC GGCCTCGGCC GGCCCGACGG GCTGAACCGC GGCTTCTTCG TGCGCCCCAC GGTCTTTGCC GACGTGACCC CCGGCATGAC CATCGAACGC GAGGAGATCT TCGGGCCGGT CCTGTCGATC CTGCCGTTCG AGACCGAGGA CGAGGCGGTG CGGATCGCCA ATGACACGCC CTATGGCCTG ACCAACTATG TCCAGAGCCA GGACGGGGCG CGGCGCAACC GCCTCGCCCG GCGCCTGCGT TCGGGCATGG TCGAGATGAA CGGCAAATCG CGCGGCGCAG GCGCGCCCTT CGGCGGCGTC AAGGCCTCGG GCCGGGCGCG GGAAGGCGGG CTCTGGGGGA TCGAGGAGTT CCTCGAGGTC AAGGCCATCT CGGGCTGGGA CCCCGAGGCC GAGGCGCTGG CCGCGGAGTG A
|
Protein sequence | MIEKRDFYIG GRWVAPATRR DCEVVDPSTE EVCAVISLGD QADTDAAVSA AKAAFEGWAA TPPAERLRLV KGILAQYEAR KEEMAQAISL EMGAPIDLAR NSQAPCLPWH LSNFLKAFEE IEWVRPLGPH APNDRIALEP IGVVGLITPW NWPMNQVTLK VIPALLAGCT CVLKPSEEAP LSSMLFAEFV HDAGIPAGVF NLVNGDGAGV GSQLSSHPDI EMISFTGSTR AGRAISKAAA ESLKRVTLEL GGKGANLVFA DADERAVERG VKHCFNNSGQ SCNAPTRMLV ERPLYDRAVE IAAEVASKTR VASAHEEGPH IGPVVNKRQF EQIQSYIQKG IDEGARLVAG GLGRPDGLNR GFFVRPTVFA DVTPGMTIER EEIFGPVLSI LPFETEDEAV RIANDTPYGL TNYVQSQDGA RRNRLARRLR SGMVEMNGKS RGAGAPFGGV KASGRAREGG LWGIEEFLEV KAISGWDPEA EALAAE
|
| |