Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1706 |
Symbol | |
ID | 3908231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1941346 |
End bp | 1942905 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637883600 |
Product | 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
Protein accession | YP_485325 |
Protein GI | 86748829 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.447223 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.661157 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGATG CCGCTCCACC GAAGACCAAA GCCGATCCGG CCGCGGCATT TCAGGCCAAT CTCGACCGCA CCGCGCCGCT GCTGAAGACG CTGAAAGCCG ACGGCATCGG CCACCTGATC GACGGCGCGA TCGTGCCATC GTCGTCCGGC GAGGTTTTCG AGACGACCTC GCCGATCGAC AATTGCGTGC TGGCGCAAGT GGCGCGCGGC ACCTCTGACG ACATCGACCG CGCGGCGCAG GCAGCCAAGC GCGCGTTTCC GGCGTGGCGC GACATGGCGG CATCGGCGCG GCGCAAATTG CTGCACAAGG TGGCGGACGC GATAGAAGCA CGCGCCGACG ACATCGCCGT GCTGGAATGC ATCGACACCG GGCAAGCACA TCGCTTCATG GCCAAGGCCG CGATCCGCGC CGCCGAGAAT TTCCGGTTCT TCGCCGACAA ATGCGCCGAG GCGCGCGACG GGCTGAACAC GCCGAGCGAC GAGCACTGGA ACGTTTCGAC CCGGGTGCCG ATCGGCCCGG TCGGCGTGAT CACGCCGTGG AATACGCCGT TCATGCTGTC GACCTGGAAG ATCGCGCCGG CGCTGGCCGC CGGCTGCACC GTCGTGCACA AGCCCGCCGA GTGGTCGCCG GTGACCGCGG ATCTGCTGTC GCGAATCTGC AAGGACGCCG GCCTGCCCGA CGGCGTGCTC AACACCGTGC AGGGCTTCGG CGAAGAAGCA GGCAAGGCGC TGACCGAACA TCCGGCGATC AAGGCGATCG CCTTCGTCGG CGAGACCGCC ACGGGTGCGG CGATCATGGC GCAGGGCGCG CCGACGCTGA AGCGGGTGCA TTTCGAACTC GGCGGCAAGA ACCCGGTGAT CGTGTTCGAC GACGCCGATC TCGACCGCGC GCTCGACGCC GTGGTGTTCA TGATCTACTC GCTCAACGGC GAGCGCTGCA CCTCGTCCAG CCGGCTGCTG ATCCAGCAAT CGATCGCCGA CGCCTTCATC GACAAGCTCG CGGCCCGCGT CCGGGCGCTC AAGGTCGGCC ATCCGCTCGA TCCGGCCACC GAAGTCGGCC CGCTGATCCA TCAGCGCCAT CTCGACAAGG TGTGCTCCTA TTTCGACATT GCGAAGAACG AAGGTGCGAC CGTCGCCGTC GGTGGCGCGC GCCACGATGG CCCCGGCGGC GGCAACTACG TTCAGCCAAC GCTGGTGACC GGCGCGCGCA GTAACATGCG GGTCGCGCAA GACGAAGTGT TCGGCCCGTT TCTCACGGTG ATCCCGTTCA AGGACGAGGC CGACGCGATC GCGATCGCCA ACGACATCCG CTACGGCCTC ACCGGCTATA TCTGGACCGG CGATATGGGC CGCGCGCTGC GCGTCGCCGA CGCGCTCGAG GCCGGCATGA TCTGGCTGAA CTCCGAAAAC GTTCGGCATC TGCCGACCCC GTTCGGCGGC ATGAAGCAAT CCGGCATCGG CCGCGACGGC GGCGACTATT CGTTCGAGTT CTACATGGAA ACCAAGCACG TCTCGCTGGC CCGCGGCACG CACAAGATTC AGAGACTGGG GGTTATGTAG
|
Protein sequence | MADAAPPKTK ADPAAAFQAN LDRTAPLLKT LKADGIGHLI DGAIVPSSSG EVFETTSPID NCVLAQVARG TSDDIDRAAQ AAKRAFPAWR DMAASARRKL LHKVADAIEA RADDIAVLEC IDTGQAHRFM AKAAIRAAEN FRFFADKCAE ARDGLNTPSD EHWNVSTRVP IGPVGVITPW NTPFMLSTWK IAPALAAGCT VVHKPAEWSP VTADLLSRIC KDAGLPDGVL NTVQGFGEEA GKALTEHPAI KAIAFVGETA TGAAIMAQGA PTLKRVHFEL GGKNPVIVFD DADLDRALDA VVFMIYSLNG ERCTSSSRLL IQQSIADAFI DKLAARVRAL KVGHPLDPAT EVGPLIHQRH LDKVCSYFDI AKNEGATVAV GGARHDGPGG GNYVQPTLVT GARSNMRVAQ DEVFGPFLTV IPFKDEADAI AIANDIRYGL TGYIWTGDMG RALRVADALE AGMIWLNSEN VRHLPTPFGG MKQSGIGRDG GDYSFEFYME TKHVSLARGT HKIQRLGVM
|
| |