Gene RPB_1706 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1706 
Symbol 
ID3908231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1941346 
End bp1942905 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content67% 
IMG OID637883600 
Product5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase 
Protein accessionYP_485325 
Protein GI86748829 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.447223 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.661157 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGATG CCGCTCCACC GAAGACCAAA GCCGATCCGG CCGCGGCATT TCAGGCCAAT 
CTCGACCGCA CCGCGCCGCT GCTGAAGACG CTGAAAGCCG ACGGCATCGG CCACCTGATC
GACGGCGCGA TCGTGCCATC GTCGTCCGGC GAGGTTTTCG AGACGACCTC GCCGATCGAC
AATTGCGTGC TGGCGCAAGT GGCGCGCGGC ACCTCTGACG ACATCGACCG CGCGGCGCAG
GCAGCCAAGC GCGCGTTTCC GGCGTGGCGC GACATGGCGG CATCGGCGCG GCGCAAATTG
CTGCACAAGG TGGCGGACGC GATAGAAGCA CGCGCCGACG ACATCGCCGT GCTGGAATGC
ATCGACACCG GGCAAGCACA TCGCTTCATG GCCAAGGCCG CGATCCGCGC CGCCGAGAAT
TTCCGGTTCT TCGCCGACAA ATGCGCCGAG GCGCGCGACG GGCTGAACAC GCCGAGCGAC
GAGCACTGGA ACGTTTCGAC CCGGGTGCCG ATCGGCCCGG TCGGCGTGAT CACGCCGTGG
AATACGCCGT TCATGCTGTC GACCTGGAAG ATCGCGCCGG CGCTGGCCGC CGGCTGCACC
GTCGTGCACA AGCCCGCCGA GTGGTCGCCG GTGACCGCGG ATCTGCTGTC GCGAATCTGC
AAGGACGCCG GCCTGCCCGA CGGCGTGCTC AACACCGTGC AGGGCTTCGG CGAAGAAGCA
GGCAAGGCGC TGACCGAACA TCCGGCGATC AAGGCGATCG CCTTCGTCGG CGAGACCGCC
ACGGGTGCGG CGATCATGGC GCAGGGCGCG CCGACGCTGA AGCGGGTGCA TTTCGAACTC
GGCGGCAAGA ACCCGGTGAT CGTGTTCGAC GACGCCGATC TCGACCGCGC GCTCGACGCC
GTGGTGTTCA TGATCTACTC GCTCAACGGC GAGCGCTGCA CCTCGTCCAG CCGGCTGCTG
ATCCAGCAAT CGATCGCCGA CGCCTTCATC GACAAGCTCG CGGCCCGCGT CCGGGCGCTC
AAGGTCGGCC ATCCGCTCGA TCCGGCCACC GAAGTCGGCC CGCTGATCCA TCAGCGCCAT
CTCGACAAGG TGTGCTCCTA TTTCGACATT GCGAAGAACG AAGGTGCGAC CGTCGCCGTC
GGTGGCGCGC GCCACGATGG CCCCGGCGGC GGCAACTACG TTCAGCCAAC GCTGGTGACC
GGCGCGCGCA GTAACATGCG GGTCGCGCAA GACGAAGTGT TCGGCCCGTT TCTCACGGTG
ATCCCGTTCA AGGACGAGGC CGACGCGATC GCGATCGCCA ACGACATCCG CTACGGCCTC
ACCGGCTATA TCTGGACCGG CGATATGGGC CGCGCGCTGC GCGTCGCCGA CGCGCTCGAG
GCCGGCATGA TCTGGCTGAA CTCCGAAAAC GTTCGGCATC TGCCGACCCC GTTCGGCGGC
ATGAAGCAAT CCGGCATCGG CCGCGACGGC GGCGACTATT CGTTCGAGTT CTACATGGAA
ACCAAGCACG TCTCGCTGGC CCGCGGCACG CACAAGATTC AGAGACTGGG GGTTATGTAG
 
Protein sequence
MADAAPPKTK ADPAAAFQAN LDRTAPLLKT LKADGIGHLI DGAIVPSSSG EVFETTSPID 
NCVLAQVARG TSDDIDRAAQ AAKRAFPAWR DMAASARRKL LHKVADAIEA RADDIAVLEC
IDTGQAHRFM AKAAIRAAEN FRFFADKCAE ARDGLNTPSD EHWNVSTRVP IGPVGVITPW
NTPFMLSTWK IAPALAAGCT VVHKPAEWSP VTADLLSRIC KDAGLPDGVL NTVQGFGEEA
GKALTEHPAI KAIAFVGETA TGAAIMAQGA PTLKRVHFEL GGKNPVIVFD DADLDRALDA
VVFMIYSLNG ERCTSSSRLL IQQSIADAFI DKLAARVRAL KVGHPLDPAT EVGPLIHQRH
LDKVCSYFDI AKNEGATVAV GGARHDGPGG GNYVQPTLVT GARSNMRVAQ DEVFGPFLTV
IPFKDEADAI AIANDIRYGL TGYIWTGDMG RALRVADALE AGMIWLNSEN VRHLPTPFGG
MKQSGIGRDG GDYSFEFYME TKHVSLARGT HKIQRLGVM