Gene RPD_0049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0049 
Symbol 
ID4020503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp61605 
End bp63122 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content65% 
IMG OID637960225 
Productbetaine-aldehyde dehydrogenase 
Protein accessionYP_567190 
Protein GI91974531 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGA AGACCACCAA GAAGCCGGAG CCGAGCCGGG ACGACTGGCA CGAGCGCGGC 
CTGAAGGTCA CGCCACGCAA TCAGCTCTAT ATCGATGGCG GCTGGCAGCC GGCTGCGTCC
GGCGGCACCT TTGCGTGTAT CAGCCCGATC GACGGCCGGA CCATCACCGA CGTCGCCGCC
GGCGATGCCG AGGATATCGA TCGTGCGGTC AAGGCGGCCC GCACGGCGTT CGAGAGCGGG
GTCTGGTCGC GCCAGACCCC GGCCCAGCGC AAGAAAGTGC TGTTGCAATT CGCAAAGCTG
GTGCGCGAGC ATCGCGACGA ACTCGCTTTG CTCGAAACGC TCGATGTCGG AAAGCCGATC
CGTTTCTCGC GCGCCGTGGA CATTCCTCAG GTCGAAGAGG CCATCAGCTG GACCGCCGAG
GCGATCGACA AGCTGTATGA CGAGGTCGCC CCGACGGGTG ACAAGTCGCT GGCTCTGGTG
CGCCGCGAAG CCGTCGGCGT GGTCGGCGCG GTGGTGCCGT GGAATTTTCC GCTGCTGATG
GCGGCGTGGA AGTTCGCGCC GATTCTGGCC ACCGGCAACA GCCTCGTGCT GAAGCCGGCC
GAGCAATCGC CGCTGACCGC GCTGCGGGTC GCCGAGCTTG CGACCGAAGC CGGGATTCCG
AACGGCGTCT TCAATGTCGT GACCGGCTTC GGCGAGACCG CGGGCAAGGC GCTCGGCCTG
CACCCGGATG TCGACGTGCT GGCATTCACC GGCTCGACGC AGGTCGGCAA GTACTTCCTC
GGCTATTCGG CGCAGTCGAA CATGAAGCAG GTCTGGCTCG AATGCGGTGG CAAGAGCCCG
AACATCATCT TCGATGACGT CTACGACATC GACGCCGCCG TGAAGGCCGC GGCGATGGGG
ATCTTCTTCA ACCAGGGGCA GGTCTGCAAT GCCGGCTCGC GGTTGCTGGT CCACAAGGCC
GTCAAGGCGC AGTTCATGGA GAAGCTGGTC GCCTTCACCA AGCGTATGAC GCCGGCGGAC
CCGATGGACC CGGCGACGAT TCTCGGCTCG ATCGTGAGCG TCGAACAGGC GCGCCGCATT
CTCGACTACA TCGAGATCGG CGCCGGCGAG GGCGCCAGAC TGGTAGCGGG CGGCAAGCCG
GTGCAGCCGG TCGAAGGCGG CTGCTTTATC GAACCGACCA TTTTCGATGG CGTCGGTGCG
GCGATGCGGA TCGCGCAGGA AGAGATCTTC GGCCCGGTGC TGTCGGTGAT CGAGTTCGAG
AGCGACGACG AAGCGATCAG TATCGCCAAT GACTCGATGT ACGGCCTCGC AGCCGCGGTC
TGGACGCGCG ATCTCAACCG GGCGCACCGG ATGGGCCAGC GTCTGAAAGC TGGGCTGGTC
TGGGTGAACT GCTACGACGC CGGCGACATG ACCGTGCCGT TCGGCGGCGT GAAGCAGTCT
GGCTTTGGTC GCGATCGATC TCTCCACGCT TTGGAGAAGT ACACCCAGCT CAAGACCGTC
TGGATCAATC TGCGCTGA
 
Protein sequence
MTTKTTKKPE PSRDDWHERG LKVTPRNQLY IDGGWQPAAS GGTFACISPI DGRTITDVAA 
GDAEDIDRAV KAARTAFESG VWSRQTPAQR KKVLLQFAKL VREHRDELAL LETLDVGKPI
RFSRAVDIPQ VEEAISWTAE AIDKLYDEVA PTGDKSLALV RREAVGVVGA VVPWNFPLLM
AAWKFAPILA TGNSLVLKPA EQSPLTALRV AELATEAGIP NGVFNVVTGF GETAGKALGL
HPDVDVLAFT GSTQVGKYFL GYSAQSNMKQ VWLECGGKSP NIIFDDVYDI DAAVKAAAMG
IFFNQGQVCN AGSRLLVHKA VKAQFMEKLV AFTKRMTPAD PMDPATILGS IVSVEQARRI
LDYIEIGAGE GARLVAGGKP VQPVEGGCFI EPTIFDGVGA AMRIAQEEIF GPVLSVIEFE
SDDEAISIAN DSMYGLAAAV WTRDLNRAHR MGQRLKAGLV WVNCYDAGDM TVPFGGVKQS
GFGRDRSLHA LEKYTQLKTV WINLR