Gene RPB_1749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1749 
Symbol 
ID3909736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1998993 
End bp2000183 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content63% 
IMG OID637883643 
ProductAcyl-CoA dehydrogenase-like 
Protein accessionYP_485368 
Protein GI86748872 
COG category[I] Lipid transport and metabolism 
COG ID[COG1960] Acyl-CoA dehydrogenases 
TIGRFAM ID[TIGR03204] pimeloyl-CoA dehydrogenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0172136 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCTGA ACTTCAGCAA GGAAGAAATC GCGTTTCGCG ACGAGGTGCG GCAGTTCTTC 
AAGGACAACG TGCCGGCGAA GACGCGGCAG AAGCTGGTCG AGGGCCGGCA TCCGTCGAAG
GACGACATGG TGGAGTGGTA TCGCATCCTC CACAAGAAGG GCTGGGGCGT CACCCACTGG
CCGAAGGAAT ATGGCGGCAC CGGTTGGAGC AGCGTGCAGC ACTACATCTT CAACGAGGAG
CTGCAGGCCG CCCCCGCGCC GCAGCCGCTC GCTTTCGGCG TGTCGATGGT GGGACCTGTG
ATCTACACTT TCGGCAGTGA AGAGCAGAAG AAGCGCTTCC TGCCGCGCAT CGCCAGCGTC
GAGGATTGGT GGTGCCAGGG CTTCTCCGAG CCCGGCTCCG GCTCCGATCT CGCCTCGCTC
AAGACCAAGG CCGAGAAGCG CGGCGACAAG TGGATCATCA ACGGCCAGAA GACCTGGACC
ACGCTGGCGC AATACGCCGA CTGGATCTTC TGCCTGTGCC GCACCGACCC CGCCGCCAAG
AAGCAGTCGG GCATTTCCTT CATCCTGGTC GACATGAAGA CCAAGGGCAT CACGGTGCGC
CCGATCCAGA CCATCGACGG CGGCAAGGAA GTCAACGAAG TGTTCTTCGA CGACGTCGAG
GTGCCGCTCG AAAATCTGGT CGGCGAGGAG AACAAGGGCT GGGACTACGC CAAATTCCTG
CTCGGCAACG AGCGCACCGG CATTGCCCGG GTCGGCATGT CGAAGGAGCG CATCCGCCGC
ATCAAGGAAC TGGCCGCCTC GGTCGAATCC GGCGGCAAGC CGGTGATCGA GAACCCCAAG
TTCCGCGACA AGCTCGCCGC GGTCGAGATC GAGCTGAAGG CGCTCGAACT GACGCAGCTC
CGCGTCGTCG CCGATGAAGG CAAGCACGGC AAGGGCAAGC CGAACCCGGC GTCGTCGGTG
CTGAAGATCA AGGGCTCCGA GATCCAGCAG GCGACCACCG AGCTGTTGAT GGAAGTGATC
GGCCCGTTCG CCGCGCCCTA CGATGTGCAC GGCGACGACG ACAGCAACGA GACGATGGAC
TGGACCGCGC AGATCGCGCC GAGCTACTTC AACAACCGCA AGGTGTCGAT CTACGGCGGC
TCGAACGAGA TCCAGCGCAA CATCATCACC AAGGCGGTGC TCGGGCTGTA A
 
Protein sequence
MDLNFSKEEI AFRDEVRQFF KDNVPAKTRQ KLVEGRHPSK DDMVEWYRIL HKKGWGVTHW 
PKEYGGTGWS SVQHYIFNEE LQAAPAPQPL AFGVSMVGPV IYTFGSEEQK KRFLPRIASV
EDWWCQGFSE PGSGSDLASL KTKAEKRGDK WIINGQKTWT TLAQYADWIF CLCRTDPAAK
KQSGISFILV DMKTKGITVR PIQTIDGGKE VNEVFFDDVE VPLENLVGEE NKGWDYAKFL
LGNERTGIAR VGMSKERIRR IKELAASVES GGKPVIENPK FRDKLAAVEI ELKALELTQL
RVVADEGKHG KGKPNPASSV LKIKGSEIQQ ATTELLMEVI GPFAAPYDVH GDDDSNETMD
WTAQIAPSYF NNRKVSIYGG SNEIQRNIIT KAVLGL