Gene RPB_2104 details

Gene Information

Locus tag: RPB_2104
Symbol:
ID: 3908518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2392867
End bp: 2394081
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 69%
IMG OID: 637883997
Product: L-carnitine dehydratase/bile acid-inducible protein F
Protein accession: YP_485721
Protein GI: 86749225
COG category: [C] Energy production and conversion
COG ID: [COG1804] Predicted acyl-CoA transferases/carnitine dehydratase
TIGRFAM ID:
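
The gene and protein lengths above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(2392867, 2394081)
print(length, protein_length_aa(length))  # prints: 1215 404
```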


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0421788
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0185484
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGCCAC TCGCAGGCGT GACGATCGTC GACATGACGT CGGTGCTGAT GGGGCCTTAC 
GCGACCCAGA TGCTGGGCGA TTACGGCGCG GATGTCGTCA AGATCGAGTC CCCCGACGGC
GACGTCACGC GGCAGATCGG CCCGGCGCGG AATCCCGGCA TGGGACCGGT GTTCCTCAAT
GCCAACCGCA ACAAGCGCAG CATCTGCCTC GATCTCAAAC ACGCCGCCGG CCGCGACGCC
GCGCTGCGGC TGATTGCGGG CGCCGACGTG CTGGTGTACA ACGTGCGTCC GCAGGCGATG
GCGCGGCTGC GGCTCGGCTA TGACGAGGTC GCGGCGATCA ATCCGCGGCT GATCTATGCC
GGGCTGTTCG GCTTCGGCCA GGACGGACCC TATGCGGCCA AACCCGCTTA TGACGATCTA
ATCCAGGGCG CGACAGCGCT GCCGGCGCTG AACGCGCGTA TCGGTGACGG CACGCCGCGC
TACGTGCCTA ACGCGCTGGT CGACCGCATC GTCGGGCTCA CCGCGGTCGG CGCGATCTGC
GCCGCGCTGG TGCATCGCGA CCGCACTGGA CAGGGGCAGC GCGTCGGCGT CCCGATGTTC
GAGACGATGG CGGGCTTCGT GATGGGTGAT CATCTCGGCG GGCTCACCTA CGAGCCGCCG
CTCGATCGCG GCGGCTATGC CCGGCACCTG TCGCCGGACC GCCGGCCGTA CCAGACCGCC
GACGGCTACA TCTGCGCGAT GGTGTACAAC GACAAGCAGT GGGGCAGCTT CCTGCGCGCG
ATCGGCCGCG ACGATCTGCT GAGCGACGAG CGTTACACGT CGTTCGCCAA GCGCGCCGTG
AACATCGACG TGGTCTATGC CGAGCTGGCG CGGATCTTCC TGACGCGCAG CACGGCGGAG
TGGACGGAAC TGCTCGATGC CGCCGACGTG CCGGCGATGC GGATGCACGA TCTCGAAAGC
CTGCTCGACG ATCCGCATCT GGTCGCGACC GATTTCTTCC CCGTCGTCGA TCATCCGAGC
GAAGGCCCGA TCCGCGACAT GAGAGTCTCG GCGACCTTTG CGGCGACGCC CGTCGCGCGC
CAGCGCCTCG CGCCGCGCTT GGGCGAGCAG GGGGCGGAGG TGCTGCGCGA GGCCGGCTAC
AGCGACGACG AGATCGAGGC GCTGGCTGGA TGCGGCGCGT TGAAGCTGCC GGCGGCGGGC
AAGGTGGCGA GTTGA
 
Protein sequence
MGPLAGVTIV DMTSVLMGPY ATQMLGDYGA DVVKIESPDG DVTRQIGPAR NPGMGPVFLN 
ANRNKRSICL DLKHAAGRDA ALRLIAGADV LVYNVRPQAM ARLRLGYDEV AAINPRLIYA
GLFGFGQDGP YAAKPAYDDL IQGATALPAL NARIGDGTPR YVPNALVDRI VGLTAVGAIC
AALVHRDRTG QGQRVGVPMF ETMAGFVMGD HLGGLTYEPP LDRGGYARHL SPDRRPYQTA
DGYICAMVYN DKQWGSFLRA IGRDDLLSDE RYTSFAKRAV NIDVVYAELA RIFLTRSTAE
WTELLDAADV PAMRMHDLES LLDDPHLVAT DFFPVVDHPS EGPIRDMRVS ATFAATPVAR
QRLAPRLGEQ GAEVLREAGY SDDEIEALAG CGALKLPAAG KVAS
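
The protein sequence can be checked against the gene sequence by translating it, and the GC content can be recomputed the same way. The sketch below is a minimal, dependency-free example. It assumes the sequence is given on the coding strand and uses the standard codon assignments, which translation table 11 shares for elongation codons. Table 11's alternative start codons are not handled, which does not matter here because this gene begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are illustrative.

```python
BASES = "TCAG"
# Standard genetic code in NCBI order (first base slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-character block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGGGGCCAC TCGCAGGCGT GACGATCGTC"))  # prints: MGPLAGVTIV
```

Passing the full gene sequence to `translate` and `gc_percent` gives values to compare with the protein sequence and the GC content listed in the record.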