Gene RPB_3947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3947 
Symbol 
ID3911754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4506580 
End bp4507785 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content68% 
IMG OID637885851 
ProductL-carnitine dehydratase/bile acid-inducible protein F 
Protein accessionYP_487551 
Protein GI86751055 
COG category[C] Energy production and conversion 
COG ID[COG1804] Predicted acyl-CoA transferases/carnitine dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.292669 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGACG GCATCTTCAA GGGTCTCAAG GTGCTCGATT GCGCGAGCTT CATCGCCGCA 
CCCGCCGCCG CCACGGTGCT GTCGGATTTC GGCGCCGACG TCATCAAGAT CGAGCCGCCG
GGCGCCGGCG ATCCCTATCG CAACCTGCCG AACCTGCCGG GCTATCCGCG CAGCGAGCAC
AACTACGCGT GGATGATCGA GAGCCGCAAC AAGCGCAGCC TCGCGCTCGA CCTCGCCAAA
CCGGAAGGCA AAGAGGTGTT GCGCAAGCTC GTCGCCGAGG CCGACGTGTT CATCACCAAT
TTCCCGCCGC AGGTGCGGGC GCGGCTCGGC ATCGCCTATG CGGATCTGGC GCCGCTCAAC
GCGCGGCTGA TCTATGCGAG CTTCACCGGC TATGGCGAGC GCGGCGAGGA GGCCAACAAG
CCGGGCTTCG ACAGCAACGC GTGGTGGGCG CGCTCCGGCA TGATGGATCT GGTCCGCGCC
GATGAAGACA CCACGCCGGC GCGCTCGGTC GCCGGGATGG GCGACCATCC CTGCGCGATG
GCGCTGTACG GCGCGATCGT CACCGCACTG TACAAGCGCG AACGCAGCGG CAAGGGCAGC
GAGGTGAAGT CCAATCTGAT GGCCAATGGC GTGTGGTCGT CGAGCGTGCT GGCGCAGGCC
AAGCTGGTCG GCGCGCAGTT CCAGCCGCGG ATGCCGCGCG AGCGCGCGCT CAACGCGGTG
GCCAATCATT ATCGCTGCCG CGACGGCCGC TGGCTGATCC TGTCGCTGCT CAACGAAGAG
AAGCAGTGGC CGACGCTGAC GCGCTGCCTC GGCCGCGAGG ACCTGACCGA CGATCCGCGC
TTCGCCACCA CGCCCGACCG CCATGCCCGC TCGGTCGAAC TGATCGCGAT CTTCGACGAG
ATCTTCGCGA CCCGCGACCT CGCCGACTGG CGCAAGGCGC TCGACGGCAG CGGGCTGGTG
TTCGGCGTCG TCGGCATCCT CGACGACATC CCGAACGATC AGCAGATGAT CGACAACGAC
GTGCTGGTGC CGTTCGAGAA CAACACCATC ATGACGATCA ACAGCCCGAT CTGGGTCGAG
GGCAGCGCCA AGACGCGGCC GCGGCTGCCG CCGGCGCTCG GCGAACACAG CGACGACGTG
CTGCGCAGCG CCGGCTACGA CGACGCCGCG ATCAGCGCGC TGCGGACATC CGGCGTGGTG
GGATAA
 
Protein sequence
MDDGIFKGLK VLDCASFIAA PAAATVLSDF GADVIKIEPP GAGDPYRNLP NLPGYPRSEH 
NYAWMIESRN KRSLALDLAK PEGKEVLRKL VAEADVFITN FPPQVRARLG IAYADLAPLN
ARLIYASFTG YGERGEEANK PGFDSNAWWA RSGMMDLVRA DEDTTPARSV AGMGDHPCAM
ALYGAIVTAL YKRERSGKGS EVKSNLMANG VWSSSVLAQA KLVGAQFQPR MPRERALNAV
ANHYRCRDGR WLILSLLNEE KQWPTLTRCL GREDLTDDPR FATTPDRHAR SVELIAIFDE
IFATRDLADW RKALDGSGLV FGVVGILDDI PNDQQMIDND VLVPFENNTI MTINSPIWVE
GSAKTRPRLP PALGEHSDDV LRSAGYDDAA ISALRTSGVV G