Gene Rpal_5047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5047 
Symbol 
ID6412741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5429113 
End bp5430528 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content67% 
IMG OID642714932 
Productbifunctional enoyl-CoA hydratase/phosphate acetyltransferase 
Protein accessionYP_001994011 
Protein GI192293406 
COG category[C] Energy production and conversion
[I] Lipid transport and metabolism 
COG ID[COG0280] Phosphotransacetylase
[COG2030] Acyl dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCAGA TCCAAAACCG CACCTTTGAT GAGATTGAGG TCGGTGACAC CGCCAGTCTG 
GTTCGCACGC TGACCTATCG CGACATCGAG GTGTTCGCGG TGATGTCCGG CGACGTCAAC
CCGATGCATG TCGACGCGGC GTTCGCCAAG AGCGACATGT TCCATCAGGT GGTGGCGCAC
GGCATGTGGG GCGGGGCGCT GATCTCGACT CTACTCGGCA CGCAATTGCC CGGGCCCGGC
ACGATCTATC TCGATCAATC GCTGCGGTTC GCAAGGCCCG TGCTGCTCGG CGACACCGTG
ACCGTCACGG TCACGGTCAA AGAGAAGAAC GCGGCCAAGA AACGCCTGCT GCTGGATTGC
CGCGCTACCA ATCAGCGCGG CGAGGAGGTA ATCACCGGCC TCGCCGAAGT GATCGCGCCG
GTCGAGAAGA TCTCGCGGCC GCGGGTGCTG CTGCCGGAAA TCGATCTCAA TCGCACCGCG
CAGCGCTACG AGCGGCTGAT CGAAATGACG CGCGGGCTGC AGCCGATCCG CACCGCGGTG
GTGCACCCGG TGGATTCCGC CTCGTTGCTC GGCGCTGTCG AGGCGGCGCG CGAGGGGCTG
ATCGTGCCGG TCCTGGTCGG ACCGGAAGCC AAGATCCGCG CCGCCGCCGC CCAGGCCGCG
GTGGATCTTG CCGGCTACGA GATCGTTGCG GTCGAGCACA GCGCGGCCGC TGCCGAAGCC
GGGGTGGCGA TGGCGCGGGC CGGCGAGGTC GAGGCGGTGA TGAAAGGCGC GCTGCACACC
GACGAGCTGA TGCACGCGGT GGTCGATCGT ACCCGCGGTC TCCGTACCGC ACGTCGTATC
AGCCACGTCT ATGCGATTGA CGCACCGGAC TATCCCCGCG CGCTGCTGGT CACTGACGCG
GCGATCAACA TCTACCCGAC GCTCGCTGAC AAGCGCGACA TCATCCAAAA CGCGATCGAT
CTGGCGCATG CGCTGGGGAT CGCCGAGCCG CGGGTGGCGA TCCTGTCGGC GGTCGAAACC
GTCACCGAGA GCATCCGCTC GACGCTCGAT GCAGCCGCAT TGTGCAAGAT GGCCGAGCGC
GGCCAGATCA AAGGTGGCAT CCTCGACGGG CCGCTGGCCT TCGACAACGC GGTGTCGGAA
GAGGCTGCCA AGACCAAGGG TATCGTTTCG CAGGTGGCGG GGCGTGCCGA CATCTTCGTG
GTGCCGGACC TCGAGGCCGG CAACATGCTG GCCAAGCAAC TCGAATATCT GGCGCACGCC
CGCGTCGCCG GGATCGTGCT CGGCGCGCGG GTGCCGATCA TCCTCACCAG CCGCGCCGAC
AAGACGCTGG CGCGGCTCGG GTCTTGCGCG ATCGCGCTGC TGCTCGCTCG CCACAACACC
GCGGCGCCGC CGCGCGTTTC CGGAGGTGCC GCATGA
 
Protein sequence
MEQIQNRTFD EIEVGDTASL VRTLTYRDIE VFAVMSGDVN PMHVDAAFAK SDMFHQVVAH 
GMWGGALIST LLGTQLPGPG TIYLDQSLRF ARPVLLGDTV TVTVTVKEKN AAKKRLLLDC
RATNQRGEEV ITGLAEVIAP VEKISRPRVL LPEIDLNRTA QRYERLIEMT RGLQPIRTAV
VHPVDSASLL GAVEAAREGL IVPVLVGPEA KIRAAAAQAA VDLAGYEIVA VEHSAAAAEA
GVAMARAGEV EAVMKGALHT DELMHAVVDR TRGLRTARRI SHVYAIDAPD YPRALLVTDA
AINIYPTLAD KRDIIQNAID LAHALGIAEP RVAILSAVET VTESIRSTLD AAALCKMAER
GQIKGGILDG PLAFDNAVSE EAAKTKGIVS QVAGRADIFV VPDLEAGNML AKQLEYLAHA
RVAGIVLGAR VPIILTSRAD KTLARLGSCA IALLLARHNT AAPPRVSGGA A