Gene Rpal_0743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_0743 
Symbol 
ID6408396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp780478 
End bp782172 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content65% 
IMG OID642710658 
Productbenzoyl-CoA-dihydrodiol lyase 
Protein accessionYP_001989778 
Protein GI192289173 
COG category[I] Lipid transport and metabolism 
COG ID[COG1024] Enoyl-CoA hydratase/carnithine racemase 
TIGRFAM ID[TIGR03222] benzoyl-CoA-dihydrodiol lyase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGAGG TCGCAGACAC GCGTCCGCTC GCGAACGGAG CTGTGCGGGT CGATTTTCAG 
ACCGAGCCGT CCCGCTACCG GCATTGGAAG CTGACGGTCG ATGGCGAGAT CGCGACGCTC
ACCCTCGATG TCGACGAGAA CGGCGGCCTG TTCGAGGGCT ACCAGCTCAA GCTGAATTCC
TACGATCTCG GAGTCGACAT CGAGCTTTCC GACGCGATGC AGCGGCTTCG GTTCGAGCAT
CCGGCCGTGA AAGTCATCCT ACTTCGCTCC GGCAAGAACC GGGTGTTCTG CGCCGGCGCC
AACATCCGGA TGCTCGCCGG CGCCAGCCAT GTCCACAAAG TCAACTTCTG CAAGTTCACC
AACGAAACCC GCAATGGCAT CGAGGATTCC TCGCTGCATT CCGGCCAGCG CAGCATCGCG
GTGATCAACG GCACCGCGGC CGGCGGCGGC TACGAACTCG CGCTAGCCGC CGATCACATC
ATTATGGCTG ACGACGGTTC GGCTGCGGTG GCGTTGCCCG AGGTGCCGCT GCTCGCGGTG
CTGCCCGGCA CCGGCGGCCT GACGCGCGTG GTCGATAAGC GCAAGGTGCG CCGCGACCAC
GCCGACGTGT TCTGTACCAT CGAGGAAGGC ATCAAGGGCA AGCGCGCGGT GCAGTGGCGG
CTGGTCGACG ATATCGTCGC CACTACCAAG CTTGACGCGA AGGTGACCGA ACGCGCCCGT
GAGCTTGCTG CGGCTTCCCC GCGCAACGGA AGCGACGCGG GCGTCCCGTT GACGCCGCTG
CAGCGCCAAT TCGACGACAC CAGCGTGCGC TATGGATTGG TCGGCGTCGA GATCGATCGT
ACCGCCCGGA TCGCGACCAT CACGCTGACG GGCCCGGACG CGGCTCCGCC GACTTCGATC
GATGCATTGC AAGTGCAGGG CGCTGCGTTC TGGCCCTTGC AGCTCGCCCG CGAACTCGAC
GACGCCATCC TGCATCTGCG GCTGAATGAA CCGGAATTGG GACTCTGGGT GTTCAAGTCC
CACGGTGATG CCGAGCAAGT CCTGGCCTAT GACGCGCTGC TCGAAGCGCA CAAGGGCCAC
TGGCTGGTCA ACGAAATCCG TCACTTCTGG AAGCGCGTAC TGAAGCGCGT CGACGTCACG
TCGCGTTCTC TGGTCACGCT GGTCGAGCCA GGCTCGTGCT TCGCCGGAAC GCTCGCCGAA
CTCGTGTTCG CCGCCGACCG TAGCTACATG CTGATCGGCT CTCGCGACGG CGACAACCGT
CCGCCGCCGA TGCTGACGCT GTCGGCGCTG AACTTCGGCG CCTATCCGAT GAGCCATGGT
CTCACCCGGC TGCAGTCACG CTTCCAAGCT GATCCGGCTG ACCACGCGGC GGTGCAGAGC
AAAACTGGCG ATGCGCTCGA CGCCGAGGCG GCCGAGACGC TCGGTCTGGT CACGTTTGCG
CTCGACGATA TCGATTGGGA CGACGAGGTC CGCGTCTTCC TGGAAGAGCG TGCGTCGTTT
TCGCCCGACA GTCTCTCCGG CATGGAAGCC AACCTGCGCT TCGTCGGCCC CGAGACGATG
GAATCGAAAA TCTTCGCCCG GCTCACCGCC TGGCAGAACT GGATCTTCCA GCGTCCTAAC
GCGGTCGGTG AGGTCGGCGC GCTGCGCCGC TACGGCAGCG GCCAGAAACC GCAATTCGAT
ATGACGAGAG TCTGA
 
Protein sequence
MAEVADTRPL ANGAVRVDFQ TEPSRYRHWK LTVDGEIATL TLDVDENGGL FEGYQLKLNS 
YDLGVDIELS DAMQRLRFEH PAVKVILLRS GKNRVFCAGA NIRMLAGASH VHKVNFCKFT
NETRNGIEDS SLHSGQRSIA VINGTAAGGG YELALAADHI IMADDGSAAV ALPEVPLLAV
LPGTGGLTRV VDKRKVRRDH ADVFCTIEEG IKGKRAVQWR LVDDIVATTK LDAKVTERAR
ELAAASPRNG SDAGVPLTPL QRQFDDTSVR YGLVGVEIDR TARIATITLT GPDAAPPTSI
DALQVQGAAF WPLQLARELD DAILHLRLNE PELGLWVFKS HGDAEQVLAY DALLEAHKGH
WLVNEIRHFW KRVLKRVDVT SRSLVTLVEP GSCFAGTLAE LVFAADRSYM LIGSRDGDNR
PPPMLTLSAL NFGAYPMSHG LTRLQSRFQA DPADHAAVQS KTGDALDAEA AETLGLVTFA
LDDIDWDDEV RVFLEERASF SPDSLSGMEA NLRFVGPETM ESKIFARLTA WQNWIFQRPN
AVGEVGALRR YGSGQKPQFD MTRV