Gene RPB_3497 details

Gene Information

Locus tag: RPB_3497
Symbol:
ID: 3911299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4002275
End bp: 4003342
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 65%
IMG OID: 637885399
Product: choloylglycine hydrolase
Protein accession: YP_487103
Protein GI: 86750607
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3049] Penicillin V acylase and related amidases
TIGRFAM ID:
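
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4002275
end_bp = 4003342

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons; the final codon is the stop codon,
# which does not contribute an amino acid.
assert gene_length_bp % 3 == 0
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1068 355
```

Both results agree with the Gene Length and Protein Length fields listed for RPB_3497.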


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGCAT TCAGGCGTCG TTTCGTGACC GTCTCCATCG CCGCACTGCT TGCCAGCGGC 
GCCCTGCTCG CGCCCGCTGC GAAAGCCTGC ACCCGCCTGG TCTATCTCGG CGCCGGCGAT
CAGGTGATCA CCGCGCGCTC GATGGACTGG GCGCGCGACA TCGGCACCAA TCTCTGGATC
TTCCCGCGCG GCATCAAGCG CTCCGGCGAG GCCGGGCCGA ATTCGGCACA ATGGACCGCG
CGCTACGGCA GCGTGATCGC CTCGGCCTAC GACATCGCGA CCTCGGACGG CGTCAACGAG
GCCGGCCTGG TGGCCAACGT GCTGTGGCTG GCGGAATCGA CCTATCCGAA GCTCGACGGC
GGCAGGCCCG GCCTCGCGCT GTCGCTGTGG CCGCAATACG TGCTCGACAA TTTCGCCAAT
GTGCAGGAGG CGGTCGCGGC GCTGGCGAAG GAACCGTTCA CCGTGGTCAC TGCGCAACTC
CCCGACGAGA ACCGGCTGGC GACCGTGCAC CTGTCGCTGT CGGACAAAAG CGGCGATAGC
GCCATCATCG AATATATCGA CGGCAAGCAG GTGATCCATC ACGGCCGGCA GTATCAGGTG
ATGACCAATT CGCCGACCTT CGATCAGCAG CTCGCGCTCA ACGCCTACTG GAAGCAGATC
GGCGGCACCG TGATGCTGCC GGGCACCAAC CGCGCCGCGG ACCGCTTCGC CCGCGCCTCG
TTCTATGTCG ATGCGATCCC GAAAGCGGAG AATCCGGTCG AAGCCATCGC CAGCGTGTTC
GGCGTGATCC GCAACGCCTC GGTGCCCTAC GGCATCACCA CGCCCGACCA GCCGAACATC
TCCTCGACGC GCTGGCGCAC CGTGGTCGAT CACAAGCGCA AACTGTACTT CTTCGAATCC
GCGCTGACCC CGAACGTGTT CTGGGTCGAC CTGACCAAAA TCGACTTCTC GGCCGACAAG
GGCACGGTGC AGAAGCTCGA CCTCGGCCCC GGCCAGAGCA ACACCTTCTC CGGCGAGGTC
CACGACCGCT TCAGGCCGAG CGAGCCGTTC AAGTTTCTCG GGCTGTGA
 
Protein sequence
MIAFRRRFVT VSIAALLASG ALLAPAAKAC TRLVYLGAGD QVITARSMDW ARDIGTNLWI 
FPRGIKRSGE AGPNSAQWTA RYGSVIASAY DIATSDGVNE AGLVANVLWL AESTYPKLDG
GRPGLALSLW PQYVLDNFAN VQEAVAALAK EPFTVVTAQL PDENRLATVH LSLSDKSGDS
AIIEYIDGKQ VIHHGRQYQV MTNSPTFDQQ LALNAYWKQI GGTVMLPGTN RAADRFARAS
FYVDAIPKAE NPVEAIASVF GVIRNASVPY GITTPDQPNI SSTRWRTVVD HKRKLYFFES
ALTPNVFWVD LTKIDFSADK GTVQKLDLGP GQSNTFSGEV HDRFRPSEPF KFLGL