Gene RPB_4429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4429 
Symbol 
ID3912244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5018602 
End bp5020347 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content68% 
IMG OID637886334 
Product5-oxoprolinase (ATP-hydrolyzing) 
Protein accessionYP_488026 
Protein GI86751530 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.28701 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGTCC ACGCCCCTCA CGCCGATCCG ATGAACGTGA TCCATCAGCA GATCATGTGG 
GATCGCCTGA TCTCCGTGGT CGAAGAACAG GCCCAGACGC TGATCCGGAT CGGCTTCAGC
ACCTCGACCC GCGAAGCCGG CGACGTCTCG GCCGGCGTGT TCAACACCGC CGGTCACATG
CTCGCGCAGG CCGTCACCGG CACGCCCGGC CACGTCAATT CGATGGCGCG CGCGGTGGTG
CATTTCCTCG ACAAGTTTCC CGCGACGACG ATGCAGCCCG GCGACATCTT CATCACCAAC
GATCCGTGGA AGGGAACCGG CCATCTCCAC GACTTCACGG TGGTGACGCC CGTCTTCCGC
GCCGACCGGC TGGTCGCGCT GTTCGCCAGC ACCTGCCACG TCATCGACAT CGGCGGACGC
GGCATGGGCC CGGACGCCCG CCAAGTGTTC GAGGAAGGCG TCTTCATCCC GATCATGCGC
TTCGCCAGCG CCGCCGGCAC CAACGAGACG CTGCTCGAGA TCATCAAGGG CAATGTCCGG
GAGCCCGTGC AGGTCGTCGG CGACCTCTAT TCGCTGGCCG GCTGCAACGA TGTCGGCGGC
CGGCAGCTCC TGAAGATGAT GGACGAATTC GGTATCGAGA CTCTCGACCG TCTCGGCGAT
CACATTCTGG AACGGTCACG CGTCGCCACG CTGGAAGCGA TCCGCGCGCT GCCGAAAGGT
ACGTTCCGCA ACAGCATGCG CGTCGACGGC TACGACAGGC CGCTCGATCT GGTCGCGACG
ATGACGATCT CCGACGACGG CATCGATGTC GATTTCGCCG GCACCTCGCC GCCCTCGTCC
TTCGGCATCA ACGTGCCGTT CTGCTACACC GAGGCCTATG CGAGCTTCGG CGTGAAATGC
ATCATCGCAC CGAAGATCCC GAACAACGAG GGCTCGCTCG CCTTGCTGCG CATGCGCGCA
CCGGCGGATT GCATCCTCAA CGCGCAGCCG CCGCTGCCCG TCGCCACGCG CCATATCGTC
GGACAGATGT TGCCCGATCT GGTGATCGGC TGCCTCGGCC AGGCGCTTCA CGGCAACGTC
CCGGCGGAAG GCACCTCCTG TCTTTGGAAC CTGTTCGCCT TCGGCGGCTC CAGCCAGATC
GACGCCGACT CCACCGAGAT GATGCGGGCC CGCGTGTTCA ACGTGATGTC GTTCCATTCC
GGCGGCACCG GCGCGCGGCC GGGCAAGGAC GGCCTGTCGG CCACCGCCTT CCCGAGCGGC
GTGCGCAACG TGCCGGTGGA GGTCACCGAG GCGATGTCGC CGCTGCTGAT CAAGCGCAAG
GAATATCGCA CCGACTCCGG CGGCCCCGGC CAATTCCGCG GCGGCCTCGG CCAGGTGATG
GAAGTCGTCA GCCTCGACGA CACCGCCTTC GCGATCTCCG CCAACTACGA CCGCGTCGAT
TTCCCCGCCC GCGGCCGCGA CGGCGGCGCC GACGGCAAGG CCGGCAAGAT CTCGCTCGGC
TCCGGCAGGC TGCTGAAGAG CAAGGGCCAG CAGACCATTC CGCGCGGCGA AGCCGTGCTG
ATCGAAATGC CCGGCGGCGG CGGCCTCGGC GATCCCTTCA GCCGCGATGC CGCGGCGGTC
GCCGCGGACG TGCATCTCGG CATGGTGTCG CGCGAGGCCG CCGAGACGGC CTACGGCGTC
GTGCTGCGCG GCGACCACTC CGTCGACGAA ACCGCGACCG CCGCACGCCG CGGCCATCGC
GCCTGA
 
Protein sequence
MDVHAPHADP MNVIHQQIMW DRLISVVEEQ AQTLIRIGFS TSTREAGDVS AGVFNTAGHM 
LAQAVTGTPG HVNSMARAVV HFLDKFPATT MQPGDIFITN DPWKGTGHLH DFTVVTPVFR
ADRLVALFAS TCHVIDIGGR GMGPDARQVF EEGVFIPIMR FASAAGTNET LLEIIKGNVR
EPVQVVGDLY SLAGCNDVGG RQLLKMMDEF GIETLDRLGD HILERSRVAT LEAIRALPKG
TFRNSMRVDG YDRPLDLVAT MTISDDGIDV DFAGTSPPSS FGINVPFCYT EAYASFGVKC
IIAPKIPNNE GSLALLRMRA PADCILNAQP PLPVATRHIV GQMLPDLVIG CLGQALHGNV
PAEGTSCLWN LFAFGGSSQI DADSTEMMRA RVFNVMSFHS GGTGARPGKD GLSATAFPSG
VRNVPVEVTE AMSPLLIKRK EYRTDSGGPG QFRGGLGQVM EVVSLDDTAF AISANYDRVD
FPARGRDGGA DGKAGKISLG SGRLLKSKGQ QTIPRGEAVL IEMPGGGGLG DPFSRDAAAV
AADVHLGMVS REAAETAYGV VLRGDHSVDE TATAARRGHR A