Gene Information |
Locus tag | RPB_4429 |
Symbol | |
ID | 3912244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 5018602 |
End bp | 5020347 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637886334 |
Product | 5-oxoprolinase (ATP-hydrolyzing) |
Protein accession | YP_488026 |
Protein GI | 86751530 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit |
TIGRFAM ID | |
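The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
# Minimal consistency check for the record's coordinate and length fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS ends with one stop codon excluded from the protein.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS that includes its stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(5018602, 5020347)
print(length_bp)                  # 1746
print(protein_length(length_bp))  # 581
```

Both values agree with the Gene Length (1746 bp) and Protein Length (581 aa) fields of this record.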
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.28701 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGTCC ACGCCCCTCA CGCCGATCCG ATGAACGTGA TCCATCAGCA GATCATGTGG GATCGCCTGA TCTCCGTGGT CGAAGAACAG GCCCAGACGC TGATCCGGAT CGGCTTCAGC ACCTCGACCC GCGAAGCCGG CGACGTCTCG GCCGGCGTGT TCAACACCGC CGGTCACATG CTCGCGCAGG CCGTCACCGG CACGCCCGGC CACGTCAATT CGATGGCGCG CGCGGTGGTG CATTTCCTCG ACAAGTTTCC CGCGACGACG ATGCAGCCCG GCGACATCTT CATCACCAAC GATCCGTGGA AGGGAACCGG CCATCTCCAC GACTTCACGG TGGTGACGCC CGTCTTCCGC GCCGACCGGC TGGTCGCGCT GTTCGCCAGC ACCTGCCACG TCATCGACAT CGGCGGACGC GGCATGGGCC CGGACGCCCG CCAAGTGTTC GAGGAAGGCG TCTTCATCCC GATCATGCGC TTCGCCAGCG CCGCCGGCAC CAACGAGACG CTGCTCGAGA TCATCAAGGG CAATGTCCGG GAGCCCGTGC AGGTCGTCGG CGACCTCTAT TCGCTGGCCG GCTGCAACGA TGTCGGCGGC CGGCAGCTCC TGAAGATGAT GGACGAATTC GGTATCGAGA CTCTCGACCG TCTCGGCGAT CACATTCTGG AACGGTCACG CGTCGCCACG CTGGAAGCGA TCCGCGCGCT GCCGAAAGGT ACGTTCCGCA ACAGCATGCG CGTCGACGGC TACGACAGGC CGCTCGATCT GGTCGCGACG ATGACGATCT CCGACGACGG CATCGATGTC GATTTCGCCG GCACCTCGCC GCCCTCGTCC TTCGGCATCA ACGTGCCGTT CTGCTACACC GAGGCCTATG CGAGCTTCGG CGTGAAATGC ATCATCGCAC CGAAGATCCC GAACAACGAG GGCTCGCTCG CCTTGCTGCG CATGCGCGCA CCGGCGGATT GCATCCTCAA CGCGCAGCCG CCGCTGCCCG TCGCCACGCG CCATATCGTC GGACAGATGT TGCCCGATCT GGTGATCGGC TGCCTCGGCC AGGCGCTTCA CGGCAACGTC CCGGCGGAAG GCACCTCCTG TCTTTGGAAC CTGTTCGCCT TCGGCGGCTC CAGCCAGATC GACGCCGACT CCACCGAGAT GATGCGGGCC CGCGTGTTCA ACGTGATGTC GTTCCATTCC GGCGGCACCG GCGCGCGGCC GGGCAAGGAC GGCCTGTCGG CCACCGCCTT CCCGAGCGGC GTGCGCAACG TGCCGGTGGA GGTCACCGAG GCGATGTCGC CGCTGCTGAT CAAGCGCAAG GAATATCGCA CCGACTCCGG CGGCCCCGGC CAATTCCGCG GCGGCCTCGG CCAGGTGATG GAAGTCGTCA GCCTCGACGA CACCGCCTTC GCGATCTCCG CCAACTACGA CCGCGTCGAT TTCCCCGCCC GCGGCCGCGA CGGCGGCGCC GACGGCAAGG CCGGCAAGAT CTCGCTCGGC TCCGGCAGGC TGCTGAAGAG CAAGGGCCAG CAGACCATTC CGCGCGGCGA AGCCGTGCTG ATCGAAATGC CCGGCGGCGG CGGCCTCGGC GATCCCTTCA GCCGCGATGC CGCGGCGGTC GCCGCGGACG TGCATCTCGG CATGGTGTCG CGCGAGGCCG CCGAGACGGC CTACGGCGTC GTGCTGCGCG GCGACCACTC CGTCGACGAA ACCGCGACCG CCGCACGCCG CGGCCATCGC GCCTGA
|
Protein sequence | MDVHAPHADP MNVIHQQIMW DRLISVVEEQ AQTLIRIGFS TSTREAGDVS AGVFNTAGHM LAQAVTGTPG HVNSMARAVV HFLDKFPATT MQPGDIFITN DPWKGTGHLH DFTVVTPVFR ADRLVALFAS TCHVIDIGGR GMGPDARQVF EEGVFIPIMR FASAAGTNET LLEIIKGNVR EPVQVVGDLY SLAGCNDVGG RQLLKMMDEF GIETLDRLGD HILERSRVAT LEAIRALPKG TFRNSMRVDG YDRPLDLVAT MTISDDGIDV DFAGTSPPSS FGINVPFCYT EAYASFGVKC IIAPKIPNNE GSLALLRMRA PADCILNAQP PLPVATRHIV GQMLPDLVIG CLGQALHGNV PAEGTSCLWN LFAFGGSSQI DADSTEMMRA RVFNVMSFHS GGTGARPGKD GLSATAFPSG VRNVPVEVTE AMSPLLIKRK EYRTDSGGPG QFRGGLGQVM EVVSLDDTAF AISANYDRVD FPARGRDGGA DGKAGKISLG SGRLLKSKGQ QTIPRGEAVL IEMPGGGGLG DPFSRDAAAV AADVHLGMVS REAAETAYGV VLRGDHSVDE TATAARRGHR A
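The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 30 nucleotides. This is a minimal sketch, not the pipeline that produced the record: the `translate` helper and the compact codon table are written here for illustration, and it assumes that translation table 11 assigns the same amino acids to sense codons as the standard genetic code (the tables are understood to differ in permitted start codons).

```python
# Sketch: translate the first three blocks of the gene sequence.
# Assumption: amino-acid assignments of translation table 11 match the
# standard code, encoded compactly below in T, C, A, G order for each of
# the three codon positions.

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-nt blocks of the gene sequence, copied from the record.
first_blocks = "ATGGACGTCC ACGCCCCTCA CGCCGATCCG"
print(translate(first_blocks))  # MDVHAPHADP
```

The result matches the first block of the protein sequence shown in the record.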