Gene Information |
Locus tag | RPB_2433 |
Symbol | |
ID | 3909567 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2790959 |
End bp | 2792035 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637884332 |
Product | 2-nitropropane dioxygenase, NPD |
Protein accession | YP_486049 |
Protein GI | 86749553 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | |
|
|
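The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2790959
end_bp = 2792035

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1077 0 358
```

Under these assumptions the result matches the listed gene length (1077 bp) and protein length (358 aa).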
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.769638 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0464074 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGCCGG ACCGCCGTCT CCTCGACCTG TTCGATATCG AATTTCCATT CGTTCTGGCG CCGATGGCCG GCGCGATGGA TGCCGAGCTG GCGATTGCAG CAGCGCGGGG TGGGGCGCTG GCGTCGCTTC CCTGCGCGAT GCTGACCGCT GACAAGGCGC GCGAGCAGGT CGGGATCTTC CGCCAGCAGG TGTCGGCGCC GGTCAATCTC AACTTCTTCT GCCACAGGTC GGTCGCCGCC GATCCGGCGC GCGAGGCGGT GTGGAAGCAG CGGCTCGGCG CCTACTATCA GGAATTCGGC CTCGACCCGG CCGCGCCGGT GGCCGCCGCC AATCGTGCGC CGTTCGATGC GGCGATGTGC GAACTGGTCG AGCAGTTGAA GCCTGCGGCG GTCAGCTTTC ATTTCGGCCT GCCGGACGAG GCTTTGCTGC GGCGCGTGAA GGACGCCGGC TGCATCGTGC TGGCGTCGGC GACGATCGTG CGCGAGGCGA TTTGGCTCGA GGAGCGCGGT GCCGATCTGG TGATCGCGCA GGGCGCCGAG GCGGGCGGGC ATCGCGGCAT GTTCCTGACC GAGAATATCG CCGAGCAGCC GGGCCTGTTC GCGCTGCTGC CGCAGGTGGT CGATGCGGTG CGGGTGCCGG TGATCGCGGC CGGCGGCATT GCCGATGGCC GGGGCATCGC CGCGGCGATG GCGCTCGGCG CTTCCGGCGT CCAGATCGGC ACAGCCTATC TGCGCTGCCC GGAATCGCGG ATCAGCGCTC CGGCGCGCGC CGCGTTGGCC CAGGCGACCG ACGCGTCCAC CGTGATCACC AACGTCATGA CCGGGCGTCC CGCACGCGGC GTCGCCAACC GGGTGATGCG CGAAATCGGG CCACTGTCCG CTGACGCGCC GGCGTTCCCG CATGCGGCCA CAGCATTGGG CCCGCTGAAG GCCGAGGCGG AGAAGCAGGG CCGCACCGAT TTCACCAACC TTTGGGCCGG GCAGGCGGTG CGGCTCGGTC GCGACATGCC GGCGGCGGAA CTGACCCGGG CGCTGGCGGG TTCGGCGTTG GCGCGGCTGA GCCGGCTCGC CGGCTGA
|
Protein sequence | MWPDRRLLDL FDIEFPFVLA PMAGAMDAEL AIAAARGGAL ASLPCAMLTA DKAREQVGIF RQQVSAPVNL NFFCHRSVAA DPAREAVWKQ RLGAYYQEFG LDPAAPVAAA NRAPFDAAMC ELVEQLKPAA VSFHFGLPDE ALLRRVKDAG CIVLASATIV REAIWLEERG ADLVIAQGAE AGGHRGMFLT ENIAEQPGLF ALLPQVVDAV RVPVIAAGGI ADGRGIAAAM ALGASGVQIG TAYLRCPESR ISAPARAALA QATDASTVIT NVMTGRPARG VANRVMREIG PLSADAPAFP HAATALGPLK AEAEKQGRTD FTNLWAGQAV RLGRDMPAAE LTRALAGSAL ARLSRLAG
|
| |
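The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline: it builds the codon table from the compact NCBI-style amino acid string, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which this sketch does not model). The helper names `clean` and `translate` are hypothetical, and only the first 30 and last 15 nucleotides of the gene are used to keep the example short.

```python
# Codon table in NCBI order (bases T, C, A, G at each codon position).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 15 nucleotides of the gene sequence above.
first_30 = "ATGTGGCCGG ACCGCCGTCT CCTCGACCTG"
last_15 = "CGGCTCGCCGGCTGA"

print(translate(first_30))  # MWPDRRLLDL
print(translate(last_15))   # RLAG*
```

The two fragments translate to the first ten residues and the last four residues of the listed protein, followed by the TGA stop codon. Passing the full gene sequence to `translate` should work the same way, since whitespace is stripped before translation.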