Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3433 |
Symbol | |
ID | 3911235 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3931836 |
End bp | 3932831 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637885336 |
Product | 2-nitropropane dioxygenase, NPD |
Protein accession | YP_487040 |
Protein GI | 86750544 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.995497 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0477625 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAAGA CGCGATTCAC CGAAACCTTC GGCATCGAGC ATCCGATCGT CCAGGGCGGG ATGCAATGGG TCGGCCGTGC CGAACTGGTC GCGGCGGTCG CCAATGCCGG CGCGCTCGGC ATGATCACCG CGCTGACGCA GCCGACGCCG GAAGACCTCA CCAAGGAAAT CGCGCGCTGC CGTGAGCTCA CCGACAAGCC GTTCGGCGTG AACCTGACGA TCCTGCCGGC GATCAAGCCG CCGCCTTATG CGGAATATCG CCAGGCGATC ATCGAGAGCG GCGTGCGGAT CGTCGAAACC GCGGGCAACA AGCCGCAGGA GCACGTCGAG GAATTCAAGA AGCACGGCGT CAAGGTGCTG CACAAATGCA CCAGCGTCCG CCACGCGCTC TCGGCCGAGC GGATGGGCGT CGACGGCATT TCGATCGACG GCTTCGAATG CGCCGGCCAT CCGGGCGAGG ACGACACCCC CGGCCTGATC CTGATCCCGG CCGCCGCCGA CAAGATCAAA ATTCCGATGA TCGCCTCCGG CGGCTTCGCC GACGGCCGCG GCCTGGTGGC GGCGCTGTCG CTCGGCGCCG ACGGCATCAA CATGGGCACG CGCTTCATGT GCACCAAGGA GAGCCCGATC CACGAAGCGG TGAAGCAGAA GATCGTCGAC AATGACGAGC GCGCGACCGA CCTGATCTTC CGGACCTTGC GCAACACCTC GCGCGTCGCG AAGAACGCGA TCAGCCAGCA GGTCCTCGAA CTGGAAAAGC AGGGCGCGAC CTTCGAGCAG GTCAAGGACC TGGTTGCGGG CGCGCGCGGC AAGATGGTCT ATGTCACCGG CGACACCGAC GAAGGCGTGT GGTCGGCCGG CCAGGTGCAG GGGCTGATCC ACGACATCCC GAGCTGCGCC GACCTGGTGT CGCGGATCGT GCGCGACGCC GAGGCGATCA TTCGCGGCCG GCTCGAAACG ATGCTATCGG GGGGCGCGCG TCAGGCTGCG GAATAG
|
Protein sequence | MIKTRFTETF GIEHPIVQGG MQWVGRAELV AAVANAGALG MITALTQPTP EDLTKEIARC RELTDKPFGV NLTILPAIKP PPYAEYRQAI IESGVRIVET AGNKPQEHVE EFKKHGVKVL HKCTSVRHAL SAERMGVDGI SIDGFECAGH PGEDDTPGLI LIPAAADKIK IPMIASGGFA DGRGLVAALS LGADGINMGT RFMCTKESPI HEAVKQKIVD NDERATDLIF RTLRNTSRVA KNAISQQVLE LEKQGATFEQ VKDLVAGARG KMVYVTGDTD EGVWSAGQVQ GLIHDIPSCA DLVSRIVRDA EAIIRGRLET MLSGGARQAA E
|
| |