Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_3019 |
Symbol | |
ID | 4023522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 3362909 |
End bp | 3363985 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637963218 |
Product | 2-nitropropane dioxygenase, NPD |
Protein accession | YP_570146 |
Protein GI | 91977487 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.978745 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGCCAG ACCGCAGACT GCTCGACCTC TTCAAAATTG GCATTCCGAT CGTGCAGGCG CCGATGGTGG GCGTGCAGGA CGCCGACATC ATGATCGGCG CGGCCCGCGC CGGTGCGCTG CCGTCGCTCG CCTGCGCCTC GCTGTCGCCG GAGAAAGCCC GCGCGCAGGT CGGCATCGTC CGGCAGGCGG TATCGACGCC GATCAATCTG AATTTCTTCT GCCACCACGC GGTCGATGCC GATCCGCAGC GTGAGGCTGG ATGGAAGCAA CGGTTGGGCG AGTACTACAA GGAATATCAC GTCGACCCCG ACAAGCCGCT CGCGTTCGCC AACCGCGCAC CATTCGACGA AGCGATGTGC GAACTCGTCG AAGAGTTGAA GCCGGAGGTT GTGAGCTTCC ATTTCGGACT GCCGGCGCCG GCGTTGCTGG CGCGGGTCAA GGCGGCTGGC TGCATCGTGA TCGCGTCGGC CACGATCGTG CGCGAAGCAA TCTGGCTCGA GGAGCGTGGC GTCGATGCGG TGATCGCGCA GGGCGCCGAG GCCGGCGGCC ATCGCGGCAT GTTCCTGACC GACAGGATCG CCGAACAGCC CGGTCTGTTC GCGCTGCTTC CGCAGGTGGT GGACGCGGTG CGGGTGCCTG TGATTGCGGC CGGCGGCATC GCGGACGGAC GCGGCATTGC GGCGGCCTTC GCGCTGGGTG CGGCCGGCGT GCAGATCGGG ACTGCCTATC TGCGTTGTCC GGAATCCAAG GTCAGCGCAC CGGGGCGCGT CGCTCTGGCG CGCGCCGGAG ACGATTCCAC CGTGATCACC AATGTGATGA CTGGTCGACC GGCGCGCGGC GTGGTCAACC GCGTCATGCG CGAGGTCGGG CCGATCGCGC CGGAAGCGCC GCCGTTTCCG TATGCTGCGA CCGCTCTGGC GCCGCTGAAG GCTGCCGCCG AGGCGCAGGG CCGGGTCGAT TTCACCATGC TGTGGGCCGG GCAGGCGGTC GGGCTCGGCC GCGACATGTC GGCTGCTGAT ATGACGCGGG CGCTGGCCGG GGCGGCGCTG GCCCGGCTGA GCCAGCTCGC TCGCTGA
|
Protein sequence | MWPDRRLLDL FKIGIPIVQA PMVGVQDADI MIGAARAGAL PSLACASLSP EKARAQVGIV RQAVSTPINL NFFCHHAVDA DPQREAGWKQ RLGEYYKEYH VDPDKPLAFA NRAPFDEAMC ELVEELKPEV VSFHFGLPAP ALLARVKAAG CIVIASATIV REAIWLEERG VDAVIAQGAE AGGHRGMFLT DRIAEQPGLF ALLPQVVDAV RVPVIAAGGI ADGRGIAAAF ALGAAGVQIG TAYLRCPESK VSAPGRVALA RAGDDSTVIT NVMTGRPARG VVNRVMREVG PIAPEAPPFP YAATALAPLK AAAEAQGRVD FTMLWAGQAV GLGRDMSAAD MTRALAGAAL ARLSQLAR
|
| |