Gene RPB_2433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2433 
Symbol 
ID3909567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2790959 
End bp2792035 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content71% 
IMG OID637884332 
Product2-nitropropane dioxygenase, NPD 
Protein accessionYP_486049 
Protein GI86749553 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.769638 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0464074 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGCCGG ACCGCCGTCT CCTCGACCTG TTCGATATCG AATTTCCATT CGTTCTGGCG 
CCGATGGCCG GCGCGATGGA TGCCGAGCTG GCGATTGCAG CAGCGCGGGG TGGGGCGCTG
GCGTCGCTTC CCTGCGCGAT GCTGACCGCT GACAAGGCGC GCGAGCAGGT CGGGATCTTC
CGCCAGCAGG TGTCGGCGCC GGTCAATCTC AACTTCTTCT GCCACAGGTC GGTCGCCGCC
GATCCGGCGC GCGAGGCGGT GTGGAAGCAG CGGCTCGGCG CCTACTATCA GGAATTCGGC
CTCGACCCGG CCGCGCCGGT GGCCGCCGCC AATCGTGCGC CGTTCGATGC GGCGATGTGC
GAACTGGTCG AGCAGTTGAA GCCTGCGGCG GTCAGCTTTC ATTTCGGCCT GCCGGACGAG
GCTTTGCTGC GGCGCGTGAA GGACGCCGGC TGCATCGTGC TGGCGTCGGC GACGATCGTG
CGCGAGGCGA TTTGGCTCGA GGAGCGCGGT GCCGATCTGG TGATCGCGCA GGGCGCCGAG
GCGGGCGGGC ATCGCGGCAT GTTCCTGACC GAGAATATCG CCGAGCAGCC GGGCCTGTTC
GCGCTGCTGC CGCAGGTGGT CGATGCGGTG CGGGTGCCGG TGATCGCGGC CGGCGGCATT
GCCGATGGCC GGGGCATCGC CGCGGCGATG GCGCTCGGCG CTTCCGGCGT CCAGATCGGC
ACAGCCTATC TGCGCTGCCC GGAATCGCGG ATCAGCGCTC CGGCGCGCGC CGCGTTGGCC
CAGGCGACCG ACGCGTCCAC CGTGATCACC AACGTCATGA CCGGGCGTCC CGCACGCGGC
GTCGCCAACC GGGTGATGCG CGAAATCGGG CCACTGTCCG CTGACGCGCC GGCGTTCCCG
CATGCGGCCA CAGCATTGGG CCCGCTGAAG GCCGAGGCGG AGAAGCAGGG CCGCACCGAT
TTCACCAACC TTTGGGCCGG GCAGGCGGTG CGGCTCGGTC GCGACATGCC GGCGGCGGAA
CTGACCCGGG CGCTGGCGGG TTCGGCGTTG GCGCGGCTGA GCCGGCTCGC CGGCTGA
 
Protein sequence
MWPDRRLLDL FDIEFPFVLA PMAGAMDAEL AIAAARGGAL ASLPCAMLTA DKAREQVGIF 
RQQVSAPVNL NFFCHRSVAA DPAREAVWKQ RLGAYYQEFG LDPAAPVAAA NRAPFDAAMC
ELVEQLKPAA VSFHFGLPDE ALLRRVKDAG CIVLASATIV REAIWLEERG ADLVIAQGAE
AGGHRGMFLT ENIAEQPGLF ALLPQVVDAV RVPVIAAGGI ADGRGIAAAM ALGASGVQIG
TAYLRCPESR ISAPARAALA QATDASTVIT NVMTGRPARG VANRVMREIG PLSADAPAFP
HAATALGPLK AEAEKQGRTD FTNLWAGQAV RLGRDMPAAE LTRALAGSAL ARLSRLAG