Gene RPD_2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2021 
Symbol 
ID4022503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2266957 
End bp2267952 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content65% 
IMG OID637962214 
Product2-nitropropane dioxygenase, NPD 
Protein accessionYP_569157 
Protein GI91976498 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.01256 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.586253 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAAGA CGCGTTTTAC CGAAACATTC GGCATCCAGC ATCCGATCGT CCAGGGCGGG 
ATGCAGTGGG TCGGCCGCGC CGAACTCGTG GCCGCTATCG CCAATGCCGG CGCGCTCGGC
ATGATCACCG CGCTGACGCA GCCGACGCCG GAGGATCTCA CCAAGGAGAT CGCGCGCTGC
CGTGACCTCA CCGACAAGCC GTTCGGCGTC AACCTCACGA TCCTGCCGGC GATCAAGCCG
CCGCCTTATG CGGAGTATCG TCAGGCGATC ATCGAGAGCG GCGTCAGGAT CGTCGAGACC
GCGGGCAACA AGCCGCAGGA GCACGTCGAG GAATTCAGAA AGCACGGCGT CAAGGTTCTG
CACAAATGTA CCAGCGTCCG CCACGCGCTG TCGGCGGAGC GGATGGGCGT CGACGGCATT
TCGATCGACG GTTTCGAATG CGCCGGCCAC CCGGGCGAAG ACGATACGCC CGGCCTGATC
CTGATCCCCG CCGCCGCCGA CAAGATCAAG GTCCCGATGA TCGCCTCGGG CGGCTTCGCC
GATGGGCGCG GCCTGGTCGC GGCGCTGGCG CTCGGCGCCG ACGGCATCAA CATGGGCACG
CGATTCATGT GCACCAAAGA GAGCCCGATC CATCAGGCGG TGAAGGAAAA GATCGTCGCC
AATGACGAGC GCTCGACCGA CCTGATCTTC CGCACCATGC GCAACACCTC GCGCGTCGCG
AAGAACGCGA TCAGCCAGCA GGTGATCGAG CTAGAGAAGC AGGGCGCGAC CTTCGAGCAG
GTCCGCGAAC TGGTCGCCGG CGCCCGCGGC AAGATGGTCT ACGCTACCGG CGACACCGAT
GAAGGCGTGT GGTCGGCCGG TCAGGTCCAG GGACTGATTC ATGACATTCC GAGCTGCGCC
GAGCTGGTGT CGCGGATCAT GCGCGACGCC GAGGCGATCA TTCGTGCGCG GCTCGAAGCG
ATGCTGTCGG GCGGCCAGCG CGAAGCCGCC GAATGA
 
Protein sequence
MIKTRFTETF GIQHPIVQGG MQWVGRAELV AAIANAGALG MITALTQPTP EDLTKEIARC 
RDLTDKPFGV NLTILPAIKP PPYAEYRQAI IESGVRIVET AGNKPQEHVE EFRKHGVKVL
HKCTSVRHAL SAERMGVDGI SIDGFECAGH PGEDDTPGLI LIPAAADKIK VPMIASGGFA
DGRGLVAALA LGADGINMGT RFMCTKESPI HQAVKEKIVA NDERSTDLIF RTMRNTSRVA
KNAISQQVIE LEKQGATFEQ VRELVAGARG KMVYATGDTD EGVWSAGQVQ GLIHDIPSCA
ELVSRIMRDA EAIIRARLEA MLSGGQREAA E