Gene RPD_1859 details

Gene Information

Locus tag: RPD_1859
Symbol:
ID: 4022341
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2079765
End bp: 2080898
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 66%
IMG OID: 637962052
Product: 2-nitropropane dioxygenase, NPD
Protein accession: YP_568995
Protein GI: 91976336
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
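
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2079765, 2080898)
print(length, protein_length(length))  # prints: 1134 377
```

Under these assumptions the result agrees with the listed gene length of 1134 bp and protein length of 377 aa.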


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.468508
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCGC CGATCTGCGA GATGCTGGGC ATCGAGTTTC CGCTGCTCGC GTTCAGCCAT 
TGCCGCGACG TGGTCGCAGC CGTCAGCCGC GCCGGCGGAT TTGGTGTGCT GGGCGCCACC
ATTCACACGC CGGAGACGAT CGAGCAGGAA CTGAAATGGA TCGACGATCA TGTCGACGGC
AAGCCCTATG GGCTCGACGT GCTGATCCCG GAGAACATCT CGACCGCCGG CGAAAAGGAT
GTCACCTGGC AGAGCCTCGA GACGCGCATC GGCCCGGAGC ATCGCGATTT CACTCGCGAC
CTGCTGAAGA AGTACAATAT CGACTACAAG CCCGTGCCGG TCCCGGCGAA CCAGCCGCAG
CCGTTCGACG CGCAATGTGC GCTCGAAGTG CTCGAGGTCT CGTTCAGCCA TCCGATCCGG
TTGATCGCCA ATGCGCTCGG CGTGCCGCCC AAGGCGATGA TCGACATGGG CAGGAAACAC
GGCGTGCCAG TGGCGGCATT GGTCGGCGCC AAGGAACACG CGATCCGGCA GGTCGCGGCT
GGCGTCGACA TCATCGTCGC GCAGGGCACC GAGGCCGGCG GGCATTGCGG CGAGGTGTCG
ACGATGGTGT TGGTGCCGGA GGTGATCAAG GCGATCAAAC CGATCCGCGA GGTGCCCGTG
CTCGCCGCGG GCGGCATCAT GACCGGGCGG CAGATGGCGG CCTGCATGGC GATGGGCGCT
GCCGGTGCGT GGACCGGCTC GGTTTGGCTG GCAACGGTGG AATCCGAAAC CAGCGAGACG
TTCCGCGAGA AGATGATCGC CGCCTCGTCG CGCGACGCTG TGCGCTCGAA GGGCCGCACC
GGCAAGCCGG CGCGACAGTT GCGCTCGGTG TGGACCGACG CGTGGGATCG CGGCCCGGAC
AGCCCGGGCG CGTTGCCAAT GCCGCTGCAG TCCATCATCA GCCGCGACGC CTTCATCGCG
ATCGATCGCG CCGCCGCGGC CGGCAGTGCG CAGGCGCGCG ATCTGGTCAG CTACTTCGTC
GGCCAGGGTG TCGGCCTGAT CGACAGCGTC AAGAGCGCCG GCGCCGTGGT TCAGGAATTC
AAACAGGACT TCGCCGAGGC GGTCGAACAT CTCGACGCGC TGGTGGCGAG TTGA
 
Protein sequence
MKSPICEMLG IEFPLLAFSH CRDVVAAVSR AGGFGVLGAT IHTPETIEQE LKWIDDHVDG 
KPYGLDVLIP ENISTAGEKD VTWQSLETRI GPEHRDFTRD LLKKYNIDYK PVPVPANQPQ
PFDAQCALEV LEVSFSHPIR LIANALGVPP KAMIDMGRKH GVPVAALVGA KEHAIRQVAA
GVDIIVAQGT EAGGHCGEVS TMVLVPEVIK AIKPIREVPV LAAGGIMTGR QMAACMAMGA
AGAWTGSVWL ATVESETSET FREKMIAASS RDAVRSKGRT GKPARQLRSV WTDAWDRGPD
SPGALPMPLQ SIISRDAFIA IDRAAAAGSA QARDLVSYFV GQGVGLIDSV KSAGAVVQEF
KQDFAEAVEH LDALVAS
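
The gene and protein sequences are shown in blocks of ten characters. As a minimal sketch of how the two relate, the Python below strips the block spacing, computes GC content, and translates the nucleotide sequence. The helper names are hypothetical. The codon table is built from the standard NCBI amino-acid ordering, on the understanding that translation table 11 assigns the same amino acids as the standard code and differs only in permitted start codons; input is assumed to contain only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq_text: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq_text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq_text)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGAAATCGC CGATCTGCGA GATGCTGGGC ATCGAGTTTC CGCTGCTCGC GTTCAGCCAT"
print(translate(first_line))  # prints: MKSPICEMLGIEFPLLAFSH
```

The translated fragment matches the first two blocks of the protein sequence. Passing the full gene sequence to translate and gc_percent would allow the whole record to be checked the same way.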