Gene Rpal_5189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5189 
Symbol 
ID6412889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5591465 
End bp5592433 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content65% 
IMG OID642715079 
Product2-nitropropane dioxygenase NPD 
Protein accessionYP_001994152 
Protein GI192293547 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.75587 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCATGC CTGCGTTGTT CAAGGGCCGC CTCTCGGTCC CGGTGATCGG CTCACCGTTG 
TTCATTATTT CGGGTCCCGA GCTGGTGATC GCGCAATGCA AGGCCGGCGT GGTCGGGTCG
TTTCCGGCGC TCAATGCACG TCCCGCCGCG CTGCTCGACG AGTGGCTGCA CCAGATCAAG
GAAGAGCTCG CGGCCTACGA CAAGGCGCAT CCCGAGCGTC CGTCGGCGCC GTTCGCCGTC
AACCAGATCG TACACAAATC CAACAACCGG CTCGATGCCG ACCTCGCGCT TTGTGAAAAG
CACAAGGTCC CGATGCTGAT CACCTCGCTC GGCGCCCGTG AAGAGCTGAA CCAGGCCGCG
CATAATTGGG GCGGCATCGT CTTCCACGAC GTCATCAATC AGAAGTTCGC CCATAAGGCG
GTCGAGAAAG GCGCCGACGG CCTGATCCTG GTCGCGGCCG GCGCCGGCGG CCATGCCGGC
ACGCAGTCGC CGTTCGCCTT CGTCACCGAA ACCCGCGCCT GGTACAACGG CCCGATCGCT
CTGTCCGGCG CGATCGCCAA TGGTCGCGCG ATCCGCGCCG CCCGCGTGCT CGGCGCCGAC
TTCGCCTATA TCGGCTCGGC CTTCATCGCC ACTAAGGAAG CCAATGCGGT CGACCGCTAC
AAGGAGATGA TCACCACCTC CGGCGCCGAC GACATCGTTT ATTCCAACCT GTTCACCGGC
GTGCACGGCA ACTACCTCAA GCCGTCGATC GTCGCGGCCG GCATGGACCC GGACAATCTC
GAACAGTCCG ATCCGTCGAA GATGAACTTC GGCACCGACG AGTCCGGCGA GCGCGCCAAG
CCGAAGGCCT GGAAGGAGAT CTGGGGAGCG GGCCAGGGCA TCGGCAGCAT CGACGCAGTG
CTGCCGGCCG GCGAACTGAT CGCCCGCTTC AAGAAGGAAT ACGACGAGGC GATCGACCCG
CCGCTGTGA
 
Protein sequence
MPMPALFKGR LSVPVIGSPL FIISGPELVI AQCKAGVVGS FPALNARPAA LLDEWLHQIK 
EELAAYDKAH PERPSAPFAV NQIVHKSNNR LDADLALCEK HKVPMLITSL GAREELNQAA
HNWGGIVFHD VINQKFAHKA VEKGADGLIL VAAGAGGHAG TQSPFAFVTE TRAWYNGPIA
LSGAIANGRA IRAARVLGAD FAYIGSAFIA TKEANAVDRY KEMITTSGAD DIVYSNLFTG
VHGNYLKPSI VAAGMDPDNL EQSDPSKMNF GTDESGERAK PKAWKEIWGA GQGIGSIDAV
LPAGELIARF KKEYDEAIDP PL