Gene RPB_3895 details

Gene Information

Locus tag: RPB_3895
Symbol:
ID: 3911699
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4450910
End bp: 4452181
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 64%
IMG OID: 637885796
Product: ring hydroxylating dioxygenase, Rieske (2Fe-2S) protein
Protein accession: YP_487499
Protein GI: 86751003
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
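
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The Python sketch below is one way to do that. It assumes the coordinates are inclusive positions on the replicon and that the gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record. Treating both as inclusive
# positions is an assumption about the coordinate convention.
START_BP = 4450910
END_BP = 4452181


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature whose start and end are both inclusive."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len, protein_length_aa(gene_len))  # 1272 423
```

Both values agree with the Gene Length (1272 bp) and Protein Length (423 aa) fields.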


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGCC TGGCGGAGCT CTCCACCAAT CCCGACGACC TGATCCGCAA GGATCGCGTT 
CGCACCCCGC TCTACACCGA TCCGGCGATC TTCGACGCCG AGATGACCAA GATCTTCCGC
AATACATGGG TCTGGGTCGC GCATGTCAGC GAGATCGCCA ACAACGGCGA CTTCAAGATG
GCGCAGGTCG GGCTCGAGCC GGTGATCGTG GTGCGCGACC GCAAGGGCAA GGTCCACGTC
CACCTCAACC GCTGCCGGCA TCGCGGCGCG ACGCTGTGCG AAGTGCGCAA GGGCAAGACC
TCCAGCTTGG TCTGCCCGTA TCACGGCTGG GGCTATTCGC TCGACGGCAA GCTGCGCGGC
GTGCCTTACG AGAACGGCTA CGACGAGACG ACGCTCGACC GCAACGAGCT GTCTTTGAAG
TCGCTGCGGG TCGAGGAATA TAACGGCCTG ATCTTCGCGA CCTTCCGCGA CGACATCGAG
CCCCTGGCCG ACTTCCTCGG GCCGGCGAAG AAGTGGATCG ACCTGTTCAT GAAGCAGGGC
GGCGGGTTTC CGGTGAAGGT GCTCGGCGAA CACAAGTTCC GCTTTCCCGG CAACTGGAAG
ATCCAGCTCG AGAACACCAC CGACGCCTAT CACTTCCCGC TGGTGCACAA ATCCTTCCTC
ACCGCGGTCG ACAAGGAAAC CGAGCAGCAG CTCGACATGC TGAACAATGG CGGCTTCGTC
GAGGATCTCG GCAACGGCCA CAGCGTGATG GTGATGATCC CCGAACTGAT CGACCTCGAC
GACAATCTCG AGGCGCCGAT CCCGGAGAAG TTCGCCGATC TCGCCAAGGA GCTCGTCGCC
GAAGGCTATT CCGACACGGA AGTCCGGCGC ATCGTCCGCG CCGCCGGCGG CTCCGGCTTC
AACATCAATC TGTTTCCGAA TCTGGCCTGC TCGATGGCGT TCTTCCGCGT GCTGCAGCCG
GTCTCGGTGA ACGAGACCGA GATCCGCCAC ATCGCGATCG GCATGGACGG CGGGCCGGCG
GCGGCCAACC GCGCGCGGAT CCGCCTGCAC GAATTCTTCC AGGGCCCGAT GGGCTTCGGC
AGCCCCGACG ACGCTGAGGT GTGGGATCGC GTCCAGCACG GCGCGCAGGG CGGCGACGAG
ATGTGGATCA TGCTGAACCG CGGTTTTCCA AAGGAGGAAA ACCTGCCCGA CGGGCGCGCC
CGCAGCGACG TCAGCGCCGA AACCGGGATG CGCGCCGCCT ATGATCAATG GAAGCGGATG
ATGACGGTCT GA
 
Protein sequence
MTSLAELSTN PDDLIRKDRV RTPLYTDPAI FDAEMTKIFR NTWVWVAHVS EIANNGDFKM 
AQVGLEPVIV VRDRKGKVHV HLNRCRHRGA TLCEVRKGKT SSLVCPYHGW GYSLDGKLRG
VPYENGYDET TLDRNELSLK SLRVEEYNGL IFATFRDDIE PLADFLGPAK KWIDLFMKQG
GGFPVKVLGE HKFRFPGNWK IQLENTTDAY HFPLVHKSFL TAVDKETEQQ LDMLNNGGFV
EDLGNGHSVM VMIPELIDLD DNLEAPIPEK FADLAKELVA EGYSDTEVRR IVRAAGGSGF
NINLFPNLAC SMAFFRVLQP VSVNETEIRH IAIGMDGGPA AANRARIRLH EFFQGPMGFG
SPDDAEVWDR VQHGAQGGDE MWIMLNRGFP KEENLPDGRA RSDVSAETGM RAAYDQWKRM
MTV
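
The gene and protein sequences are related through translation table 11 (the bacterial, archaeal and plant plastid code). As a hedged illustration, the Python sketch below translates short excerpts of the gene sequence and computes a GC fraction. It is a minimal sketch, not the method the database used. The amino-acid string is the standard assignment shared by NCBI tables 1 and 11, alternative start codons are not handled (this gene starts with ATG), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first 30 bases and the last row of the gene are embedded here; paste the full sequence to check the whole record.

```python
# Codon table built in NCBI order: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# Excerpts copied from the gene sequence above.
FIRST_30_BASES = "ATGACCAGCC TGGCGGAGCT CTCCACCAAT"
LAST_ROW = "ATGACGGTCT GA"

print(translate(FIRST_30_BASES))  # MTSLAELSTN
print(translate(LAST_ROW))        # MTV
```

The two translated excerpts match the first ten and last three residues of the protein sequence. Calling `gc_fraction` on the full gene sequence gives a value to compare with the GC content field.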