Gene RPC_1956 details

Gene Information

Locus tag: RPC_1956
Symbol:
ID: 3973588
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2127554
End bp: 2128882
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 67%
IMG OID: 637925067
Product: Tat-translocated enzyme
Protein accession: YP_531832
Protein GI: 90423462
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2837] Predicted iron-dependent peroxidase
TIGRFAM ID: [TIGR01412] Tat-translocated enzyme
            [TIGR01413] Dyp-type peroxidase family
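The Start bp, End bp, Gene Length, and Protein Length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for RPC_1956.
length = gene_length_bp(2127554, 2128882)
print(length)                     # 1329
print(protein_length_aa(length))  # 442
```

Both results match the Gene Length (1329 bp) and Protein Length (442 aa) fields listed in the record.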


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.351506
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCTC ACGATCAAGA CGCGGCCGGC TCATCAAGCG ACGCGCCGGC GTCGCCGCAA 
CGCCGCGGCC TGCTGCTCGG ATTGGCTGCG GGCAGCCTCG GTCTGATGCC GTCGGCTGCG
CCGGCGGCCG TTACTACACC CGCCGCGGTG TCATCGCTGG TGATCCCGTT TCACGGTCCG
CATCAGGCCG GGATCACAAC GCCGCAACCC TATGCCGGCC TCGTCGCCGC GTTCGATGTG
CTCGCGGAGA ACCGCGAGGA GTTGCGCCAG CTCTTCATCA CACTGACGAC CCGCGCGGAA
TTTCTCATGC ATGGCGGCAC GCCGCCCGAG CTCGACGAGG CGCTGCCGCC GGCCGACAGC
GGCGTGTTGG GGCCCACCAT CAAACCCGAA CACCTCACCA TGACGGTCGC GGTCGGCGCT
TCGTTGTTCG ACGGCCGGTT CGGCCTCGCA CCGCTGAAGC CGAAGCAGCT CGCCGAGATG
GAGGACTTTC CCAACGACGC GCTCGACGCC GAGCTGTGCC ACGGCGACCT GATGATCCAG
TTCTGCGCCG AGACCCCCGA AGAGGTGATT CATGCGCTGC GCGACATCGT CAAGGCGACG
CCGGACCTGC TCGCGATCAA GTGGAAGCAG GAGGGCTTCG CCGCCACCCA TGGCGCGCGC
AGTGGGCCGA TCGGCACCGG ACGCAATCTG CTCGGCTTCA AGGACGGCAC CGCCAACGCC
GACATCCGCG ACGATGCGAT GATGAACGCC TACGTCTGGG TGCAGCCGGA TGCCGGCGAG
CCGGCCTGGA GCGTCGGCGG CAGCTATCAG GTGGTGCGGC TGGTGCGCAA TTTCGTCGAG
CGCTGGGATC GCACGCCGCT CAGCGAGCAG GAGGCGATCT TCGGACGCGC GCGCAATTCC
GGCGCGCCGC TCGGCAAGGC CGACGAATTC GACGAGCCCG GCTACCCAGC CGACCCGAAC
GGCGAGAAGA TCCCGCTGAC CTCGCATATC CGCCTCGCCA ACCCGCGGAC TCCGGAAGCA
ACGGCGCGAC TGATCCGGCG CGGCTTCAAC TATTCCAACG GCGTCAGCAA ATCGGGCCAG
CTCGACATGG GCCTGTTGTT CGTCTCGTTC CAATCGGATC TGGCGCAGGG CTTCATCGCC
ACGCAGAACC GGCTGAACGG CGAGCCGCTC GAGGAATACA TCAAGCCGTT CGGCGGCGGT
TATTACTTCG TGCTTCCGGG CGTTGCCGCG CCGGGCGGTC ATCTCGGCCA AGGCCTGTTC
GCCGACGCCG CATCCCTTCC CACTCCCTCG CAACAGGCGC CCGCAAAAGC GCCCTCGAAC
AAAGGCTGA
 
Protein sequence
MKAHDQDAAG SSSDAPASPQ RRGLLLGLAA GSLGLMPSAA PAAVTTPAAV SSLVIPFHGP 
HQAGITTPQP YAGLVAAFDV LAENREELRQ LFITLTTRAE FLMHGGTPPE LDEALPPADS
GVLGPTIKPE HLTMTVAVGA SLFDGRFGLA PLKPKQLAEM EDFPNDALDA ELCHGDLMIQ
FCAETPEEVI HALRDIVKAT PDLLAIKWKQ EGFAATHGAR SGPIGTGRNL LGFKDGTANA
DIRDDAMMNA YVWVQPDAGE PAWSVGGSYQ VVRLVRNFVE RWDRTPLSEQ EAIFGRARNS
GAPLGKADEF DEPGYPADPN GEKIPLTSHI RLANPRTPEA TARLIRRGFN YSNGVSKSGQ
LDMGLLFVSF QSDLAQGFIA TQNRLNGEPL EEYIKPFGGG YYFVLPGVAA PGGHLGQGLF
ADAASLPTPS QQAPAKAPSN KG
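The gene and protein sequences above are related by translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch of both calculations, not the method used to produce this record. It assumes the space-separated 10-base blocks are purely cosmetic, and it relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). All function names are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional TCAG ordering.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases into blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
print(translate("ATGAAAGCTC ACGATCAAGA CGCGGCCGGC"))  # MKAHDQDAAG
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.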