Gene RPD_0617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0617 
Symbol 
ID4021086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp698230 
End bp700362 
Gene Length2133 bp 
Protein Length710 aa 
Translation table11 
GC content62% 
IMG OID637960805 
Productcatalase 
Protein accessionYP_567756 
Protein GI91975097 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0753] Catalase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACCCA AGAAGTCGCT TACCAAGAGC TCGGCTCCCG AGAATTCCGA AACCACATCC 
GCGACGATCG CGGATGCGAG CATTCAGCGC GGCCAAGGCG GCGAGACCCA TCAGATCGCG
ACCGGGAAAA CGCCGGTGCT CACCACGCGG CAAGGTGTCC CTGTCAGCGA CGACCAGAAC
AGCCTGAAGA TCGGTCCGCG CGGGCCGACA CTGATGGAAG ATTTCCATTT CCGCGAGAAG
ATCTTCCACT TCGACCACGA ACGCATCCCG GAGCGGGTGG TGCATGCCCG CGGCTTCGGC
GCGCATGGCT TTTTCGAAAC CTACGAGTCG CTCGCGGACA TCACCCGGGC GGATATTTTC
CAGCGCGCCG GGGAAAAGAC CCCGGCCTTT GTGCGCTTCT CGACGGTCGC CGGCAACAAA
GGCTCGGCGG ATCTCGCGCG CGATGTCCGC GGTTTTGCGG TCAAGCTGTA CACCCAGGAA
GGCAATTGGG ACATCGTCGG CAACAACATC CCGGTGTTCT TCATTCAGGA CGCGATCAAG
TTTCCCGACC TGATCCATGC CGCCAAGCAG GAGCCCGATC GCGGCTTTCC GCAGGCGCAG
ACCGCGCATG ACAATTTCTG GGATTTCATT TCGCTGATGC CGGAATCGAT CAACATGGCG
CTGTGGATCA TGTCCGATCG CACCATTCCG CGTTCGTTCC GCTTCATGGA GGGGTTCGGC
GTCCACACCT TCCGGCTGAT CAACGCCGAA GGCAAATCGA CCTACGTCAA ATTTCACTGG
AAGCCGAAGC TCGGGATGCA GTCGGTCGCG TGGAACGAGG CGGTGAAGCT GAACGGCGCC
GATCCGGATT TTCATCGCCG CGATCTATGG GACGCGATCC AGGCCGGCGA CTATCCGGAA
TGGGAGCTCG GCCTGCAATT GTTCGACGAC GCCTTCGCCG ACGAGTTCGA ATTCGACATT
CTCGACGCGA CGAAAATCAT TCCGGAAGAG CGCGTGCCGA TCCGCCGCGT CGGCCGGCTG
GTGCTGGACC GGACCGTCGA CAACTTCTTC AACGAGACCG AGCAGGTCGC CTTCCAGACC
GGCAACATCG TGCCCGGCGT CGACTTCACC AACGACCCGC TTCTGCAGGG TCGCAATTTC
TCCTATCTCG ACACCCAGCT GAAGCGGCTC GGGAGTCCGA ACTTCAACGA TCTGCCGATC
AACGCGCCGA AATGCCCGTT CCACACGTTC CAGCAGGACG GGCACATGAC GCTGAACAAT
CCCAAGGGCC GCGCCAATTA CGAGCCGAAT TCATGGGGCG CCGCGGGCGG ACCGCGCGAG
ACACCGAACG GATTTCAGAC CTTCCCCGCA GAAGAAACCG GCGAGCGGCG GCGCGCACGC
GGCGAATTGT TCGCCGATCA CTACAGTCAA GCCCGGCAGT TTTACATCAG CCAGACCGAG
ATCGAGCAGA CCCATATCAA GGACGCGTTC ACCTTCGAAC TGAGCAAGGT GGAGAGGCCC
GATATCCGGG CGCGCGTTGT TTCGCATTTG GTCAATGTCG ATGAGGAGCT GGCCGGGAAA
GTCGCCGACG GCCTCGGAAT GGAGCTGCCG GCCGCCGCCG AGCCTGCGCG CGCCGTGATG
ACCGACCTCG ATCCCTCGCC TGCACTCAGC ATCCTGAAAA ATGGACCCAA GGATTTTTCA
GGCCGCAAGA TCGGCATCCT GGTCAGCGAC GGCGCCGACG CCAAACTGCT GGCGGCGCTG
CAAGCCGCAG CCGGCAAGCA TGGCGTTCTG GTGGAGCTGG TCGCGCCGAA GGTCGGCGGC
TTCGAAACGT CCGACGGAGA GCTGCTGCCC GCCAAACAGA AGATCAATGG CGGGCCGTCG
GTTCTGTACG ACGCGGTGGC AATTCTCGTC TCCGACGAAG GCGCTGAAAT GCTGCTCGGC
GAGGCGACCG CCCGCGACTT CGTTGCCGAT GCCTTCGCCC ACGCCAAATT CATCGCCTAT
ACCGAAGCTG CGCAGCCGCT GCTCGACAAG GCCGGCGTCG AACCCGATGA CGGCTTCCTC
GCCTTGAGCA AGCCGGCCGA TGCCGAACGC TTCCTGAAGC TCTGCGCCAA GCTGCGCTTC
TGGGAACGCG AGGCAAGCGT TCACGCGGTG TAG
 
Protein sequence
MSPKKSLTKS SAPENSETTS ATIADASIQR GQGGETHQIA TGKTPVLTTR QGVPVSDDQN 
SLKIGPRGPT LMEDFHFREK IFHFDHERIP ERVVHARGFG AHGFFETYES LADITRADIF
QRAGEKTPAF VRFSTVAGNK GSADLARDVR GFAVKLYTQE GNWDIVGNNI PVFFIQDAIK
FPDLIHAAKQ EPDRGFPQAQ TAHDNFWDFI SLMPESINMA LWIMSDRTIP RSFRFMEGFG
VHTFRLINAE GKSTYVKFHW KPKLGMQSVA WNEAVKLNGA DPDFHRRDLW DAIQAGDYPE
WELGLQLFDD AFADEFEFDI LDATKIIPEE RVPIRRVGRL VLDRTVDNFF NETEQVAFQT
GNIVPGVDFT NDPLLQGRNF SYLDTQLKRL GSPNFNDLPI NAPKCPFHTF QQDGHMTLNN
PKGRANYEPN SWGAAGGPRE TPNGFQTFPA EETGERRRAR GELFADHYSQ ARQFYISQTE
IEQTHIKDAF TFELSKVERP DIRARVVSHL VNVDEELAGK VADGLGMELP AAAEPARAVM
TDLDPSPALS ILKNGPKDFS GRKIGILVSD GADAKLLAAL QAAAGKHGVL VELVAPKVGG
FETSDGELLP AKQKINGGPS VLYDAVAILV SDEGAEMLLG EATARDFVAD AFAHAKFIAY
TEAAQPLLDK AGVEPDDGFL ALSKPADAER FLKLCAKLRF WEREASVHAV