Gene Rsph17029_1421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1421 
Symbol 
ID4897116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1475330 
End bp1476796 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content67% 
IMG OID640112009 
Productcatalase 
Protein accessionYP_001043303 
Protein GI126462189 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0753] Catalase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.898508 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0406547 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGGA TGACCACGAC GGCCGGCGCG CCGATCGCCG ACAACCAGAA CACGCTGACC 
GCGGGTCCCC GCGGCCCGGT GCTCTTGGAG GATTACCAGC TGATCGAGAA GCTCGCGCAC
CAGAATCGCG AGCGCATTCC CGAGCGCGTT GTCCATGCCA AGGGCTGGGG CGCCTTCGGC
ACCTTCACGG TGACGAACGA CATCACCCGC TACAGCTGCG CCAAGGTCTT TTCGGAAGTG
GGCAAGCAGA CCGATCTCGC GGTGCGCTTC TCGACGGTGG CGGGCGAGCT CGGCGCGGCC
GACCATGAGC GCGACGTGCG CGGCTTCTCG GTGAAGATGT ATACCGAGGA AGGCAACTGG
GACCTCGCCG GAAACAATAC CCCGGTGTTC TTCGTGCGCG ACCCGATGAA GTTCCCCGAC
TTCATCCACA CCCAGAAGCG CCACCCGCGC ACCAACCTGC GCTCGGCCAC CGCGATGTGG
GACTTCTGGT CGCTCAGCCC CGAGTCGCTG CATCAGGTGA CGATCCTGAT GTCCGACCGC
GGCCTGCCGG TCGATCCGAT GCATATGAAC GGCTACGGCT CGCACACCTA TTCGCTGTGG
AACGCGCAGG GCGAGCGGTT CTGGGTCAAG TTCCACTGGA AGACCCTGCA GGGCCACGCG
CATTACACCA ATGCCGAGGC TGCCGAGATC GTCGGCCGCA CCCGCGAGGG CTATCAGGAG
GCGCTGTTCG GCGCCATCGA GGAGGGCCGC TTCCCGAAAT GGCGCCTCAT GGTTCAGGTG
ATGCCCGAGG CCGAGGCCGA GACCACGCCC TACAACCCGT TCGACCTGAC GAAGGTCTGG
CCGCATTCGG ACTATCCGCT GATCGAGGTC GGCGTGATGG AGCTGAACCG CAATGCGGAC
AACTACTTCG CCCAGATCGA GCAGCTGGCC TTCTCGCCCT CGAACAAGGT GCCGGGCATC
GGCTACAGCC CCGACAAGAT GCTGCAGGCC CGCGTCTTCT CCTATGCCGA CGCGCACCGC
TACCGGCTCG GCACGCACTA CGAGGCCCTG CCCGTGAACG CGCCGCGCTG CCCGGTGCAT
CACTACCACA AGGACGGCCA GATGAACTTC TTCGGCCATC GCACCGGCGC GGTCGATGCC
TATTACGAGC CGAACTCGGT GCCGGGCGCT GCCGTCGAGG ATCCGACCGT GGCCGAGCCG
CCGCTGCGCA TCTCGGGCGA TGCGGCGCGC TACAACCACC GCGAGGGCAA CGACGACTAC
AGCCAGCCCC GTGCCCTGCA TGACAAGGTG ATGACCGACG AGCAGCGGGC CCGGCTCTAT
GCCAACATCG CCGAAGCCAT GGCGGGCATC CCCGAGGAGA TCGCGAACCG GGCCATCGCC
CAGTTCGACC GGGTGAGCCC CGCCTACGGC GAGGGCATCC GGGCGGCCCG TGCGCGGCTC
GCCGGCGCCC TTCAGGCGGC CGAGTAG
 
Protein sequence
MTRMTTTAGA PIADNQNTLT AGPRGPVLLE DYQLIEKLAH QNRERIPERV VHAKGWGAFG 
TFTVTNDITR YSCAKVFSEV GKQTDLAVRF STVAGELGAA DHERDVRGFS VKMYTEEGNW
DLAGNNTPVF FVRDPMKFPD FIHTQKRHPR TNLRSATAMW DFWSLSPESL HQVTILMSDR
GLPVDPMHMN GYGSHTYSLW NAQGERFWVK FHWKTLQGHA HYTNAEAAEI VGRTREGYQE
ALFGAIEEGR FPKWRLMVQV MPEAEAETTP YNPFDLTKVW PHSDYPLIEV GVMELNRNAD
NYFAQIEQLA FSPSNKVPGI GYSPDKMLQA RVFSYADAHR YRLGTHYEAL PVNAPRCPVH
HYHKDGQMNF FGHRTGAVDA YYEPNSVPGA AVEDPTVAEP PLRISGDAAR YNHREGNDDY
SQPRALHDKV MTDEQRARLY ANIAEAMAGI PEEIANRAIA QFDRVSPAYG EGIRAARARL
AGALQAAE