Gene Rsph17025_2914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2914 
Symbol 
ID5084514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2971860 
End bp2972951 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content73% 
IMG OID640484485 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_001169105 
Protein GI146278946 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.118782 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.174393 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCACC CGCTCACCTT CCTCGGCATC GAGAGCAGCT GCGACGACAC CGCGGCGGCC 
GTGGTGCGCG CGGCCGAGCG GGCCGAGATC CTGTCCTCGG TGGTGGACGG GCAGGCCGCG
CTGCACGCGC CCTTCGGCGG CGTGGTGCCG GAAATCGCGG CCCGCGCCCA TGCCGAGCGG
CTCGACCTCT GCGTCGAACG CGCGCTGCAG GAGGCCGGGC TGGGTCTTGG CGATCTCGAC
GGGATCGCGG TGACGGCGGG GCCGGGCCTG ATCGGGGGCG TGCTGTCGGG CGTGATGCTG
GCCAAGGGGC TGGCGGCGGG AACGGGCCTG CCGCTCGTGG GGGTGAATCA CCTCGCGGGC
CACGCGCTCA CACCACGGCT GACCGACGCG CTTGCCTTTC CCTATCTGAT GCTTCTCGTG
TCGGGAGGTC ATTGCCAGTT CCTGATCGCT CGTGGAGCGG AAGCGTTTTC GCGCCTTGGC
GGTTCCATCG ACGATGCGCC GGGCGAGGCT TTCGACAAGA CCGCCAAGCT TCTGGGCCTG
CCGCAACCCG GAGGCCCCTC GGTCGAGGCC GAGGCGGCCA CGGGCGATCC GCGCCGCTTC
GCCTTTCCGC GGCCGATGCT GGACCGGCCG GGGTGCGACA TGTCCTTTTC GGGGCTGAAG
ACCGCGCTGC TCCGGGCCCG CGACGGGATC GTGGCGGAGA AGGGCGGGAT CACGCGGCAG
GATCGGGCCG ATCTCTGCGC GGGCTTTCAG GCGGCCGTGG TGGATGTGCT GGCGGAAAAG
ACCCGCCGCG CGCTCGCGAT CTATGCGGAG GAACAGGCGC CCGTGCCCGC GCTGGCGGTG
GCCGGCGGGG TGGCGGCCAA CGGGCCGATC CGCGCGGCGC TGACCCGCGT GGCCGAGGAG
GCGGGCGCGC GCTTCCTCGC CCCGCCGCTG CGGCTCTGCA CGGACAATGC CGCCATGATC
GCCTGGGCGG GCATCGAGAG GTTTCGGGCG GGCGGCCGCG ACGGGATGGA TCTGCAGGCC
CGTCCGCGCT GGCCGCTCGA CCAGAGCGCG CCGGCCCTGA TCGGGTCGGG CAGGAAGGGG
GCAAAGGCAT GA
 
Protein sequence
MSHPLTFLGI ESSCDDTAAA VVRAAERAEI LSSVVDGQAA LHAPFGGVVP EIAARAHAER 
LDLCVERALQ EAGLGLGDLD GIAVTAGPGL IGGVLSGVML AKGLAAGTGL PLVGVNHLAG
HALTPRLTDA LAFPYLMLLV SGGHCQFLIA RGAEAFSRLG GSIDDAPGEA FDKTAKLLGL
PQPGGPSVEA EAATGDPRRF AFPRPMLDRP GCDMSFSGLK TALLRARDGI VAEKGGITRQ
DRADLCAGFQ AAVVDVLAEK TRRALAIYAE EQAPVPALAV AGGVAANGPI RAALTRVAEE
AGARFLAPPL RLCTDNAAMI AWAGIERFRA GGRDGMDLQA RPRWPLDQSA PALIGSGRKG
AKA