Gene Rsph17029_1203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1203 
Symbol 
ID4895884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1249198 
End bp1250397 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content73% 
IMG OID640111789 
Productpeptidase M23B 
Protein accessionYP_001043085 
Protein GI126461971 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.104562 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCACC GTCCCACGCG CGCAGGCGCC CTTTGCATGA CGGCGACGCT GACGATCCTC 
GCCGCCTGCA GCACTTCGGA CCTAGACTGG GACCTGCGCG GTCGCCCCGG CGGGCTCAGC
ACGGCCGAGG CCGCGCGGGC GGTCAGCGCC CCCCGCCCCC GCGCCGACGA CCGCGGGATC
ATCTCCTATC CGACCTATCA GGTGGCCGTT GCCCGTCAGG GCGAGACGGT CGCCTCGCTC
TCGAGCCGCC TCGGGCTCGA TGCCACGCAG GTCGCGAGCT ACAACGCCCT CTCGCCGCAG
AACCCTCTGC GCGCCGGAGA AGTGGTCGTG CTGCCGCAGC GCGTGGCGGC GGCTCCCGCC
ATGACCCCGG CGCCCGTCAT GACGGCGCCC GGCGCGGCGA GCCCCGGCGG CATCGACGTG
ACCGCCATCG CCACGAGCGC CCTCGACCGG GCAGGCCCTG CCCCGGCGCC GGTGGCCGCC
GCTCCCGCTG CGGCCCCCGC GCAGTCTGCC GCGACCGAGC CGGCCCGCCA CCGCGTGTCG
CGGGGAGAGA CCGCCTATTC GATCGCGCGC AGCTACAATG TCTCGCCCAA GGCGCTGGCG
GACTGGAACG GGCTCGGGCC GGATCTTGCG ATCCGCGAGG GCCAGTATCT GATGATCCCG
ACCGCCTCTG CGCCCCCGCC CACGGTGCCC GCCAACGTGA CCGCGGTCAC GGTGCCCGGG
GCAGGCTCGC CGACGCCCAC CCCGCCCTCG GCGGCCAAGC CGCTGCCCGC CGAGTCGACC
ACGCCCGCCT CGAAACCCTC AGGCCAGCCC GCCTCACCCG ACATGGGCGC ACAGCGCACG
CAGGCCTCGG CCTCGCGGCT GGGATTCCCG GTGCAGGGCA AGATCATCCG CGGCTATGTG
AAGAAGAAGA ACGACGGCAT CGACATCTCG GCGGCCGTGG GCACGCCGGT GGCGGCGGCC
GCGGACGGGA CGGTGGCGGC CATCACGCAG GACACCGATC AGGTGCCGAT CCTCGTGATC
CGGCACCCCG ACAACCTGCT GACGGTCTAT GCCAATATCG ACGGCATCAA GGTCACCAAG
GGTGCCAGCG TGAAGCGCGG ACAGCCCATC GCCGTGGTGC GCGCGGCCGA CCCGCCCTTC
GTCCATTTCG AGGTCCGCAA GGGGTTCGAG AGCGTGGATC CGATGCCCTA CCTCCAGTAG
 
Protein sequence
MFHRPTRAGA LCMTATLTIL AACSTSDLDW DLRGRPGGLS TAEAARAVSA PRPRADDRGI 
ISYPTYQVAV ARQGETVASL SSRLGLDATQ VASYNALSPQ NPLRAGEVVV LPQRVAAAPA
MTPAPVMTAP GAASPGGIDV TAIATSALDR AGPAPAPVAA APAAAPAQSA ATEPARHRVS
RGETAYSIAR SYNVSPKALA DWNGLGPDLA IREGQYLMIP TASAPPPTVP ANVTAVTVPG
AGSPTPTPPS AAKPLPAEST TPASKPSGQP ASPDMGAQRT QASASRLGFP VQGKIIRGYV
KKKNDGIDIS AAVGTPVAAA ADGTVAAITQ DTDQVPILVI RHPDNLLTVY ANIDGIKVTK
GASVKRGQPI AVVRAADPPF VHFEVRKGFE SVDPMPYLQ