Gene Rsph17029_1075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1075 
Symbol 
ID4897641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1108484 
End bp1110304 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content66% 
IMG OID640111662 
ProductpepF/M3 family oligoendopeptidase 
Protein accessionYP_001042958 
Protein GI126461844 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR02290] oligoendopeptidase, pepF/M3 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.371255 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTGC CCCTCCCCCG CCCGGTCTTC GACGCCAACG CCAGTGCCGG GGGCCTCGGG 
AACCTGCCCG ACTGGGATCT GCGCGACCTC TATCCCACCC CGGACGGTCC GGAATTCCGC
GACGACATGG CCTGGCTCAA GGAGGCCTGC GCAGGCTTCG CCGCCAGCTA CGAGGGCAAG
CTTGCGAGCC TCGATGCGGC GGGCCTTCTC GCCTGCATCG AAGCCTACGA GAAGATCGAC
ATCGTGGCCG GGCGGCTCAT GTCCTACGCC GGCCTGCGCT ATTACCAGAA CACGATGGAC
AGCGAGCGCG CCAAGTTCAT GGCCGATGCG CAGGACAAGG TGACCGACTC CACGACCGCG
CTCGTCTTCT TCAGCCTCGA GTTCAACCGG CTGGAGGATG CCCATCTCGA AGCCCGTCTG
GCCGAAAGCG CGGCGCTCGC GCGCTACAAG CCCGTCTTCG ACCGGATGCG CGCCATGCGC
CCGCACCAGC TTTCGGACGA GCTGGAACGC TTCCTCCATG ACGAATCGAC CGTCGGCGCC
GCCGCCTGGA ACCGGCTCTT CGACGAGACG ATGGCGGGGC TCACCTTCAC GCTCGAGGGC
GAGGAGCTGA ACCTCGAATC CACCCTGAAC CTGCTGACCG ACCCCGAGCG CCCGCGCCGC
GAGGCCGCCG CCCGCGCTCT GGCGGAGGTC TTCGGCCGCA ACATCAAGCT CTTCGCGCGA
GTGCACAACA CGCTCGCGAA AGAGAAGGAG ATCCACGACC GCTGGCGCAA GATGCCCACG
CCGCAATATG GCCGGCACCT CGCGAACCAT GTCGAGCCCG AGGTGGTCGA GGCGCTGCGC
AATGCGGTGG TCGCGGCCTA TCCCAAGCTC TCGCACCGCT ACTACCGGCT GAAGGCCAAG
TGGCTGGGCC TCGAGAAGCT GCAGGTCTGG GACCGCAACG CCCCGCTGCC CACCGAGACG
CCGCGGCTCG TCGGCTGGGA CGAGGCGCAG TCGACGGTGA TGGAGGCCTA TTCGGCCTTC
GATCCGCGGA TGGCAGAGAT CGCGAAACCC TTCTTCGAAA AGGGCTGGAT CGATGCGGGC
GTGAAGCCCG GCAAGGCGCC CGGGGCCTTC GCTCATCCGA CCGTGACGAC CGTCCACCCC
TATGTGATGC TGAACTATCT CGGCAAACCG CGCGACGTGA TGACCCTCGC GCATGAGCTC
GGCCACGGCG TCCATCAGGT GCTGGCGGCG GGACAGGGGG AACTCCTCTC CTCGACGCCG
CTCACGCTGG CCGAGACGGC GAGCGTCTTC GGCGAGATGC TGACCTTCCG CAAGCTCCTC
GATGCGGCGC GGACCCCGGC CGAGCGGAAG ACGCTGCTGG CCGGCAAGGT CGAGGACATG
ATCAACACGG TCGTGCGCCA GATCGCCTTC TACGATTTCG AATGCAAGCT GCACGAGGCG
CGCCGGCAGG GCGAGCTCAC CCCCGAGGAC ATCAACGCCC TGTGGATGAG CGTGCAGGCC
GAAAGCCTCG GCGATGCGTT CGAGTTCATG GAAGGATACG AGACCTTCTG GTCCTACGTT
CCGCATTTCG TCCATTCGCC CTTCTACGTC TATGCCTATG CTTTCGGCGA CGGGCTGGTG
AATGCGCTCT ATGCCGTCTA TGCCGAGGGC ACTCCGGGCT TTCAGGACAA ATATTTCGAG
ATGCTCTCGG CGGGCGGCTC CAAGCATCAC AAAGAGCTTC TCGCCCCCTT CGGCCTCGAT
GCGAGCGACC CGACCTTCTG GGACAAGGGG TTGAGCATGA TCGCAGGCTT CATCGACGAG
CTCGAAGCCA TGGACGATTG A
 
Protein sequence
MTLPLPRPVF DANASAGGLG NLPDWDLRDL YPTPDGPEFR DDMAWLKEAC AGFAASYEGK 
LASLDAAGLL ACIEAYEKID IVAGRLMSYA GLRYYQNTMD SERAKFMADA QDKVTDSTTA
LVFFSLEFNR LEDAHLEARL AESAALARYK PVFDRMRAMR PHQLSDELER FLHDESTVGA
AAWNRLFDET MAGLTFTLEG EELNLESTLN LLTDPERPRR EAAARALAEV FGRNIKLFAR
VHNTLAKEKE IHDRWRKMPT PQYGRHLANH VEPEVVEALR NAVVAAYPKL SHRYYRLKAK
WLGLEKLQVW DRNAPLPTET PRLVGWDEAQ STVMEAYSAF DPRMAEIAKP FFEKGWIDAG
VKPGKAPGAF AHPTVTTVHP YVMLNYLGKP RDVMTLAHEL GHGVHQVLAA GQGELLSSTP
LTLAETASVF GEMLTFRKLL DAARTPAERK TLLAGKVEDM INTVVRQIAF YDFECKLHEA
RRQGELTPED INALWMSVQA ESLGDAFEFM EGYETFWSYV PHFVHSPFYV YAYAFGDGLV
NALYAVYAEG TPGFQDKYFE MLSAGGSKHH KELLAPFGLD ASDPTFWDKG LSMIAGFIDE
LEAMDD