Gene Information |
Locus tag | Rsph17025_1570 |
Symbol | |
ID | 5082381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 1610323 |
End bp | 1612143 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640483128 |
Product | pepF/M3 family oligoendopeptidase |
Protein accession | YP_001167768 |
Protein GI | 146277609 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR02290] oligoendopeptidase, pepF/M3 family |
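The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete and unspliced (the record lists "Is gene spliced | No"); the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1610323
END_BP = 1612143
GENE_LENGTH_BP = 1821
PROTEIN_LENGTH_AA = 606


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1612143 - 1610323 + 1 gives 1821 bp, and 1821 / 3 - 1 gives 606 aa, which matches the listed gene and protein lengths.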
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.713222 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.288392 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCTGC CCCTGCCTCG CCCGGTCCAT GACGCCAACG CCAGCGCCGG GGGCCTAGGG AACCTGCCCG AATGGGATCT GCGCGACCTC TATCCCGCGC CCGACGCGGC CGAGGTTGGT CAGGACATGG CCTGGCTCAA GTCCGCCTGC GCCGACTTCG CCGCCACCTA CGAAGGCAAG CTGGCAAGCC TCGACGCGGC GGGCCTGCTG ACCTGCGTCG AGACCTACGA GAAGATCGAC ATCACCGCCG GGCGGCTGAT GTCCTACGCG GGGCTGCGCT ACTACCAGAA CACGATGGAC AGCGAGCGCG CCAAGTTCAT GGCCGACGCG CAGGACAAGG TGACGGACTA CACCACCGCG CTCGTCTTCT TCAGCCTCGA GTTCAACCGG CTCGACGACG CTCACCTCGA GTCGCGGCTG GCGGAAAGCC CCGCCCTTGC CCGCTACCGG CCCGTCTTCG ACCGGATGCG CGCCATGCGC CCGCACCAGC TGTCGGACGA GCTGGAACGG TTCCTCCACG ACCAGTCCAC CGTGGGCGCC GCCGCGTGGA ACCGGCTCTT CGACGAGACG ATGGCAGGGC TCACCTTCAC GATCGAGGGC GAGGAGCTGA ACCTCGAATC CACCCTCAAC CTGCTGACCG ACCCCGAGCG GTCGCGCCGC GAGGCCGCCG CCCGCGCGCT GGCCGATGTG TTTGGCGGCC ACATCAAGCT CTTTGCCCGC GTGCACAACA CGCTGGCCAA GGAAAAGGAG ATCCACGACC GCTGGCGCAA GATGCCCACC CCTCAATATG GCCGGCACCT CGCCAACCAT GTCGAGCCCG AGGTGGTCGA GGCGCTGCGC AACGCCGTGG TCGCGGCCTA CCCCAAGCTC TCGCACCGCT ATTACCGGCT GAAGGCGAAA TGGATGGGGC TCGACAAGCT GCAGGTCTGG GACCGCAACG CCCCGCTGCC GATCGAGACC CCGCGCCTCG TGGACTGGAC AGAGGCGCAG GCCACGGTGC TCGAGGCCTA TTCGGCCTTC GATCCGAGGA TGGCCGAGAT CGCGCAGCCC TTCTTCGACA AGGGCTGGAT CGACGCGGGC GTGAAGCCCG GCAAGGCCCC CGGCGCCTTT GCCCACCCGA CCGTGACCAC CGTCCACCCC TATGTGATGC TGAACTATCT CGGCAAGCCG CGCGACGTGA TGACGCTGGC GCATGAGCTG GGTCACGGCG TCCATCAGGT GCTGGCGGCG GGACAGGGAG AGCTGCTTTC CTCCACGCCC CTCACCCTCG CCGAAACGGC GAGCGTGTTC GGCGAGATGC TGACCTTCCG CAAGCTCCTC GATGCCGCCA GGACCCCGGC CGAACGCAAG ACGCTGCTGG CCGGCAAGGT CGAGGACATG ATCAACACGG TCGTGCGCCA GATCGCCTTC TACGACTTCG AATGCAAGCT GCACGAGGCG CGCCGGCAGG GCGAGCTGAC GCCCGAGGAC ATCAACGCCC TCTGGATGAG CGTGCAGGCC CAAAGCCTCG GCGATGCGTT CGAGTTCATG GAGGGCTACG AGACCTTCTG GTCCTACATC CCCCATTTCG TCCATTCGCC CTTCTACGTC TATGCCTATG CCTTTGGCGA CGGTCTGGTG AATGCGCTCT ATGCCGTCTA TGCCGAGGGG GGCGAAGGCT TCGAGGACAA GTATTTCGAG ATGCTCGCGG CGGGCGGCTC CAAGCACCAC AAGGAGCTTC TGGCGCCCTT CGGCCTCGAT GCGAGCGACC CGACCTTCTG GGACAAGGGC CTGAGCATGA TCGCAGGCTT CATCGACGAG 
CTGGAAGCCA TGGACGGTTG A
|
Protein sequence | MTLPLPRPVH DANASAGGLG NLPEWDLRDL YPAPDAAEVG QDMAWLKSAC ADFAATYEGK LASLDAAGLL TCVETYEKID ITAGRLMSYA GLRYYQNTMD SERAKFMADA QDKVTDYTTA LVFFSLEFNR LDDAHLESRL AESPALARYR PVFDRMRAMR PHQLSDELER FLHDQSTVGA AAWNRLFDET MAGLTFTIEG EELNLESTLN LLTDPERSRR EAAARALADV FGGHIKLFAR VHNTLAKEKE IHDRWRKMPT PQYGRHLANH VEPEVVEALR NAVVAAYPKL SHRYYRLKAK WMGLDKLQVW DRNAPLPIET PRLVDWTEAQ ATVLEAYSAF DPRMAEIAQP FFDKGWIDAG VKPGKAPGAF AHPTVTTVHP YVMLNYLGKP RDVMTLAHEL GHGVHQVLAA GQGELLSSTP LTLAETASVF GEMLTFRKLL DAARTPAERK TLLAGKVEDM INTVVRQIAF YDFECKLHEA RRQGELTPED INALWMSVQA QSLGDAFEFM EGYETFWSYI PHFVHSPFYV YAYAFGDGLV NALYAVYAEG GEGFEDKYFE MLAAGGSKHH KELLAPFGLD ASDPTFWDKG LSMIAGFIDE LEAMDG
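The sequences above are displayed in space-separated groups of 10 letters, so they need to be normalized before any downstream use. The sketch below is one possible way to do that and to run a basic sanity check on a coding sequence. It assumes translation table 11 (as listed in the record), whose stop codons are TAA, TAG and TGA, and it only accepts ATG as a start codon, which is a simplification since table 11 also permits alternative starts. The function names and the toy input are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of translation table 11


def normalize_sequence(grouped: str) -> str:
    """Remove the whitespace separating the 10-letter display groups."""
    return "".join(grouped.split()).upper()


def looks_like_complete_cds(dna: str) -> bool:
    """Rough check: length divisible by 3, ATG start, recognized stop codon.

    Simplification: only ATG is treated as a start codon here.
    """
    return (
        len(dna) >= 6
        and len(dna) % 3 == 0
        and dna[:3] == "ATG"
        and dna[-3:] in STOP_CODONS
    )


# Toy input for illustration only, not the full gene sequence.
toy = normalize_sequence("ATGACGCTGC CCTGA")
print(toy)
print(looks_like_complete_cds(toy))
```

Applied to the full gene sequence from the record, the same two calls would strip the grouping and line break before checking the start and stop codons.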