Gene Information |
Locus tag | Rsph17029_1075 |
Symbol | |
ID | 4897641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 1108484 |
End bp | 1110304 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640111662 |
Product | pepF/M3 family oligoendopeptidase |
Protein accession | YP_001042958 |
Protein GI | 126461844 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR02290] oligoendopeptidase, pepF/M3 family |
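The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1108484
end_bp = 1110304
gene_length_bp = 1821
protein_length_aa = 606


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming an unspliced CDS whose
    length includes the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both reported lengths agree with the coordinates under these assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under those assumptions, the 1821 bp span corresponds to 607 codons, one of which is the stop codon, which matches the reported 606 aa protein.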
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.371255 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGCTGC CCCTCCCCCG CCCGGTCTTC GACGCCAACG CCAGTGCCGG GGGCCTCGGG AACCTGCCCG ACTGGGATCT GCGCGACCTC TATCCCACCC CGGACGGTCC GGAATTCCGC GACGACATGG CCTGGCTCAA GGAGGCCTGC GCAGGCTTCG CCGCCAGCTA CGAGGGCAAG CTTGCGAGCC TCGATGCGGC GGGCCTTCTC GCCTGCATCG AAGCCTACGA GAAGATCGAC ATCGTGGCCG GGCGGCTCAT GTCCTACGCC GGCCTGCGCT ATTACCAGAA CACGATGGAC AGCGAGCGCG CCAAGTTCAT GGCCGATGCG CAGGACAAGG TGACCGACTC CACGACCGCG CTCGTCTTCT TCAGCCTCGA GTTCAACCGG CTGGAGGATG CCCATCTCGA AGCCCGTCTG GCCGAAAGCG CGGCGCTCGC GCGCTACAAG CCCGTCTTCG ACCGGATGCG CGCCATGCGC CCGCACCAGC TTTCGGACGA GCTGGAACGC TTCCTCCATG ACGAATCGAC CGTCGGCGCC GCCGCCTGGA ACCGGCTCTT CGACGAGACG ATGGCGGGGC TCACCTTCAC GCTCGAGGGC GAGGAGCTGA ACCTCGAATC CACCCTGAAC CTGCTGACCG ACCCCGAGCG CCCGCGCCGC GAGGCCGCCG CCCGCGCTCT GGCGGAGGTC TTCGGCCGCA ACATCAAGCT CTTCGCGCGA GTGCACAACA CGCTCGCGAA AGAGAAGGAG ATCCACGACC GCTGGCGCAA GATGCCCACG CCGCAATATG GCCGGCACCT CGCGAACCAT GTCGAGCCCG AGGTGGTCGA GGCGCTGCGC AATGCGGTGG TCGCGGCCTA TCCCAAGCTC TCGCACCGCT ACTACCGGCT GAAGGCCAAG TGGCTGGGCC TCGAGAAGCT GCAGGTCTGG GACCGCAACG CCCCGCTGCC CACCGAGACG CCGCGGCTCG TCGGCTGGGA CGAGGCGCAG TCGACGGTGA TGGAGGCCTA TTCGGCCTTC GATCCGCGGA TGGCAGAGAT CGCGAAACCC TTCTTCGAAA AGGGCTGGAT CGATGCGGGC GTGAAGCCCG GCAAGGCGCC CGGGGCCTTC GCTCATCCGA CCGTGACGAC CGTCCACCCC TATGTGATGC TGAACTATCT CGGCAAACCG CGCGACGTGA TGACCCTCGC GCATGAGCTC GGCCACGGCG TCCATCAGGT GCTGGCGGCG GGACAGGGGG AACTCCTCTC CTCGACGCCG CTCACGCTGG CCGAGACGGC GAGCGTCTTC GGCGAGATGC TGACCTTCCG CAAGCTCCTC GATGCGGCGC GGACCCCGGC CGAGCGGAAG ACGCTGCTGG CCGGCAAGGT CGAGGACATG ATCAACACGG TCGTGCGCCA GATCGCCTTC TACGATTTCG AATGCAAGCT GCACGAGGCG CGCCGGCAGG GCGAGCTCAC CCCCGAGGAC ATCAACGCCC TGTGGATGAG CGTGCAGGCC GAAAGCCTCG GCGATGCGTT CGAGTTCATG GAAGGATACG AGACCTTCTG GTCCTACGTT CCGCATTTCG TCCATTCGCC CTTCTACGTC TATGCCTATG CTTTCGGCGA CGGGCTGGTG AATGCGCTCT ATGCCGTCTA TGCCGAGGGC ACTCCGGGCT TTCAGGACAA ATATTTCGAG ATGCTCTCGG CGGGCGGCTC CAAGCATCAC AAAGAGCTTC TCGCCCCCTT CGGCCTCGAT GCGAGCGACC CGACCTTCTG GGACAAGGGG TTGAGCATGA TCGCAGGCTT CATCGACGAG CTCGAAGCCA TGGACGATTG A
Protein sequence | MTLPLPRPVF DANASAGGLG NLPDWDLRDL YPTPDGPEFR DDMAWLKEAC AGFAASYEGK LASLDAAGLL ACIEAYEKID IVAGRLMSYA GLRYYQNTMD SERAKFMADA QDKVTDSTTA LVFFSLEFNR LEDAHLEARL AESAALARYK PVFDRMRAMR PHQLSDELER FLHDESTVGA AAWNRLFDET MAGLTFTLEG EELNLESTLN LLTDPERPRR EAAARALAEV FGRNIKLFAR VHNTLAKEKE IHDRWRKMPT PQYGRHLANH VEPEVVEALR NAVVAAYPKL SHRYYRLKAK WLGLEKLQVW DRNAPLPTET PRLVGWDEAQ STVMEAYSAF DPRMAEIAKP FFEKGWIDAG VKPGKAPGAF AHPTVTTVHP YVMLNYLGKP RDVMTLAHEL GHGVHQVLAA GQGELLSSTP LTLAETASVF GEMLTFRKLL DAARTPAERK TLLAGKVEDM INTVVRQIAF YDFECKLHEA RRQGELTPED INALWMSVQA ESLGDAFEFM EGYETFWSYV PHFVHSPFYV YAYAFGDGLV NALYAVYAEG TPGFQDKYFE MLSAGGSKHH KELLAPFGLD ASDPTFWDKG LSMIAGFIDE LEAMDD