Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_2052 |
Symbol | |
ID | 5455356 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 2239316 |
End bp | 2241160 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640877629 |
Product | pepF/M3 family oligoendopeptidase |
Protein accession | YP_001413323 |
Protein GI | 154252499 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR02290] oligoendopeptidase, pepF/M3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.841739 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.00000309933 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTGCGTC TCAAGCCCGT TTCTGCCCCT GCCTCATCCG TGAGTAACGC AAAAGATCAG CAGGAATTGC TGGGCGAGCT GCCGGTCTGG AACCTGGGCG ATCTTTATTC GGCACCAGAT GCGCCTGAAG TTAAAGCGGA TCTCCGGTGG GCGGCGGAGG CGGCGGCGGC TTTCAACAAG CTCTACAAGG GCCGCATCGG CGCGCTGGCA GCGGCGCCCG AGGGCGGCAA GGAGATTGCC GCGGCCTTGA AGGCCTATGA AGAGATCGAG GACAAGCTCG GCCGGTTGAT CTCCTTTGCC GGGCTTCTCT ATGCGCAGGA CAGCGTCAAT CCGGCCTATG CCAAGTTCTA TGGCGACATG CAGGGCCAGA TCACCACGAT TTCGACGGAC CTTCTCTTTT TCGCTCTCGA ACTGAACCGG GTTGAAGACG ACGTGCTGGC TAAGGCTCTC GAAGATCCGG CGCTCGGCCA CTACGCGCCT TGGCTGCGCG ACCTGCGCGC GATGCGGCCT TATCAGCTTT CCGATGAGCT CGAAACGCTT CTGCATGAGA AAAGCCTGAC GGGGCATGCC GCATGGAACC GCCTTTTCGA CGAAACGATG GCGCGGCTTA CATTCGAAGT CGATAGCGAG ACGCTGACGG TCGAAGGTGC CCTGCATCTT CTTTCTGACC ATGATGCGGC GAAGCGCGAA GTAGCGGCGC ATGCGCTTGC GAAGACCTTC AAGGCGAACA CGCCGCTTTT CACGCTCATC ACCAATACGC TTGCAAAGGA CAAGGAGATC GAGGATCGCC TGCGCGGTTT TGACGACATC GCGCAATCGC GGCATATCGC CAATCGCGTG GAGCCGGAAG TCGTGGAGGC GCTGGTGGAT GCAGTGCGGC GCGCCTATCC CGATCTCTCG CATCGCTACT ATGCGATGAA AGCCAAATGG CTCGGTCTCG ATCGCCTGGA ATATTGGGAC CGCAATGCGC CTTTGCCACA TTCAGACGAT GCAGTGATCC CCTGGAATCA GGCGCGGGAT ACAGTACTCG ACGCCTATCG CGGCTTCACG CCGGACATGG CGAAGATCGC CGGTCAGTTT TTCGACAAGG GCTGGATCGA TGCGCCCATG CGGGCGGGCA AGGCGACCGG CGCCTTCGCT CATCCGACCG TGCCGAGCGC ACATCCCTAT GTGCTGGTCA ATTACCAGGG CAAGACGCGC GACGTGATGA CGCTTGCGCA TGAACTCGGT CACGGCGTTC ACCAGGTGCT CGCGGCGAAG CAAGGTCCGC TCATGGCCGA TACACCGCTG ACGCTTGCCG AAACGGCGAG TGTCTTCGGC GAGATGCTCA CCTTCCGCAA ACTTCTCGCC GGAGCGCAGA CGAAGGAACG CCGCAAGGCG ATGCTCGCCT CCAAGGTCGA GGACATGCTC AACACGGTTG TCCGCCAGAC AGCGTTCTAC ACGTTCGAGC GCCGTGTTCA CACCGCGCGC CGGGAAGGCG AACTGACATC GGAAGATATC GGCCGGATCT GGCTCGACGT GCAAGGCGAG AGCCTTGGTC CGGCCATCCA TCTCGGCGAA GGCTACGAGA CCTACTGGTG CTACATCTCT CACTTCATAC ATGCGCCGTT CTATGTCTAC GCCTATGCCT TCGGCGACTG CCTCGTGAAC TCGCTCTATG CGGTCTACGA GCAGGCACAG GAAGGTTTCG CGGAACGCTA TTTCGAAATG CTTTCCGCCG GCGGCACGTT GCGCCACAAG GAACTGCTCG CTCCGTTCGG GCTCGATGCG AGCGACCCTG CCTTCTGGGA CAAGGGTCTC TCCATGATCC GCGGCTTCAT CGACGAGCTT GAGGCGATGG AATGA
|
Protein sequence | MLRLKPVSAP ASSVSNAKDQ QELLGELPVW NLGDLYSAPD APEVKADLRW AAEAAAAFNK LYKGRIGALA AAPEGGKEIA AALKAYEEIE DKLGRLISFA GLLYAQDSVN PAYAKFYGDM QGQITTISTD LLFFALELNR VEDDVLAKAL EDPALGHYAP WLRDLRAMRP YQLSDELETL LHEKSLTGHA AWNRLFDETM ARLTFEVDSE TLTVEGALHL LSDHDAAKRE VAAHALAKTF KANTPLFTLI TNTLAKDKEI EDRLRGFDDI AQSRHIANRV EPEVVEALVD AVRRAYPDLS HRYYAMKAKW LGLDRLEYWD RNAPLPHSDD AVIPWNQARD TVLDAYRGFT PDMAKIAGQF FDKGWIDAPM RAGKATGAFA HPTVPSAHPY VLVNYQGKTR DVMTLAHELG HGVHQVLAAK QGPLMADTPL TLAETASVFG EMLTFRKLLA GAQTKERRKA MLASKVEDML NTVVRQTAFY TFERRVHTAR REGELTSEDI GRIWLDVQGE SLGPAIHLGE GYETYWCYIS HFIHAPFYVY AYAFGDCLVN SLYAVYEQAQ EGFAERYFEM LSAGGTLRHK ELLAPFGLDA SDPAFWDKGL SMIRGFIDEL EAME
|
| |