Gene Plav_2052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_2052 
Symbol 
ID5455356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2239316 
End bp2241160 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content60% 
IMG OID640877629 
ProductpepF/M3 family oligoendopeptidase 
Protein accessionYP_001413323 
Protein GI154252499 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR02290] oligoendopeptidase, pepF/M3 family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.841739 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.00000309933 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTGCGTC TCAAGCCCGT TTCTGCCCCT GCCTCATCCG TGAGTAACGC AAAAGATCAG 
CAGGAATTGC TGGGCGAGCT GCCGGTCTGG AACCTGGGCG ATCTTTATTC GGCACCAGAT
GCGCCTGAAG TTAAAGCGGA TCTCCGGTGG GCGGCGGAGG CGGCGGCGGC TTTCAACAAG
CTCTACAAGG GCCGCATCGG CGCGCTGGCA GCGGCGCCCG AGGGCGGCAA GGAGATTGCC
GCGGCCTTGA AGGCCTATGA AGAGATCGAG GACAAGCTCG GCCGGTTGAT CTCCTTTGCC
GGGCTTCTCT ATGCGCAGGA CAGCGTCAAT CCGGCCTATG CCAAGTTCTA TGGCGACATG
CAGGGCCAGA TCACCACGAT TTCGACGGAC CTTCTCTTTT TCGCTCTCGA ACTGAACCGG
GTTGAAGACG ACGTGCTGGC TAAGGCTCTC GAAGATCCGG CGCTCGGCCA CTACGCGCCT
TGGCTGCGCG ACCTGCGCGC GATGCGGCCT TATCAGCTTT CCGATGAGCT CGAAACGCTT
CTGCATGAGA AAAGCCTGAC GGGGCATGCC GCATGGAACC GCCTTTTCGA CGAAACGATG
GCGCGGCTTA CATTCGAAGT CGATAGCGAG ACGCTGACGG TCGAAGGTGC CCTGCATCTT
CTTTCTGACC ATGATGCGGC GAAGCGCGAA GTAGCGGCGC ATGCGCTTGC GAAGACCTTC
AAGGCGAACA CGCCGCTTTT CACGCTCATC ACCAATACGC TTGCAAAGGA CAAGGAGATC
GAGGATCGCC TGCGCGGTTT TGACGACATC GCGCAATCGC GGCATATCGC CAATCGCGTG
GAGCCGGAAG TCGTGGAGGC GCTGGTGGAT GCAGTGCGGC GCGCCTATCC CGATCTCTCG
CATCGCTACT ATGCGATGAA AGCCAAATGG CTCGGTCTCG ATCGCCTGGA ATATTGGGAC
CGCAATGCGC CTTTGCCACA TTCAGACGAT GCAGTGATCC CCTGGAATCA GGCGCGGGAT
ACAGTACTCG ACGCCTATCG CGGCTTCACG CCGGACATGG CGAAGATCGC CGGTCAGTTT
TTCGACAAGG GCTGGATCGA TGCGCCCATG CGGGCGGGCA AGGCGACCGG CGCCTTCGCT
CATCCGACCG TGCCGAGCGC ACATCCCTAT GTGCTGGTCA ATTACCAGGG CAAGACGCGC
GACGTGATGA CGCTTGCGCA TGAACTCGGT CACGGCGTTC ACCAGGTGCT CGCGGCGAAG
CAAGGTCCGC TCATGGCCGA TACACCGCTG ACGCTTGCCG AAACGGCGAG TGTCTTCGGC
GAGATGCTCA CCTTCCGCAA ACTTCTCGCC GGAGCGCAGA CGAAGGAACG CCGCAAGGCG
ATGCTCGCCT CCAAGGTCGA GGACATGCTC AACACGGTTG TCCGCCAGAC AGCGTTCTAC
ACGTTCGAGC GCCGTGTTCA CACCGCGCGC CGGGAAGGCG AACTGACATC GGAAGATATC
GGCCGGATCT GGCTCGACGT GCAAGGCGAG AGCCTTGGTC CGGCCATCCA TCTCGGCGAA
GGCTACGAGA CCTACTGGTG CTACATCTCT CACTTCATAC ATGCGCCGTT CTATGTCTAC
GCCTATGCCT TCGGCGACTG CCTCGTGAAC TCGCTCTATG CGGTCTACGA GCAGGCACAG
GAAGGTTTCG CGGAACGCTA TTTCGAAATG CTTTCCGCCG GCGGCACGTT GCGCCACAAG
GAACTGCTCG CTCCGTTCGG GCTCGATGCG AGCGACCCTG CCTTCTGGGA CAAGGGTCTC
TCCATGATCC GCGGCTTCAT CGACGAGCTT GAGGCGATGG AATGA
 
Protein sequence
MLRLKPVSAP ASSVSNAKDQ QELLGELPVW NLGDLYSAPD APEVKADLRW AAEAAAAFNK 
LYKGRIGALA AAPEGGKEIA AALKAYEEIE DKLGRLISFA GLLYAQDSVN PAYAKFYGDM
QGQITTISTD LLFFALELNR VEDDVLAKAL EDPALGHYAP WLRDLRAMRP YQLSDELETL
LHEKSLTGHA AWNRLFDETM ARLTFEVDSE TLTVEGALHL LSDHDAAKRE VAAHALAKTF
KANTPLFTLI TNTLAKDKEI EDRLRGFDDI AQSRHIANRV EPEVVEALVD AVRRAYPDLS
HRYYAMKAKW LGLDRLEYWD RNAPLPHSDD AVIPWNQARD TVLDAYRGFT PDMAKIAGQF
FDKGWIDAPM RAGKATGAFA HPTVPSAHPY VLVNYQGKTR DVMTLAHELG HGVHQVLAAK
QGPLMADTPL TLAETASVFG EMLTFRKLLA GAQTKERRKA MLASKVEDML NTVVRQTAFY
TFERRVHTAR REGELTSEDI GRIWLDVQGE SLGPAIHLGE GYETYWCYIS HFIHAPFYVY
AYAFGDCLVN SLYAVYEQAQ EGFAERYFEM LSAGGTLRHK ELLAPFGLDA SDPAFWDKGL
SMIRGFIDEL EAME