Gene Plav_0844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_0844 
Symbol 
ID5456343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp915941 
End bp918040 
Gene Length2100 bp 
Protein Length699 aa 
Translation table11 
GC content64% 
IMG OID640876415 
Productoligopeptidase B 
Protein accessionYP_001412124 
Protein GI154251300 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.740368 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTCA CACCTCCCGC CGTTGCGCCC CTGGCCCCGA AGCGGCCGCA GACGGATGTG 
CATCACGGCA TTTCGCGGAC CGACGATTAT GCGTGGCTGC GCGACGAGAA CTGGCGCGAG
GTGATGCGCG ATCCGGCGGT TCTCGATACC GACATCCGCG CCTATCTCGA CGCCGAGAAC
GCCTATACGG AAGCGGCGCT CAAACCTGTC GCCGAATTGC GCGAGACACT CTTCAAGGAG
ATGAAGGGGC GGATCAAGGA GGATGACAGC TCCGTGCCGT CGCCCGACGG CGCTTTTGCC
TATTACACGC GCTTTGTCGA AGGGGCGCAG CATCCGCTTT TCTGCCGCAG GCCGCGAGAG
GCGGAAGCGG GCGAGGAAAT ATTGCTTGAC GCGAACAAGG AAGCGGAGGG CGAGGCCTAT
TTCAAGATCG GCGATGTGGA CCATGCGCCG ACGCACAAGC TGATCGCATG GTCGGCGGAC
CGCAAGGGCT CGGAATATTT CACGGTGCGG CTGCGCGACG CCGCGACGGG GAAGGACCTC
GCCGACGAGG TGCCGGACAC TTCGCCCGGC ATCGCATGGG ACGCGGCGGG CACGAGCTTT
CTCTATACGC AGGTGGATGA CGAACACCGG CCGCTGAAAG TGTTCCGCCA TGTCGTCGGC
ACGGCGGCGA GCGAAGACAC ACTCGTCTAT GAAGAAGAGG ACGAAGGCTT TTTCGTCGGC
GTGGGGAAGA CGCAGAGCGG CAAGTGGCTC GTCATTTCAA GCCATGACCA CCAGACCAGC
GAATGCAGGC TGATCCCCGC CGATGCGCCG GAGACTGCGC CGCTGCTCGT TGCGCCGCGC
GAGGAGGCTG TCGAATACGA TATCGAGCAT GACGAACCGC GAGAACGCTT CCTCATTCTC
ACGAATGCGG ACGGGGCGGA GGACTTCAAG ATCGTTGAGG CGCCCGAGGC GGCGCCCGGA
CGCGAGAACT GGCGCGACTT CGTGCCCCAC CGGCCGGGGA CGCTGGTGCT GCATCATGTG
GCCTATCGCG GGCATCATGT GCGGCTCGAA CGGCGGGACG GGCTGCCGCG CATTGCGGTG
CGGCGGCTTG CGGACGGCGC GGAACATGAG ATCGGCTTCG ACGAGGAAGC CTATGATCTC
AACATGGGCG CGGGCTACGA ATACGACACG ACGCGGCTGC GCTTTTCCTA CAGCTCGATG
ACGACGCCCG CGGAAGTCTA TGATTATGAC GTCGAGACGC GGGAGCGGAC ATTCCGCAAG
CGGCAGGAAG TGCCCTCGGG CCACAACCCG GCCGACTACG AGACAAGGCG GATTTTCGCG
CGCGCCTCGG ACGGCGAGAT GGTGCCGATT TCGCTCGTCC ACCGAAAGGG GCTGAGCCTC
GACGGGAGCG CGCCCTGCCT GCTTTACGGC TATGGCTCTT ACGGGATCAG CATTCCGGCA
TCGTTTTCCA CGACCTGCCT TTCGCTGGTC GATCGCGGCT TCGTCTATGC GATCGCGCAT
ATTCGCGGCG GCAAGGAGAA GGGCTATCGC TGGTATACGG ACGGGAAGCT CAACAAGAAG
CGCAACACCT TCACCGATTT CATCGCGGCG GGCGAGCACC TGGCGAAGGA AGGCTTCACG
TCGCGCGGCA ACATCGTGGC GCATGGCGGC AGCGCGGGCG GGATGCTGAT GGGGGCGGTT
TCCAACATGG CGCCCGATCT CTTCAAGGGC ATTCTGGCGG AAGTGCCGTT TGTCGATGTG
CTGGCGACGA TCCTCGATGC GTCGCTGCCG CTGACGCCGC CGGAATGGAA CGAATGGGGC
AACCCGATCG AGAGCAAGGA AGCCTACGAG TACATGGCTT CCTACAGCCC TTACGACAAT
GTGAAGCCCC AAGCCTATCC GCATCTCTTC GCGCTTGGCG GGCTCACCGA TCCGCGCGTG
ACCTATTGGG AGCCGGCGAA GTGGGTGGCG AAGCTGCGGG AGCTCAAGAC CGGCGATGCG
GTGACGCTGC TTCACATCAA CATGGAGGCC GGACATGGCG GCGCTTCCGG CCGCTTCGAG
CGGCTGAAGG AAGTGGCGCG GGTCTATGCC TTTGCGCTGG CGGTGACGGA ACGCGCGTGA
 
Protein sequence
MKLTPPAVAP LAPKRPQTDV HHGISRTDDY AWLRDENWRE VMRDPAVLDT DIRAYLDAEN 
AYTEAALKPV AELRETLFKE MKGRIKEDDS SVPSPDGAFA YYTRFVEGAQ HPLFCRRPRE
AEAGEEILLD ANKEAEGEAY FKIGDVDHAP THKLIAWSAD RKGSEYFTVR LRDAATGKDL
ADEVPDTSPG IAWDAAGTSF LYTQVDDEHR PLKVFRHVVG TAASEDTLVY EEEDEGFFVG
VGKTQSGKWL VISSHDHQTS ECRLIPADAP ETAPLLVAPR EEAVEYDIEH DEPRERFLIL
TNADGAEDFK IVEAPEAAPG RENWRDFVPH RPGTLVLHHV AYRGHHVRLE RRDGLPRIAV
RRLADGAEHE IGFDEEAYDL NMGAGYEYDT TRLRFSYSSM TTPAEVYDYD VETRERTFRK
RQEVPSGHNP ADYETRRIFA RASDGEMVPI SLVHRKGLSL DGSAPCLLYG YGSYGISIPA
SFSTTCLSLV DRGFVYAIAH IRGGKEKGYR WYTDGKLNKK RNTFTDFIAA GEHLAKEGFT
SRGNIVAHGG SAGGMLMGAV SNMAPDLFKG ILAEVPFVDV LATILDASLP LTPPEWNEWG
NPIESKEAYE YMASYSPYDN VKPQAYPHLF ALGGLTDPRV TYWEPAKWVA KLRELKTGDA
VTLLHINMEA GHGGASGRFE RLKEVARVYA FALAVTERA