Gene Plav_0814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_0814 
Symbol 
ID5455666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp881706 
End bp882821 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content62% 
IMG OID640876385 
Productpeptidase M42 family protein 
Protein accessionYP_001412094 
Protein GI154251270 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID[TIGR03106] hydrolase, peptidase M42 family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0829922 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAACC TCAGTATCGA TAGCAACTAT ATGTGCGACG TAATGGAGCG CATGCTGGAA 
ACGCCGAGCC CCTCCGGCAT GACCGACATG ATCGTCGGGC TCGTCTGCGA AGAGCTGGAG
AAATTCGGCA TCAACTTCGA GCTTACGAGA CGTGGCGCAA TCCGCGCGGA TCTGGAGGGC
GGCCTTCATT CTCCCGACCG CGCCATTATC GGGCATCTCG ATACGCTTGG CGCAATGGTC
AAGGGCTACC GGGCGAATGG CCGCCTTGAG GTCGTTCCCA TCGGTACGTG GTCCGCGCGT
TTTGCCGAAG GGGCGCGCTG CACGATTTAT GCAGATGGCG GCGCGCGCTA TCGCGGCAGC
ATCCTGCCGC TCAAGGCTTC CGGCCACACG TTCAACGAGG AGATCGACAC GCAGCCGGCT
TCATGGAGCA ACCTGGAGCT GCGTATCGAC GCAAGGACGG GGAGCGAAGC CGAGACCCGT
GCGCTCGGCA TCCATGTCGG CGACACGATC TCGATCGACC CGGAGACGGA GTTCTCCGAC
ACCGGCTTCG TGACCTCGCG GCATCTCGAC GACAAGGCGG GTGTGGCTTC CATGCTTGCC
GCCGCCAAGG CCGTCACGCA ATCGGAGGTG ACGCTGCCCA TCGACTGTCA TCTCCTGTTC
ACCATCTCCG AGGAAGTGGG CGTCGGCTCA TCGCATGTGC TGCATGGCGA TGTCGCGGAA
ATGGTATCTA TCGACAACGG CACCGTTGCG CCGGGTCAGT ATACAAGCGA GTACGGCGTT
ACCGTCGCGA TGCAGGATTC GTCCGGGCCT TTCGACTGGC ATCTGACGAG AAGTCTGCTG
GGGCTCTGCG AGCAGCACGA CATCGAGCAC GCGCGCGATG TGTTTCGCTA CTACCGCAGC
GATGCGGCGG CGGCGCTTGA AGCAGGGAAC GACATTCGCA CCGCCCTCCT CTGCTTCGGC
CTTGATGCAT CGCATGGCTA TGAGCGGGTG CATCTCAGTT CGCTGGAAGC GCTCTCCCGC
CTGCTCGTGC TCTATATGCA GTCGAAGCCG CTCTTCCGGC GCGACAGGGA AGCGCTTGGC
CCCCTCGACG ACCTGCCGAC CGGCGAGCCG AACTAG
 
Protein sequence
MKNLSIDSNY MCDVMERMLE TPSPSGMTDM IVGLVCEELE KFGINFELTR RGAIRADLEG 
GLHSPDRAII GHLDTLGAMV KGYRANGRLE VVPIGTWSAR FAEGARCTIY ADGGARYRGS
ILPLKASGHT FNEEIDTQPA SWSNLELRID ARTGSEAETR ALGIHVGDTI SIDPETEFSD
TGFVTSRHLD DKAGVASMLA AAKAVTQSEV TLPIDCHLLF TISEEVGVGS SHVLHGDVAE
MVSIDNGTVA PGQYTSEYGV TVAMQDSSGP FDWHLTRSLL GLCEQHDIEH ARDVFRYYRS
DAAAALEAGN DIRTALLCFG LDASHGYERV HLSSLEALSR LLVLYMQSKP LFRRDREALG
PLDDLPTGEP N