Gene Plav_0943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_0943 
Symbol 
ID5454152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp1013426 
End bp1014901 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content63% 
IMG OID640876514 
Productprotease Do 
Protein accessionYP_001412223 
Protein GI154251399 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.250261 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAATG AAAGCAATAT CGGGCGGCAG AGGACGCTGG CGCTGGCAGT GGCGGGCGCA 
CTCGTATTCG GCGCCTGGTC GTCCGGCGCC TTGGCGCAGG AGCCCCTGGA CACGCGCTCA
GCCGCGCCTG CGTCGACGGC CCTGGAAAAT CTCCCGAGCT TCGCCGACCT CGTCGAGAAG
GTGAACCCGG CTGTGGTCAG CATTCGTGTC GACGAGGAGG TTGCCGCCCG TAGCTCCGGT
GTCCCCGATC TTCCGTTCCC GCCCGGCAGC CCCTTCGAGA AATTCTTCCG TGACATGCAG
CCCCAGCAGG GCCCCGACGG CGCGCCGCCG CGCCGGCATG CTACAGCACT CGGCTCCGGA
TTTCTGATTT CCGCGGATGG CTTTGTGGTC ACCAACAATC ACGTCGTTGG CGACGGCAAG
GACATCACTG TCGTTCGCAG CGACGGCAGC GAGATGAAGG CGAAACTCAT CGGCCGGGAT
CCGAAGACGG ATCTCGCGCT TGTGAAAGTC GAAAGCAAGG AACCTCTGCC TTACGTGGTG
TTCGGCAATT CCGACAATGT GCGCGTGGGA GACTGGGTGC TCGCGGTCGG CAATCCCTTC
GGCCTTGGAG GCACCGTCAC CACCGGCATT GTTTCCGCGC GCGGCCGTGA AATAGGCGCT
GGCCCCTATG ACGATTTCAT TCAGATCGAT GCTTCGATCA ACAAGGGAAA TTCAGGCGGT
CCGACCTTCG ACGTCCGGGG TAATGTTGTG GGCGTCAACA CGGCCATCTT TTCGCCCACT
GGCGGCAGTG TCGGTATCGG CTTCGCCATT CCGTCCTCGA TCGCGCAGAA CGTCATCGCT
CAGCTGAAGG AAGACGGAAA GGTCACGCGC GGCTGGCTCG GCGTCACCAT TCAACAGGTT
GACGAGGACG TCGCCTCCAC GCTCGCCCTG GACAAGCCCC GTGGCGCGCT CGTCGCACAG
GTTGCGGAAG ACAGCCCCGC GAAGAAAGCC GGCATCCAGA CGGGCGACGT CATTCTCAAT
GTTGACGGAA AAGAAATGGA AGACGTCCGT GCCGTCAGCC GCACGGTTGC GGATCTGCAG
CCAGATACGC GCTCGCAGAT CGTCCTGTGG CGCGATGGCA AGCGGAAAAA CATCTCCGCG
CAGATTGGCA CCTTCCCCGA GGAGATCGCG GCCGCAGCGG CTTCGCCCAC TGGGGAGGCG
CCGGCCGCCG GCACGACGGA GAGCCTGGGC CTTGCGCTTA CCCGCTCGCC GGAAGGCGTC
ATGGTGCAGA GCGTCGACCC CGCCAGCGAT GCTGCCGAAA AGGGCGTCCG TCCCGGCGAC
ATTATCGTCA AGGTATCCGG CAAGGACGTG ACGGAGCCCG CCGATGTGGT GGCGCGCGTC
GCGGAAGCAG GAAAGGCCGA CAAGAACTCG GTCCTGCTTC TTCTCCGAAC CGACAATCAG
CAGCGTTTCG TTGCACTGAC GCTTGAGAAA TCCTGA
 
Protein sequence
MRNESNIGRQ RTLALAVAGA LVFGAWSSGA LAQEPLDTRS AAPASTALEN LPSFADLVEK 
VNPAVVSIRV DEEVAARSSG VPDLPFPPGS PFEKFFRDMQ PQQGPDGAPP RRHATALGSG
FLISADGFVV TNNHVVGDGK DITVVRSDGS EMKAKLIGRD PKTDLALVKV ESKEPLPYVV
FGNSDNVRVG DWVLAVGNPF GLGGTVTTGI VSARGREIGA GPYDDFIQID ASINKGNSGG
PTFDVRGNVV GVNTAIFSPT GGSVGIGFAI PSSIAQNVIA QLKEDGKVTR GWLGVTIQQV
DEDVASTLAL DKPRGALVAQ VAEDSPAKKA GIQTGDVILN VDGKEMEDVR AVSRTVADLQ
PDTRSQIVLW RDGKRKNISA QIGTFPEEIA AAAASPTGEA PAAGTTESLG LALTRSPEGV
MVQSVDPASD AAEKGVRPGD IIVKVSGKDV TEPADVVARV AEAGKADKNS VLLLLRTDNQ
QRFVALTLEK S