Gene Plav_1754 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_1754 
Symbol 
ID5455894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp1898625 
End bp1900334 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content64% 
IMG OID640877329 
Productpeptidase M23B 
Protein accessionYP_001413030 
Protein GI154252206 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.598264 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTACGACA CGGGGACCCA GGTGACAGAC GAACATAACG GCAAGGGCAC GATCGACGGG 
GACGTGCCTG CTACCTTCTG GCGGCGCGCG CAGCGCAATG CTCAATTCTC CTTCAAACTC
TATACATCGG CGAGCCGGCT GGCGCTTGGT CAGTCGCGCT GGATCGCGGC GTCATGTGTC
CTTGGCGCGA TGTCGCTTGG CCTTGTCGGT CTTGGCGTCG CATCGCTGGT CGCCCCTTAT
GAAGGCGTGG CAACGGCGGC GCTGACGGCG GAGCGCGCAC CGCGCGTCGT CGCCTGGGAC
GCCTCGCCTG CCCAGATGGC GGAACGCACG GAAACCCGCG CGCTGCAGGA AACCGCAGCC
CTTCTGCCCA CGGATGATCT GCTGGCGGCG GCGGAAACGG AAACGGCGGC AGCAACGGCC
GAGCAGGAGC TCGCGGCCGT CTCTCTCGCC GATCCCGGCC TCTTCCTGCC GCAGACGGAA
GAAAGCCTCG CCGCGGAAGC CGCATCGCTT CCCGCCGCGC CGCTCGAACC GCGCGCAACG
GAAACCACCG TCTCCCTCGC CAATGGCGAG ACGCTGATGG AAGTGCTGCA GAACGCCGGC
GTCGATCGCA TCGACGCCTA TCACGCGGTC GCGGCCATGA CGTCTTACTA CAGCCCGCGC
AAACTCCGCG CCGGCCAGGA GATTTCTCTC GCCTTCATGG AAATGCCGGA TGGCATGATC
GGCGAGGAAG GCGAAACGCC GGCGAAATAC CTGACCGCCA TTTCACTGCA GCCCGACATC
GAACGCGCGA TCGAAGTAAC GCGCAACGAA GACGGCAGCT TCGGCGGCCG CGAAACCATT
CGCGAATTCA CGGAAGGCTT CGTGCGCGCT TCCGGCACCA TCAACAATTC TCTCTTCCTC
GACGCGGAAC AGGCGGGCAT CCCGCCGCAG ATCATCATTG AAATGATCCG CATGTACTCC
TACAGCGTCG ACTTCCAGCG CGAAATCCAG CCTGGCGACA AGTTCGAAGT CTATTTCAGC
CGCAAATTCG ACGAGATGCA GCTCCCCGTG AAAGAGGGCG ACGTCCTCCA CGCATCGCTC
ACCGTCGGCG GCAAGACGCA CAAGCTCTGG CGTTTCGATC CCGGCAAGGA TGGCGAGTGG
GACTATTTCG ACGAGTCCGG CCAGAGCATG AAGAAGTTCC TGATGAAGAC GCCGATCGAC
GGTGCGCGTC TTTCCTCCGG CTACGGATTG CGCAAGCATC CGATCCTCGG CTACTCGAAG
ATGCATGCGG GCGTAGACTT CGCCGCGCCC CGCGGTACGC CGATCTACGC CGCCGGCGAC
GGCACGGTGA CACGCGCGAA CCGCTTCGGC AGCTTCGGCA ACTACATCTC CATCCGCCAC
GCAAATGGCT ATGAAACCGC TTACGCGCAT CTCAACGGCT TCGCGCGCGG CGTCAGGGCG
GGCACGCGCG TCCGTCAGGG CCAGGTGATC GGTTATGTCG GCACAACCGG CCGCTCGACC
GGCCCGCATC TTCACTACGA AGTGCATGTG AACGGCAAGA AGATGAACCC GCTCGCGCTG
AAAGTGCCGA CAGGCCGCAA GCTCGAGGGC AATCAGCTCG CCGCCTTCAA GAACCTGCGC
GCCGACATCG CGACGCAAAT GGCCGAAGCG CCGCTTGCAA CGAAGGTGGC CGCGGCGGGC
GCGAAGGACG ATACGCGCAC CGCCAACTGA
 
Protein sequence
MYDTGTQVTD EHNGKGTIDG DVPATFWRRA QRNAQFSFKL YTSASRLALG QSRWIAASCV 
LGAMSLGLVG LGVASLVAPY EGVATAALTA ERAPRVVAWD ASPAQMAERT ETRALQETAA
LLPTDDLLAA AETETAAATA EQELAAVSLA DPGLFLPQTE ESLAAEAASL PAAPLEPRAT
ETTVSLANGE TLMEVLQNAG VDRIDAYHAV AAMTSYYSPR KLRAGQEISL AFMEMPDGMI
GEEGETPAKY LTAISLQPDI ERAIEVTRNE DGSFGGRETI REFTEGFVRA SGTINNSLFL
DAEQAGIPPQ IIIEMIRMYS YSVDFQREIQ PGDKFEVYFS RKFDEMQLPV KEGDVLHASL
TVGGKTHKLW RFDPGKDGEW DYFDESGQSM KKFLMKTPID GARLSSGYGL RKHPILGYSK
MHAGVDFAAP RGTPIYAAGD GTVTRANRFG SFGNYISIRH ANGYETAYAH LNGFARGVRA
GTRVRQGQVI GYVGTTGRST GPHLHYEVHV NGKKMNPLAL KVPTGRKLEG NQLAAFKNLR
ADIATQMAEA PLATKVAAAG AKDDTRTAN