Gene Plav_0458 details

Gene Information

Locus tag: Plav_0458
Symbol:
ID: 5456957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 493044
End bp: 496082
Gene Length: 3039 bp
Protein Length: 1012 aa
Translation table: 11
GC content: 66%
IMG OID: 640876024
Product: peptidoglycan binding domain-containing protein
Protein accession: YP_001411738
Protein GI: 154250914
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
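
The coordinate and length fields above are consistent with each other. A minimal sketch of that check follows, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 493044
END_BP = 496082


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3039
print(protein_length(length_bp))  # 1012
```

Both results match the Gene Length and Protein Length fields of the record.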


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGATCGG GGGTTCCATG GAGCGTCAAG GGCATAGAGC CCGAGGCGCG AGAGGCGGCC 
AAGCAGGCAG CGCGGCGCGC CGGCGTGACG CTGGGTGCAT GGCTGAACCA GGTCATCATG
GATACCGGAA CCGATGAAGT GGGACCGCAG GAGGAGTCGC CGATGACGTC CCAATCCCCC
TACGGACGGC CGCAGGCGGC GCCGGGGATC GCGATACCCG AACCGAAGGT CGACCTCGGC
CCCGTGGCCG AAGCCGTGCG GGAACTCGTG CAGCGGGTGG ACGGCAGCGA ACGCCGGACA
GCGGAGATGA CGCGCAAGCT CGAAGCGACG GTGAGCCAAC TCGCCGCCCG CCTCGACGAG
CCGGAGCACG ACATGGACGA CAGATACCAG GAAGCGAGAT CGCTCGACCC GCTGGAGCGC
AAGCTGCAGC AGCTCGGCGA ACGCATGGAA CGCGCCGAGC GCGGCCGAGG CGGCCTGCGC
CCGGAAGACG CGCGCGCGAT CCAGACGCTG GAGAAGGCCA TGAACGCGGT CGTCGACCAT
CTCGACGCGA CCGAACGCCG GACCGACGAG ACGCTGATCG AAATCCGCCA GTCGCTCGCC
TCCCTCTCGC ACCGCATCGA AAACGCCGAA CAGGAATCCG AACGCGAGGA AGCGAAGAAA
CGCGCCCGCG CCCTCGAAGA CACGCTCATG CAGCTCGCGA CGCGCATGGA AAAAATGGAA
ACGGGCGTGA GCGGCATCGG CTCGCAGGCC GTCAATGCGG CGCTGAAGGC GATCGAAGAA
AAGTCGAACG CGGAAAACCA GCGTGCGACC ATCGACCGGC TCCAGAAGAG CATCGAACAG
ATTTCCGCGC GCATCGAACA GACGGAACAG CGGAGCGACC AGACCGCGAA GACACTCGAG
ACGACCGTTT CGAGCATTGT CCGCAAGATC GAAGAGATCG ACCTCAACAG CCGCACACAT
ATTCCCGACG CGCTGGCACA GCGGCTCGAG CAGATGGCCG AACGGCTGCA GCACAATGAA
CAATTGACGG TGGAAGCCGC GCAAACGGTG GAACGGGCGA TCGCCGGTAT CGGCGAAAAT
CTGACGGCGA CCGAAAGCCG CGACCGGGAA GCGCTTTCGT CTCTGCAGAC GATGATCGAA
CGCATGACGA ACCGGCTCGG CCAGCTCGAG AAGGAAACGA AAGCCGCAAA GGCACAGGCA
GCGCTGTCGC CGCAGCTCAA TGCAGGTTCG CTGGCGGCAG GCTTTCCGCC TGCGGGGCCC
GGCTATGGCA TGAATTTCGA CGCGCCGCAG ATGATGAACC CCAGCATGGG CGGCAATCTC
GGCCCTTCCT TTGGCCCGAG CGACTGGGGA CGCCAGGACG CAACTCGCCA GGATTGGGGA
CGCCAGCACG AAGCGGAGCG TTCCGCACCT GCCGTTACAG CGCCGCCGCT CCATGAAGCG
CCGCCTTACG CCGCGCGCGA TGCGCGCTCC TTTGAGGAAG ACAGCGTGCC GCCGCCTTTC
GTGGCGGAGA CGCACGACGA GACTTATGAA GATTTCGGCG ACCATCAGGA GATGCGCGGC
CAATATGACG ACGGCATCAT TCCGCCGGAT CCGGTGGAAC CGATGCAGAA TGCCGGTCAA
CGCGCCGCCA GCGATTTCCT CGCCGCCGCG CGCCGCGCCG CGCAGGCCGC CGCAGAAGGC
GGCACGGGCA GGCAGGAACC TTACTACGGC TCACCGCCCT CCCCCGGCTT CGCGGAATCT
TCGTCCCGCT TTTCGGCGCA GGACCAGGGC GAGACGCGCC GGCGCAAACT CTTTCTTGCC
GTCGCCGGCA GTTTTGTGCT GCTTGCCTTG CTCGCAGGCG CCTATGTGAT GCTGAAGAAC
GGCTCGACGC CGACGCAGGC GCCTGTCGTT CTCAACAATC CGGGCAATGC GCGGCCGACC
GTTCTCGACA CTACGCCGCT TACGGCGGCG CCGGATGGAC CGGCGGCCAA TCCTGCAGAG
ACCGGGGCGG GCGAGACGGC CGCACCTGTT ACGGATGCCG CTCCGAATGA AACACCTGCC
GTGCCGAACG AGGCGCTGGT GCCCGCGCCG AAAGCACCGG CGGCAGCACA ACCTTCGACA
GGAACGCCCG CCGGGCAGGC AACGCTGACA CCGGCACCTT CGCTGGCGCC CGCACCTTCG
ACGCCTGTGG AAACGGCACC TGTGGAGCCC GCCAAGGTGA CGCTGCTCGA TGCGGCGCGC
GGCGGCAACG CGGCTGCGCA ATATGAAGTC GGCCAGCGTT ATGCCAATGG CGAGGGTGTG
ACACAGGACA TGTCGGAAGC GGCGCGCTGG TTCGAGCGGG CGGCCAATCA GGACCTGACG
ATCGCGCAGT ATCGCCTTGC CACCCAATAT GAAAAGGGAC GCGGCGTGCC GCAGGACGAT
GCGAAGGCGC GCGACTGGTA TGAAAAGGCG GCAGCCGGCG GCAATGTGAA GGCGATGCAC
AATCTTGCCG TGATCCATGC CGAGGGCCGG GGCACGGCGC AGGATTTTGA AACGGCATCG
CGCTGGTTTA CGCAGGCCGC GGATTTCGGA CTTGGCGACA GCCAATACAA TCTCGCGATC
CTGAACGAAC GCGGGCTTGG CATCGAGAAG AACCTCGTCG AAGCCTATAA GTGGCTCGAC
ATCGCGGCGA AGGGCGGCGA CAAGGGCGCG GCCGCAAAGC GCGACGCCAT CGCTACCGAA
CTCAGCGCAG ACGATTTGGC GCGGGCGAAG ATCGCGTCGG GCACATGGCG GGCGAAGAAG
CCGGAGCCTG TTGCCAATGG CGATATGGGA ACGCTGAAGC GCTGGGACAT TTCATCGATG
GAAGGCGCGA CGAGCGCATC GGCCCCCGTC ACGCGTGCGG ATGTGGCGCG GGTGCAGGAA
CTGCTCAACC GGCTCGGCTA TAACGCCGGA TCGCCGGACG GACTGATGGG ACCGCGCACG
CGGGACGCGA TACTGGAATA CCAGCTGACT GAAGGGCTGG AGACGACAGG GACGGCGACA
CGCGAAACGC TGACATCGCT CGAAGCGAGG CTCAGCTAG
 
Protein sequence
MRSGVPWSVK GIEPEAREAA KQAARRAGVT LGAWLNQVIM DTGTDEVGPQ EESPMTSQSP 
YGRPQAAPGI AIPEPKVDLG PVAEAVRELV QRVDGSERRT AEMTRKLEAT VSQLAARLDE
PEHDMDDRYQ EARSLDPLER KLQQLGERME RAERGRGGLR PEDARAIQTL EKAMNAVVDH
LDATERRTDE TLIEIRQSLA SLSHRIENAE QESEREEAKK RARALEDTLM QLATRMEKME
TGVSGIGSQA VNAALKAIEE KSNAENQRAT IDRLQKSIEQ ISARIEQTEQ RSDQTAKTLE
TTVSSIVRKI EEIDLNSRTH IPDALAQRLE QMAERLQHNE QLTVEAAQTV ERAIAGIGEN
LTATESRDRE ALSSLQTMIE RMTNRLGQLE KETKAAKAQA ALSPQLNAGS LAAGFPPAGP
GYGMNFDAPQ MMNPSMGGNL GPSFGPSDWG RQDATRQDWG RQHEAERSAP AVTAPPLHEA
PPYAARDARS FEEDSVPPPF VAETHDETYE DFGDHQEMRG QYDDGIIPPD PVEPMQNAGQ
RAASDFLAAA RRAAQAAAEG GTGRQEPYYG SPPSPGFAES SSRFSAQDQG ETRRRKLFLA
VAGSFVLLAL LAGAYVMLKN GSTPTQAPVV LNNPGNARPT VLDTTPLTAA PDGPAANPAE
TGAGETAAPV TDAAPNETPA VPNEALVPAP KAPAAAQPST GTPAGQATLT PAPSLAPAPS
TPVETAPVEP AKVTLLDAAR GGNAAAQYEV GQRYANGEGV TQDMSEAARW FERAANQDLT
IAQYRLATQY EKGRGVPQDD AKARDWYEKA AAGGNVKAMH NLAVIHAEGR GTAQDFETAS
RWFTQAADFG LGDSQYNLAI LNERGLGIEK NLVEAYKWLD IAAKGGDKGA AAKRDAIATE
LSADDLARAK IASGTWRAKK PEPVANGDMG TLKRWDISSM EGATSASAPV TRADVARVQE
LLNRLGYNAG SPDGLMGPRT RDAILEYQLT EGLETTGTAT RETLTSLEAR LS
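
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that spacing, translate the nucleotide sequence with the amino acid assignments of translation table 11, and compute a GC fraction. It is an illustration under stated assumptions, not the pipeline that produced this record: the helper names are hypothetical, only the first 30 bases of the gene are used, and alternative start codons (which table 11 allows) are not handled specially.

```python
# Amino acid assignments for NCBI translation table 11, with codons
# ordered TTT, TTC, TTA, TTG, TCT, ... (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the listing above.
first_bases = "ATGCGATCGG GGGTTCCATG GAGCGTCAAG"
print(translate(first_bases))    # MRSGVPWSVK
print(gc_fraction(first_bases))  # 0.6
```

The translated prefix matches the first block of the protein sequence. The GC fraction printed here covers only these 30 bases, not the whole gene.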