Gene Plav_0833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_0833 
Symbol 
ID5455982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp905988 
End bp907958 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content63% 
IMG OID640876404 
Producthypothetical protein 
Protein accessionYP_001412113 
Protein GI154251289 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.982432 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTTGC CGAGGCTCGG AATTGAAGGC GAGTGGTTTC AGGGAACGGA TGGCCGCAAG 
CGCATCCTGC GTGGCGTCAA TCTTGGCGGC GACTGCAAGG TGCCATATGC GCCGAACGGC
CACACAAATA TCCCGACCGA TTTTTCCGAT CACCGCACCG TCTCCTTCGT CGGCCGTCCC
TTTCCCCTCG CGGAAGCGGA TGAACATTTC TCCCGCCTCA AGGCATGGGG CTTCAACTGC
CTCCGCCTGC TGACCACATG GGAAGCGGTC GAACATGAAG GCCCCGGCCG CTACGACGAG
GCCTATCTCG ACTATTTCGC CGAACTCTGC CGCAAGGCCG ACGATTACGG CCTCTATGTC
TTCGTCGATT TCCATCAGGA TGTCTGGAGC CGCATGACAG GCGGCGACGG CGCCCCCGGC
TGGCTCTTCG ACGCGGTCGG CCTCGACTTC ACGAAGTTCC ATGCCGCCGG CGCGGCGCAT
GTCATGCAAT ACAAATACGA CTTCGCGAAA GGCGGCCGCC AGGAAGAAAA CTACCCGACC
ATGACCTGGT CGCAGAATTA CAAATACCCC GCCAACGCGA TCATGTGGAC GCTCTTCTTC
GCCGGCAACA CCTTCTGCCC GGACTTTCTC GTGCAGGGCC GCCCCGCGCA GGATTTCCTT
CAGGAGCATT ATCTCGGCGC GATGGAAGCC GTCGCACGCC GCGTGAAAGA CCTGTCCAAC
GTCATCGGCT TCGACTCGCT GAACGAGCCG GGCTCAGGCT ATGTCGGCCA GTGGCTGAGC
TACCGGCACA CAGGCCCCAG CGAGAAAAAC CCCATGCCCG CGCGTCCCGG CCTCGCCTGG
TCGCCGCTTG ACGGTTTCGC CGCCGCGCGC GGTCTCGCCC GCGAACTCCC GGAAATGCAT
ATCGACTGGG AGCAGCGCGC CGTCGTGAAG AAGCGCGACG TGCTGGTGAA CGGCGATGGC
GTCTCGATCT GGAAGCAGGG CCGCCATTGC CCCTTCGAGC GCGCAGGCGT CTACCGCATC
AATGGCGGCG AGATCGAAGC CATGGATGAA AAATTCTTCC GCGAGCGCAA CGGCCGCAGA
TTTGAAATGG AAAAGGATTT CATGGGCCCG TTCTTCGCCC GTGTCGCGGA GCGCGTGCGT
GCCGTCAACA ATGACTGGCT GCTCTTCGCC GAGCTCGATC CCGGCGCCGG CCTCGGCCAC
GGCTTCCCGC CCGACACGCC CGAGCGCACG GTCAATGCCA GCCACTGGTA CGACATCGTC
ACGCTCTCGA CCAAGCGGTT CGACTTCCCC GTGAAGATCA ATCCCTATAC CGGCCGCACC
ACCGAAGGCG CGGAAGCGAT CGAAGCCTCC TACACGCGCC AGCTCGGCCG CCTGAAGGAT
GCCTCGCACG CATTGAACGG CGGCACGGGC GCACCGGCCC TTCTCGGCGA ATTCGGCATC
CCCTTCGATC TCGACAATGC CGCCGCCTAC AAGGCGTGGA AAGCGGGCGA CCGCACGGAT
GCGCCTTGGG AAAAACACAT CATCGCCCTC GACCTCATGT ACAACGCGCT CGACCAGTTG
CTCATGCACT CGACGCAATG GAACTACACC GCGTCGAACC GCAACGATCA GGCGGTGGGC
GATGGCTGGA ACCAGGAAGA CCTCTCGATC TACAGCATCG ACCAGCGCGT CGACCCGTCG
GATGTGAACA GCGGCGGCCG CGCGCTTGCC GGCTTCGTCA GGCCTTATGC CCGCGCCGTC
GCGGGCCGCC CCTTGAAGAT GAAGTTCAAG CGCGAGACCG GCGCCTTCCG TTTCATCTAT
CAGGCGGAAG GCGAGGGCGA GACGGAAATC TTCGTACCGA ACCTGCAATA TCCGAATGGC
TATGACGTCG AAGTCGAAGG CGGCACCGTC ACCCGCGATG AGGAAAACCA GTGCCTCCGC
GTCCATGCCG TCGGGTCCGA CAAGGTCGGC GTCATGATCA CGAGGCGGTA G
 
Protein sequence
MALPRLGIEG EWFQGTDGRK RILRGVNLGG DCKVPYAPNG HTNIPTDFSD HRTVSFVGRP 
FPLAEADEHF SRLKAWGFNC LRLLTTWEAV EHEGPGRYDE AYLDYFAELC RKADDYGLYV
FVDFHQDVWS RMTGGDGAPG WLFDAVGLDF TKFHAAGAAH VMQYKYDFAK GGRQEENYPT
MTWSQNYKYP ANAIMWTLFF AGNTFCPDFL VQGRPAQDFL QEHYLGAMEA VARRVKDLSN
VIGFDSLNEP GSGYVGQWLS YRHTGPSEKN PMPARPGLAW SPLDGFAAAR GLARELPEMH
IDWEQRAVVK KRDVLVNGDG VSIWKQGRHC PFERAGVYRI NGGEIEAMDE KFFRERNGRR
FEMEKDFMGP FFARVAERVR AVNNDWLLFA ELDPGAGLGH GFPPDTPERT VNASHWYDIV
TLSTKRFDFP VKINPYTGRT TEGAEAIEAS YTRQLGRLKD ASHALNGGTG APALLGEFGI
PFDLDNAAAY KAWKAGDRTD APWEKHIIAL DLMYNALDQL LMHSTQWNYT ASNRNDQAVG
DGWNQEDLSI YSIDQRVDPS DVNSGGRALA GFVRPYARAV AGRPLKMKFK RETGAFRFIY
QAEGEGETEI FVPNLQYPNG YDVEVEGGTV TRDEENQCLR VHAVGSDKVG VMITRR