Gene Plav_1122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_1122 
Symbol 
ID5456092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp1236026 
End bp1237318 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content62% 
IMG OID640876692 
Product17 kDa surface antigen 
Protein accessionYP_001412400 
Protein GI154251576 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4520] Surface antigen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCGGC ATGGGCATAT AACGAAACTC ACTGGCGTCG CGACGGCGAT TGCCTTGTGT 
CTTGGAACGA GCGCAACGGC TTTCGCTGCC CCGGGCGATA ATGGACGGGA CGCCCGCTGG
GGAAGAGAAC CGCAACGCGA GCATAGAGAG CGGCAGACGC AGGAAAGGCG AGTTCAGCCT
CAGCACCGGG ACGCCGATCG ACAGCCGCAC CAGCGTCAGC AAGCCGACCA ACAGCAGCGG
CAACGCCAGC AGGCGGAGCA GCAACAGCGC CAGCGTCAGC AAGCCGAGCA TCAGCAGCGG
CAGCGTCAGC AGGCGGAGCA GCAACAGCGC CAGCGTCAGC AAGCCGAGCA ACATCAGCGT
CAGCGCCAGC AGGCGGAGCA GCAACAGCGC CAGCGTCAGC AAGCCGAGCA ACACCAGCGT
CAGCGCCAAC AGGCGGAGCA ACAGCAGCGC CAACGTCAAC AAGCCGAGCA ACAGCAGCGC
CAACGTCAAC AAGCCGAGCA ACAGCAGCGC CAACGTCAGC AAGCGGAGCA CCAGCAGCGG
CAGCGCCAAC GGGCGGAGCA GCAGCGCCAG CGGCAACACG TCCACCAGAA TGAACGTGAC
CGGCAGCGTT ATCAGGCCCC GCCGGATCGC GGCTCACATT ACAAACAAGT GAACCATCGT
CAGACCTATC GGCCTGTCTA CAAGGACCGC CCCTGGTATG GCCACCACTA TGCAAAAGGG
CATCGTCCGA GCCATCGTGG TCCTGTGGTC ATATACCGCA ACTCTTATCA CGTCGTGACG
ATCCCGCAGC CGGTCTATAT CGAACCCATG CCGGGCTACT ACGATGCGCC TTACGGCTAT
TACGATTCCG TCTATGGCTA TGACGATAAT TATGGTGGCT ATGGCTATCG CCGCTCGGCT
TGCAACAGCG ACAAGGTCGG TGCGGTTATT GGCGCCGTAC TTGGCGGCGT TATTGGCGCC
GAGCAGGGCA GGGGCGCCGG TCCGCTGGTT GGCGGTGCGG TCATTGGCGC GGTGCTTGGC
GGCGTCCTCG GCCATGCAAT CGATGCGAAC AATCAGGCCT GTGTCGGCGA CGTGCTGGAA
TATGTTCCCT CCAACCAATC CGTCTACTGG ACAGATCCGG GCAATGGCTA TGGCTATGGC
TATGAGGTAA CGCCCATGCG TACTTACGAG CCGTATGAGG GCCAGTATTG CCGCGAATAT
CAGACGATCG TCACGATTGG TGGCCGGACC GAGCAGGCCT ATGGCACCGC CTGCCGTCAG
CCGGACGGCG CTTGGAAGAC CGTCAACAGC TGA
 
Protein sequence
MVRHGHITKL TGVATAIALC LGTSATAFAA PGDNGRDARW GREPQREHRE RQTQERRVQP 
QHRDADRQPH QRQQADQQQR QRQQAEQQQR QRQQAEHQQR QRQQAEQQQR QRQQAEQHQR
QRQQAEQQQR QRQQAEQHQR QRQQAEQQQR QRQQAEQQQR QRQQAEQQQR QRQQAEHQQR
QRQRAEQQRQ RQHVHQNERD RQRYQAPPDR GSHYKQVNHR QTYRPVYKDR PWYGHHYAKG
HRPSHRGPVV IYRNSYHVVT IPQPVYIEPM PGYYDAPYGY YDSVYGYDDN YGGYGYRRSA
CNSDKVGAVI GAVLGGVIGA EQGRGAGPLV GGAVIGAVLG GVLGHAIDAN NQACVGDVLE
YVPSNQSVYW TDPGNGYGYG YEVTPMRTYE PYEGQYCREY QTIVTIGGRT EQAYGTACRQ
PDGAWKTVNS