Gene Plav_0906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_0906 
Symbol 
ID5454190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp977106 
End bp978413 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content66% 
IMG OID640876477 
ProductHK97 family phage major capsid protein 
Protein accessionYP_001412186 
Protein GI154251362 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.225289 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.497938 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCATT GGAATGACGG CGTGCCGCGC ATCGGCGCGT CGCTGAAGAA GGAACCGCGC 
GCGCCCGAGA CGAAAAGCGC CGAGACGCGG AGCGCGAGCG CGCATGAAGT GCGGGAAGCG
ATGGACGAAT TTCTCTCGTC CTTCGAGGAT TTCAAATCGG CGAATGATGA GCGGCTCGGC
GAGCTCGAGC GCAAGCTCAC CGCCGACGTG CTGACGGAAG AGAAGGTCGA CCGCATCAAC
CGCGCGCTCG ACACGCAGAA GAAGAAGATG GACGAGCTGA CGCTCGCCGC CGCCCGCCCC
GAGATCGGCG GCACCCGCGC GGGCGAGACC TATGCGGGGC GCGAACACAA GCGCGCCTTC
GACCGCTATG TCCGCAAGGG CGAGGCGCAT GAATTGCGCG GGCTGGAAGC GAAGGCGCTT
TCGGTCGGCT CCGATCCCGA TGGCGGCTAC CTGGTGCCGG TGGAGACCGA GAAGCTGATC
GACCGCATCA TCTCCGAAGT CTCGCCGATC CGCGCCATTG CGGGCATCCG GCAGATCGGT
TCGGCAAGCT ACAAGAAGCC CTTCGCCGCC GGCGGCATGC AGACCGGCTG GGTCGGCGAA
ACGGAAGCGC GGCCGCAGAC GGCAACGCCG TCGCTCGCCG AAATCGAGTT TCCGGCGATG
GAGCTCTATG CGATGCCGGC GGCGACGCCG ACGCTGCTCG ACGACGCGGC GGTGAACATC
GACCAGTGGC TGGCGGAAGA AGTGCAGACG GCCTTCGCCG AACAGGAAGG CGCCGCCTTC
GTCATCGGCG ACGGCGTGAA GAAACCGCGC GGCTTCCTCG ACTACGACAT GGTGGCGGAG
AATGCCTGGG AATGGGGCAA GCTCGGCTTC ATCGCGACGG GGAACGCGGG CGGCTTTCCG
ACCTCGAACC CGGCCGACAA GCTGATCGAC CTCGTCTATG CGGTGAAGGC GGGCTACCGC
GCCAATGGCC GCTTCGTCAT GAACCGCTCG ACGCAATCCT CGATCCGCAA GTTCAAGGAT
ACGGACGGCA ACTATCTCTG GCAGCCGGCC GTCGCCGCCG GTCAGCCGCC GACGCTCCTC
AACTACGCGG TGACGGAAGC GGAGGACATG CCTTCGATGG AAGCGGGCGC TCCGGCGGTT
GCCTTCGGCG ATTTCCGGCG CGGCTACCTG ATCGTCGACC GGCTCGGCGT GCGGGTGCTG
CGCGATCCCT ACAGCGCCAA GCCCTATGTG CTCTTCTACA CGACGAAGCG CGTGGGCGGC
GGCGTGCAGA ACTTCGAGGC GATCAAACTC CTCAAGTTCC AGGCCTGA
 
Protein sequence
MSHWNDGVPR IGASLKKEPR APETKSAETR SASAHEVREA MDEFLSSFED FKSANDERLG 
ELERKLTADV LTEEKVDRIN RALDTQKKKM DELTLAAARP EIGGTRAGET YAGREHKRAF
DRYVRKGEAH ELRGLEAKAL SVGSDPDGGY LVPVETEKLI DRIISEVSPI RAIAGIRQIG
SASYKKPFAA GGMQTGWVGE TEARPQTATP SLAEIEFPAM ELYAMPAATP TLLDDAAVNI
DQWLAEEVQT AFAEQEGAAF VIGDGVKKPR GFLDYDMVAE NAWEWGKLGF IATGNAGGFP
TSNPADKLID LVYAVKAGYR ANGRFVMNRS TQSSIRKFKD TDGNYLWQPA VAAGQPPTLL
NYAVTEAEDM PSMEAGAPAV AFGDFRRGYL IVDRLGVRVL RDPYSAKPYV LFYTTKRVGG
GVQNFEAIKL LKFQA