Gene Plav_0903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_0903 
Symbol 
ID5454730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp974469 
End bp975644 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content67% 
IMG OID640876474 
ProductHK97 family phage portal protein 
Protein accessionYP_001412183 
Protein GI154251359 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.580107 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAATC CGTTGACGTC GCTGGCGCGG CTGGTGCGGC CGCGTGAGGC GAAGCACTCG 
CGCGTGGCGC CGGTGATTGC GTTGCATATG CAGGGGAGGG CCGTGTGGAC GCCGCGGGAT
TATGCGCCGC TGGCGGAGGA GGGCTATCAG CGGAACGCAA TTGCCTATCG CTGCGTGCGG
ATGATTGCCG AGGCGGCGGC GAGTGTGCCC TGGCTGCTTT ATGACGGGGC GCGGGAGCTG
AGCGAGCATC CGCTGCTGCG GCTGATCGAA AGCCCGAACA GGGGGCAGGC GGGGGCGGAG
CTTTTCGAGA CCTGGTACAG CTACCTGCAG GTGGCGGGGA ATGCCTATCT CGAACTTGTG
GAGGTGGACG GGGCCCCGCG CGAGCTTTAT GCGCTGAGGC CAGACCGCAT GAAGGCGGTG
CCGGGGCGGG CGGGCTGGCC GGAGGCTTAC GAATATTCCG TGAACGGACG GAGCGTGACT
ATTCCCTGCG GCGAGCGGAG CCCGGTGCTG CATATGCGGC TCTTCCACCC TTCCGACGAT
CATTATGGCT TGAGCCCGCT GGAAGCGGCG GCCTATGCCA TCGACATTCA CAATGCGGCC
GGCGCCTGGA ACAAGGCGCT GCTCGACAAT GCGGCGCGGC CTTCCGGCGC GCTGGTCTAC
AAGGGCGGCG AGGCGGGCGC GAACCTCACC GAAGATCAGT TCGAGCGGCT GAAGCGGGAG
CTGGCGGAAA ATTATCAGGG CGCGGCCAAT GCCGGGCGGC CGCTGCTGCT GGAAGGCGGA
CTCGACTGGC AGAGCATGGG GCTTTCGCCG AAGGATATGG ACTTCATCGA GGCGAAGCGG
ACGGCGGCGC GGGAAATCGC GCTCGCTTTC GGCGTGCCGC CGATGCTGCT CGGCATTCCG
GGCGACAATA CCTATTCCAA TTACCGCGAG GCGAACCGGG CCTTCTGGCG CGGCACCGTG
CTGCCGCTGG TCGGCCGCTC GGCACGCGCG CTGACGCATT GGCTGGCACC CCGCTATGAG
GGGAAGCTCA GGCTCTGGTA TGACGCCGAC CAGGTGGAGG CGCTGGCCGC CGACCGCGAC
GCGCTGTGGG CGCGGGTGGG CGCGGCCGAT TTCCTCAGCG ACGAGGAAAA GCGCGAGGCA
GTGGGCTATG GCAAGGTCAA AGCGTCTTCG ACTTGA
 
Protein sequence
MPNPLTSLAR LVRPREAKHS RVAPVIALHM QGRAVWTPRD YAPLAEEGYQ RNAIAYRCVR 
MIAEAAASVP WLLYDGAREL SEHPLLRLIE SPNRGQAGAE LFETWYSYLQ VAGNAYLELV
EVDGAPRELY ALRPDRMKAV PGRAGWPEAY EYSVNGRSVT IPCGERSPVL HMRLFHPSDD
HYGLSPLEAA AYAIDIHNAA GAWNKALLDN AARPSGALVY KGGEAGANLT EDQFERLKRE
LAENYQGAAN AGRPLLLEGG LDWQSMGLSP KDMDFIEAKR TAAREIALAF GVPPMLLGIP
GDNTYSNYRE ANRAFWRGTV LPLVGRSARA LTHWLAPRYE GKLRLWYDAD QVEALAADRD
ALWARVGAAD FLSDEEKREA VGYGKVKASS T