Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_0906 |
Symbol | |
ID | 5454190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 977106 |
End bp | 978413 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640876477 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_001412186 |
Protein GI | 154251362 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.225289 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.497938 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCATT GGAATGACGG CGTGCCGCGC ATCGGCGCGT CGCTGAAGAA GGAACCGCGC GCGCCCGAGA CGAAAAGCGC CGAGACGCGG AGCGCGAGCG CGCATGAAGT GCGGGAAGCG ATGGACGAAT TTCTCTCGTC CTTCGAGGAT TTCAAATCGG CGAATGATGA GCGGCTCGGC GAGCTCGAGC GCAAGCTCAC CGCCGACGTG CTGACGGAAG AGAAGGTCGA CCGCATCAAC CGCGCGCTCG ACACGCAGAA GAAGAAGATG GACGAGCTGA CGCTCGCCGC CGCCCGCCCC GAGATCGGCG GCACCCGCGC GGGCGAGACC TATGCGGGGC GCGAACACAA GCGCGCCTTC GACCGCTATG TCCGCAAGGG CGAGGCGCAT GAATTGCGCG GGCTGGAAGC GAAGGCGCTT TCGGTCGGCT CCGATCCCGA TGGCGGCTAC CTGGTGCCGG TGGAGACCGA GAAGCTGATC GACCGCATCA TCTCCGAAGT CTCGCCGATC CGCGCCATTG CGGGCATCCG GCAGATCGGT TCGGCAAGCT ACAAGAAGCC CTTCGCCGCC GGCGGCATGC AGACCGGCTG GGTCGGCGAA ACGGAAGCGC GGCCGCAGAC GGCAACGCCG TCGCTCGCCG AAATCGAGTT TCCGGCGATG GAGCTCTATG CGATGCCGGC GGCGACGCCG ACGCTGCTCG ACGACGCGGC GGTGAACATC GACCAGTGGC TGGCGGAAGA AGTGCAGACG GCCTTCGCCG AACAGGAAGG CGCCGCCTTC GTCATCGGCG ACGGCGTGAA GAAACCGCGC GGCTTCCTCG ACTACGACAT GGTGGCGGAG AATGCCTGGG AATGGGGCAA GCTCGGCTTC ATCGCGACGG GGAACGCGGG CGGCTTTCCG ACCTCGAACC CGGCCGACAA GCTGATCGAC CTCGTCTATG CGGTGAAGGC GGGCTACCGC GCCAATGGCC GCTTCGTCAT GAACCGCTCG ACGCAATCCT CGATCCGCAA GTTCAAGGAT ACGGACGGCA ACTATCTCTG GCAGCCGGCC GTCGCCGCCG GTCAGCCGCC GACGCTCCTC AACTACGCGG TGACGGAAGC GGAGGACATG CCTTCGATGG AAGCGGGCGC TCCGGCGGTT GCCTTCGGCG ATTTCCGGCG CGGCTACCTG ATCGTCGACC GGCTCGGCGT GCGGGTGCTG CGCGATCCCT ACAGCGCCAA GCCCTATGTG CTCTTCTACA CGACGAAGCG CGTGGGCGGC GGCGTGCAGA ACTTCGAGGC GATCAAACTC CTCAAGTTCC AGGCCTGA
|
Protein sequence | MSHWNDGVPR IGASLKKEPR APETKSAETR SASAHEVREA MDEFLSSFED FKSANDERLG ELERKLTADV LTEEKVDRIN RALDTQKKKM DELTLAAARP EIGGTRAGET YAGREHKRAF DRYVRKGEAH ELRGLEAKAL SVGSDPDGGY LVPVETEKLI DRIISEVSPI RAIAGIRQIG SASYKKPFAA GGMQTGWVGE TEARPQTATP SLAEIEFPAM ELYAMPAATP TLLDDAAVNI DQWLAEEVQT AFAEQEGAAF VIGDGVKKPR GFLDYDMVAE NAWEWGKLGF IATGNAGGFP TSNPADKLID LVYAVKAGYR ANGRFVMNRS TQSSIRKFKD TDGNYLWQPA VAAGQPPTLL NYAVTEAEDM PSMEAGAPAV AFGDFRRGYL IVDRLGVRVL RDPYSAKPYV LFYTTKRVGG GVQNFEAIKL LKFQA
|
| |