Gene Information |
Locus tag | Bphy_7144 |
Symbol | |
ID | 6248654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phymatum STM815 |
Kingdom | Bacteria |
Replicon accession | NC_010625 |
Strand | + |
Start bp | 1799069 |
End bp | 1800373 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642598789 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_001863191 |
Protein GI | 186471873 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
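The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions, that the gene is unspliced (as the record states), and that the CDS ends in a single stop codon that is not translated. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1799069, 1800373)
print(length_bp)                  # 1305
print(protein_length(length_bp))  # 434
```

Under these assumptions the computed values agree with the Gene Length (1305 bp) and Protein Length (434 aa) fields in the record.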
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.00647338 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACAAGA AAATCCGTGC GCTGCAACAG CGCAAGGCTG AAAAGGTCGC GGCCATGCGC GCACTCGCTG ATGCGTGCGC AGAGCGCGAC ATGACCGACG AAGAGCAGAC GCAGTTCGAT GCGCTGCAAG CGGAAGTCGG CAGCATCAAT GCGAGCATCG AGCGCGAAAC CGCACTGGCT GCGGAAGAGC AAAGCGTCGG CATCCAGATC GCAGAGGATG CACGTATCGA AGTCATCGAG AACCGCGCCA CTGATCCGCG TCGCGGCTTT CAGACGTTCG GCGAGTTCGC ACAGCGCGTT CGCGCTGCTG CCGGTGGCGG TCGTATCGAT GACCGGCTGA TGCTCGGCGG CGGTGGCCCG AACGCAGCAG CGCCGACGAC GTTCGGCAAT GAAGCGGGCG GGCAGGATGG CGGTTTCCTC GTGCCGCCGC AGTTCGCCAG CGAAATTTTC ACGCTGTCGC TCGAAGAGCA GGCACTGCTC CCGATGACCG ACTCGACGCC GATCAGCGGT AATTCGATGG TGTTTCCGAA GGATGAAACG ACGCCGTGGG GCACCGACGG TATTCGCGCT TACTGGCAGG CAGAGGCGAG CGTGGCGACT GCCACGAAGC CTAAGCTGGG TGTGAGCACG AACCGGCTGC ACAAGCTCAT GGCGCTCGTC CCTGTGACGG ACGAACTGCT CGACGATGCC AGCGCGCTTG CGTCGTATCT GCCGGGCAAG ACCGCCGCTT CGATCCGCTG GAAGACCGAC GAGTCGATTC TGTTCGGCAC TGGCGCGGGT CAGCCGTGGG GCGTCATGAA GTCGGGCGCG CTGATCGTGG TCGCGAAGGA TAGCGGCCAG GCAACCAACA CCCTCACGCC GACGAACATC AGCAACATGA TTTCGCGTCT GCCGGTGGGC TCGTTCGGGC GCTCGTTCTG GCTCATCAAT CCCGATGTGC TGCCCGCGCT CGACAATCTG ACGCTCGGCA ATTACCCGAT TTACATGCCG GTCGGCGGCG GCGACCGCGC TGCGGGCGGC TCGCCCTATG GCATGTTGAA GGGTCGGCCC ATCGTGCTCA GTGAGCATGC GTCGCCGTTC TCGTCGCAGT CCGATATCTC GCTGCTCGAT CTGTCCTACT ACCGCTCGAT CACGTCGCGC GGCGGCATCC AGACGGCGAC GAGTATGCAC GTGTATTTCG ATGCAGATGC AACGGCTTTC CGCACCACGT TCCGCGTGGA CGGCGGGCCC AAGATCGAAA ACGCCATCAC GCCACCCAAG AGCACGAACA AGCGTTCGCC GTTCGTGACG CTTGCGGCTC GTTAA
|
Protein sequence | MNKKIRALQQ RKAEKVAAMR ALADACAERD MTDEEQTQFD ALQAEVGSIN ASIERETALA AEEQSVGIQI AEDARIEVIE NRATDPRRGF QTFGEFAQRV RAAAGGGRID DRLMLGGGGP NAAAPTTFGN EAGGQDGGFL VPPQFASEIF TLSLEEQALL PMTDSTPISG NSMVFPKDET TPWGTDGIRA YWQAEASVAT ATKPKLGVST NRLHKLMALV PVTDELLDDA SALASYLPGK TAASIRWKTD ESILFGTGAG QPWGVMKSGA LIVVAKDSGQ ATNTLTPTNI SNMISRLPVG SFGRSFWLIN PDVLPALDNL TLGNYPIYMP VGGGDRAAGG SPYGMLKGRP IVLSEHASPF SSQSDISLLD LSYYRSITSR GGIQTATSMH VYFDADATAF RTTFRVDGGP KIENAITPPK STNKRSPFVT LAAR
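The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a display string might be normalized before analysis, and how a GC percentage could be computed from it, is shown below. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(display_text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(display_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First two display blocks of the gene sequence, used here only as sample input.
sample = clean_sequence("ATGAACAAGA AAATCCGTGC")
print(len(sample))         # 20
print(gc_percent(sample))  # 40.0
```

Applying the same two helpers to the full Gene sequence field would give a value that can be compared against the GC content field; the record reports that value rounded to a whole percent.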