Gene Bphy_7144 details

Gene Information

Locus tag: Bphy_7144
Symbol:
ID: 6248654
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia phymatum STM815
Kingdom: Bacteria
Replicon accession: NC_010625
Strand:
Start bp: 1799069
End bp: 1800373
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 63%
IMG OID: 642598789
Product: HK97 family phage major capsid protein
Protein accession: YP_001863191
Protein GI: 186471873
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
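
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates.
start_bp = 1799069
end_bp = 1800373

# Inclusive span: both the first and the last base are counted.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons, so the remainder should be 0.
codon_count, remainder = divmod(gene_length_bp, 3)

# Assumption: exactly one terminal stop codon, which is not part of the protein.
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1305 0 434
```

Under those assumptions the computed values match the 1305 bp and 434 aa reported in the record.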


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.00647338
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACAAGA AAATCCGTGC GCTGCAACAG CGCAAGGCTG AAAAGGTCGC GGCCATGCGC 
GCACTCGCTG ATGCGTGCGC AGAGCGCGAC ATGACCGACG AAGAGCAGAC GCAGTTCGAT
GCGCTGCAAG CGGAAGTCGG CAGCATCAAT GCGAGCATCG AGCGCGAAAC CGCACTGGCT
GCGGAAGAGC AAAGCGTCGG CATCCAGATC GCAGAGGATG CACGTATCGA AGTCATCGAG
AACCGCGCCA CTGATCCGCG TCGCGGCTTT CAGACGTTCG GCGAGTTCGC ACAGCGCGTT
CGCGCTGCTG CCGGTGGCGG TCGTATCGAT GACCGGCTGA TGCTCGGCGG CGGTGGCCCG
AACGCAGCAG CGCCGACGAC GTTCGGCAAT GAAGCGGGCG GGCAGGATGG CGGTTTCCTC
GTGCCGCCGC AGTTCGCCAG CGAAATTTTC ACGCTGTCGC TCGAAGAGCA GGCACTGCTC
CCGATGACCG ACTCGACGCC GATCAGCGGT AATTCGATGG TGTTTCCGAA GGATGAAACG
ACGCCGTGGG GCACCGACGG TATTCGCGCT TACTGGCAGG CAGAGGCGAG CGTGGCGACT
GCCACGAAGC CTAAGCTGGG TGTGAGCACG AACCGGCTGC ACAAGCTCAT GGCGCTCGTC
CCTGTGACGG ACGAACTGCT CGACGATGCC AGCGCGCTTG CGTCGTATCT GCCGGGCAAG
ACCGCCGCTT CGATCCGCTG GAAGACCGAC GAGTCGATTC TGTTCGGCAC TGGCGCGGGT
CAGCCGTGGG GCGTCATGAA GTCGGGCGCG CTGATCGTGG TCGCGAAGGA TAGCGGCCAG
GCAACCAACA CCCTCACGCC GACGAACATC AGCAACATGA TTTCGCGTCT GCCGGTGGGC
TCGTTCGGGC GCTCGTTCTG GCTCATCAAT CCCGATGTGC TGCCCGCGCT CGACAATCTG
ACGCTCGGCA ATTACCCGAT TTACATGCCG GTCGGCGGCG GCGACCGCGC TGCGGGCGGC
TCGCCCTATG GCATGTTGAA GGGTCGGCCC ATCGTGCTCA GTGAGCATGC GTCGCCGTTC
TCGTCGCAGT CCGATATCTC GCTGCTCGAT CTGTCCTACT ACCGCTCGAT CACGTCGCGC
GGCGGCATCC AGACGGCGAC GAGTATGCAC GTGTATTTCG ATGCAGATGC AACGGCTTTC
CGCACCACGT TCCGCGTGGA CGGCGGGCCC AAGATCGAAA ACGCCATCAC GCCACCCAAG
AGCACGAACA AGCGTTCGCC GTTCGTGACG CTTGCGGCTC GTTAA
 
Protein sequence
MNKKIRALQQ RKAEKVAAMR ALADACAERD MTDEEQTQFD ALQAEVGSIN ASIERETALA 
AEEQSVGIQI AEDARIEVIE NRATDPRRGF QTFGEFAQRV RAAAGGGRID DRLMLGGGGP
NAAAPTTFGN EAGGQDGGFL VPPQFASEIF TLSLEEQALL PMTDSTPISG NSMVFPKDET
TPWGTDGIRA YWQAEASVAT ATKPKLGVST NRLHKLMALV PVTDELLDDA SALASYLPGK
TAASIRWKTD ESILFGTGAG QPWGVMKSGA LIVVAKDSGQ ATNTLTPTNI SNMISRLPVG
SFGRSFWLIN PDVLPALDNL TLGNYPIYMP VGGGDRAAGG SPYGMLKGRP IVLSEHASPF
SSQSDISLLD LSYYRSITSR GGIQTATSMH VYFDADATAF RTTFRVDGGP KIENAITPPK
STNKRSPFVT LAAR
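
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch rather than the method the database itself uses: it builds the codon-to-amino-acid mapping shared by NCBI translation tables 1 and 11 (table 11 differs only in which codons may act as start codons, which does not matter here because this gene starts with ATG), strips the spacing used in the listing, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
# Codon table in NCBI layout: bases ordered T, C, A, G at each position.
# The amino-acid assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the listing in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last line of the gene sequence shown above.
first_bases = "ATGAACAAGA AAATCCGTGC GCTGCAACAG"
last_line = "AGCACGAACA AGCGTTCGCC GTTCGTGACG CTTGCGGCTC GTTAA"

print(translate(first_bases))  # MNKKIRALQQ
print(translate(last_line))    # STNKRSPFVTLAAR
```

Both fragments agree with the start and end of the protein sequence listed above. The last line can be translated on its own because it begins at base 1261, which is a codon boundary.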