Gene Bcenmc03_1284 details

Gene Information

Locus tag: Bcenmc03_1284
Symbol:
ID: 6122962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia cenocepacia MC0-3
Kingdom: Bacteria
Replicon accession: NC_010508
Strand:
Start bp: 1410533
End bp: 1411783
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 64%
IMG OID: 641637858
Product: major capsid protein HK97
Protein accession: YP_001764581
Protein GI: 170732634
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
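
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function name `check_lengths` is hypothetical, and it assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon (consistent with the gene sequence in this record ending in TGA).

```python
def check_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is counted in the gene length but not in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values copied from the record above.
gene_bp, protein_aa = check_lengths(1410533, 1411783)
print(gene_bp, protein_aa)
```

Under those assumptions the result agrees with the listed Gene Length (1251 bp) and Protein Length (416 aa).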


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0461297
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGACTGA ATGAAATCCG CGAGCAGAAG GCTCGCAAAA TCACCGAAAT GCGCGCGCTG 
CTCACGAACG CCGAGCGCGA AAAGCGCAGC CTGTCGGCCG ACGAGCAAGC GAAGTTCGAC
GCACTGAAGG CCGACGTGAC GGCACTCGAA GCCGACGAGC AACGCGCGCA GTTCATGGCC
GATCTGGAAC GCCGCACGAT GGGCGACCGC GTGACGGCTA CCGACTTCGC AACCGTCGAA
AATCGCGTCG TGCTGCTCGA TGTGATCCGC GCCGGCATGG AAGGCCGCTC GCTCGATGGC
GCGGCGGCTG AGTTCGCACA GGAAACCGAG CGCCGCACCG GTCGCAAGGC GCAGGGCTTC
TATGTACCGA TGGCGGCATT CGATACGCGT CCGGTCGAGC AGCGCGACGC GCAGACCACG
ACGACGGCGG CCGGCATCGT CCCGAACGAC TTCCGCGCCG ATCAATTCAT CGGCCCCCTG
CGTAACGCGC TGGTGATGCG TGCGCTCGGC GCGCGCGTGC TGACTGGCCT GCGTGGAGAC
GTTGAAATTC CGAAGTTCAA AACCGGCATG ACGGCCGGCT GGGTGTCGGA AAACGAACCG
CTGCAACGCT CGGGCATGTC GTTCGACAAT CCCGTCACGC TCAAGCCGCG TCACGTCGGC
GCACTCACCG AGTTGTCCCG CCAGTTGATC CAGCAATCCA GCCCGGACAT TAACGGCCTG
GTGCGCGACG ACCTGTCGGC CGTGATGGGC GAAGCACTGG ATTCTGCGCT GCTGTCGGGC
GACGGCCAGA AACAGCCGCT CGGCCTGCTC AACATGGTCG GCATCCAAAC GGCCTCGCTC
GCGGCGCCAT CGTGGGACGG CGTGCAGCGC ATTGTCGAAA AGCTCGATCT GGTGAACGTC
AACGGCGGCC GCTGGCTGTC GAATCCGAGC GTCAAGCGCG TGCTGGCGAC GACGGAAAAG
GCCGCGAACA CGGGCATTTT CCTGACCGAT GGCGCGTCGC TGGCCGGCTA TCCGCTGGTG
ACGACGAATC AGGTCAAGGC GAAGGGCGGC GCTACGCCGA CCGGCCGCCT GATCTTCGGC
GATTTCTCGC AGCTGATTCT CGGCATCTGG AGCGAGGTCG ACATTCTTGT GAACCCGTAT
GCGGAAAGCG CCTACGAAAA GGGGAATGTG CTGGTGCGGG CAATGATGAC GTGCGATCAA
GCCGTGCGCC ATCCGGAAGC GTTCGTCGCC GTCGACGACG TGAAGATCTG A
 
Protein sequence
MRLNEIREQK ARKITEMRAL LTNAEREKRS LSADEQAKFD ALKADVTALE ADEQRAQFMA 
DLERRTMGDR VTATDFATVE NRVVLLDVIR AGMEGRSLDG AAAEFAQETE RRTGRKAQGF
YVPMAAFDTR PVEQRDAQTT TTAAGIVPND FRADQFIGPL RNALVMRALG ARVLTGLRGD
VEIPKFKTGM TAGWVSENEP LQRSGMSFDN PVTLKPRHVG ALTELSRQLI QQSSPDINGL
VRDDLSAVMG EALDSALLSG DGQKQPLGLL NMVGIQTASL AAPSWDGVQR IVEKLDLVNV
NGGRWLSNPS VKRVLATTEK AANTGIFLTD GASLAGYPLV TTNQVKAKGG ATPTGRLIFG
DFSQLILGIW SEVDILVNPY AESAYEKGNV LVRAMMTCDQ AVRHPEAFVA VDDVKI
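
The sequences above are printed in space-separated blocks of ten, so they need cleaning before use. The following is a minimal sketch of how the gene sequence could be cleaned, its GC content computed, and its translation compared with the protein sequence. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, not part of any database tooling. The codon table is written out by hand; the amino acid assignments of translation table 11 match the standard code, and the alternative start codons that table 11 allows are not handled here, which is acceptable for this gene because it starts with ATG.

```python
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Map every codon (TTT, TTC, ..., GGG) to its one-letter amino acid.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAGACTGA ATGAAATCCG CGAGCAGAAG"))
```

Passing the full gene sequence to `translate` and the full protein sequence to `clean` allows a direct comparison of the two, and `gc_percent` can be compared against the GC content listed under Gene Information.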