Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcenmc03_1284 |
Symbol | |
ID | 6122962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia MC0-3 |
Kingdom | Bacteria |
Replicon accession | NC_010508 |
Strand | + |
Start bp | 1410533 |
End bp | 1411783 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641637858 |
Product | major capsid protein HK97 |
Protein accession | YP_001764581 |
Protein GI | 170732634 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0461297 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACTGA ATGAAATCCG CGAGCAGAAG GCTCGCAAAA TCACCGAAAT GCGCGCGCTG CTCACGAACG CCGAGCGCGA AAAGCGCAGC CTGTCGGCCG ACGAGCAAGC GAAGTTCGAC GCACTGAAGG CCGACGTGAC GGCACTCGAA GCCGACGAGC AACGCGCGCA GTTCATGGCC GATCTGGAAC GCCGCACGAT GGGCGACCGC GTGACGGCTA CCGACTTCGC AACCGTCGAA AATCGCGTCG TGCTGCTCGA TGTGATCCGC GCCGGCATGG AAGGCCGCTC GCTCGATGGC GCGGCGGCTG AGTTCGCACA GGAAACCGAG CGCCGCACCG GTCGCAAGGC GCAGGGCTTC TATGTACCGA TGGCGGCATT CGATACGCGT CCGGTCGAGC AGCGCGACGC GCAGACCACG ACGACGGCGG CCGGCATCGT CCCGAACGAC TTCCGCGCCG ATCAATTCAT CGGCCCCCTG CGTAACGCGC TGGTGATGCG TGCGCTCGGC GCGCGCGTGC TGACTGGCCT GCGTGGAGAC GTTGAAATTC CGAAGTTCAA AACCGGCATG ACGGCCGGCT GGGTGTCGGA AAACGAACCG CTGCAACGCT CGGGCATGTC GTTCGACAAT CCCGTCACGC TCAAGCCGCG TCACGTCGGC GCACTCACCG AGTTGTCCCG CCAGTTGATC CAGCAATCCA GCCCGGACAT TAACGGCCTG GTGCGCGACG ACCTGTCGGC CGTGATGGGC GAAGCACTGG ATTCTGCGCT GCTGTCGGGC GACGGCCAGA AACAGCCGCT CGGCCTGCTC AACATGGTCG GCATCCAAAC GGCCTCGCTC GCGGCGCCAT CGTGGGACGG CGTGCAGCGC ATTGTCGAAA AGCTCGATCT GGTGAACGTC AACGGCGGCC GCTGGCTGTC GAATCCGAGC GTCAAGCGCG TGCTGGCGAC GACGGAAAAG GCCGCGAACA CGGGCATTTT CCTGACCGAT GGCGCGTCGC TGGCCGGCTA TCCGCTGGTG ACGACGAATC AGGTCAAGGC GAAGGGCGGC GCTACGCCGA CCGGCCGCCT GATCTTCGGC GATTTCTCGC AGCTGATTCT CGGCATCTGG AGCGAGGTCG ACATTCTTGT GAACCCGTAT GCGGAAAGCG CCTACGAAAA GGGGAATGTG CTGGTGCGGG CAATGATGAC GTGCGATCAA GCCGTGCGCC ATCCGGAAGC GTTCGTCGCC GTCGACGACG TGAAGATCTG A
|
Protein sequence | MRLNEIREQK ARKITEMRAL LTNAEREKRS LSADEQAKFD ALKADVTALE ADEQRAQFMA DLERRTMGDR VTATDFATVE NRVVLLDVIR AGMEGRSLDG AAAEFAQETE RRTGRKAQGF YVPMAAFDTR PVEQRDAQTT TTAAGIVPND FRADQFIGPL RNALVMRALG ARVLTGLRGD VEIPKFKTGM TAGWVSENEP LQRSGMSFDN PVTLKPRHVG ALTELSRQLI QQSSPDINGL VRDDLSAVMG EALDSALLSG DGQKQPLGLL NMVGIQTASL AAPSWDGVQR IVEKLDLVNV NGGRWLSNPS VKRVLATTEK AANTGIFLTD GASLAGYPLV TTNQVKAKGG ATPTGRLIFG DFSQLILGIW SEVDILVNPY AESAYEKGNV LVRAMMTCDQ AVRHPEAFVA VDDVKI
|
| |