Gene Information |
Locus tag | BTH_II1047 |
Symbol | |
ID | 3845039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 1221666 |
End bp | 1222991 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637838350 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_439244 |
Protein GI | 83716436 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
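The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions match the listed values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1221666
END_BP = 1222991


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1326
print(protein_length(length_bp))  # 441
```

Both results agree with the Gene Length (1326 bp) and Protein Length (441 aa) rows.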
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.144894 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCAGCTTC GCTGCATACC CTCATCAACG CTCTTCAAGG ATGACAAGTT GAGCAAAAAA CTTCTAATTA CCGCGCTCGC GGCAGCCCTG TCGGGCACGG CTGGTGCGGT GCCGCGCGGC ATCATGTCCG TGCGCGCCGA GACTCCGGGC GAAATCAAAG CCCTGATCGA CAATTTGCAG AAAGCGTTTC ACGATTTCAA GGCTGAGCAT ACGAGGCAAC TCGATGCGGT GAGAGCCGGT CTGCCGATGT CGGACGCTAT GGCGAAGGTC GACAAGGTCA GTGCCGATCT CGAAGCACTT CAGGCGGCCG TCGACGAGAC CAATATCAAG CTCGCTGCAG CGCAGATGGG TGCGAACGGC GCGAAGCCGC TGCGCGACGC CGAATATACC GACGCGTTCA ACGCTCATTT CAAGCGTGGC GACATCAACG CCGCACTTCA CAAGGGCGAG GACGGCGAGG GCGGATACCT GACGCCGATC GAGTGGGACC GCTCGATCAC GAAGAAGCTC GTGCAGATCT CGCCGATGCG TCAACTGTGC CGCGTTCAGT CGGTTTCGAA GGCCGGCTTT TCGAAGCTGT TCAACATGGG AGGCACGGCA AGTGGGTGGG TCGGTGAAGC CGACGATCGC CCGCAAACGG GCACGGCCGC GTTTGCGTCG CTCGCGTTCG GGCACGGCGA GATTTACGCG AATCCCGCCG CGACGCAGGG CATTCTCGAC GACAGCGAAA TCGATCTCGA ATCGTGGCTC GCCGAAGAGG TGCAAACTGA ATTCGCGAAG CAGGAGGGAC GGGCATTCCT CGCCGGTGAT GGTACGAAGA AGCCGACCGG CATTCTTACG TACGTGGACG GCGGCGCAAA CGCGAAGAAG CATCCATTCG GGGCGATCGG GGTGGTGAAC AGCGGCGCTG CCGCGGGCAT CACGTCGGAC GGCATCATCG ATCTGATCTA CGATCTGCCG AGCGCGTTCA CGGGCAACGC GCGCTTCACG ATGAACCGCA ATACGCAACG AGCGGTACGC AAGCTGAAAG ACGGCCAAGG CAATTACCTG TGGCAACCGT CGTACGTTGC TGGCCAGCCG GCGACGCTCG CGGGCTACCC CGTGACGGAA GTGCCCGACA TGCCGGATGT CGCGGCGAAC TCAACGCCGA TTCTCTTCGG CGACTTCATG CAGACGTATC TGATCATCGA TCGCATCGGC GTGCGCGTGC TGCGCGATCC GTATACGGCG AAACCGTACG TCCTGTTCTA TACGACGAAG CGCGTCGGTG GCGGCCTGCT TAATCCGCAG CCGATGCGCG CGCTGAAGGT GGCGGTGAGC GCGTAA
|
Protein sequence | MQLRCIPSST LFKDDKLSKK LLITALAAAL SGTAGAVPRG IMSVRAETPG EIKALIDNLQ KAFHDFKAEH TRQLDAVRAG LPMSDAMAKV DKVSADLEAL QAAVDETNIK LAAAQMGANG AKPLRDAEYT DAFNAHFKRG DINAALHKGE DGEGGYLTPI EWDRSITKKL VQISPMRQLC RVQSVSKAGF SKLFNMGGTA SGWVGEADDR PQTGTAAFAS LAFGHGEIYA NPAATQGILD DSEIDLESWL AEEVQTEFAK QEGRAFLAGD GTKKPTGILT YVDGGANAKK HPFGAIGVVN SGAAAGITSD GIIDLIYDLP SAFTGNARFT MNRNTQRAVR KLKDGQGNYL WQPSYVAGQP ATLAGYPVTE VPDMPDVAAN STPILFGDFM QTYLIIDRIG VRVLRDPYTA KPYVLFYTTK RVGGGLLNPQ PMRALKVAVS A
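The gene sequence begins with TTG, yet the protein sequence begins with M. This follows from translation table 11, in which TTG is one of the permitted start codons and an initiating codon is read as methionine. The sketch below is a minimal, dependency-free illustration of that rule, not the database's own pipeline: the codon-to-amino-acid mapping is the standard one (which table 11 shares), `translate_cds` is a hypothetical helper name, and only the first 30 bases of the gene are used as input.

```python
from itertools import product

# Standard codon mapping in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at the first stop."""
    seq = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    if not codons:
        return ""
    first = "M" if codons[0] in START_CODONS else CODON_TABLE[codons[0]]
    protein = first + "".join(CODON_TABLE[c] for c in codons[1:])
    return protein.split("*")[0]


# First three 10-base blocks of the gene sequence above.
print(translate_cds("TTGCAGCTTC GCTGCATACC CTCATCAACG"))  # MQLRCIPSST
```

The output matches the first block of the protein sequence. Passing the full gene sequence to the same function should reproduce the full protein, under the same assumptions.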