Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II1044 |
Symbol | |
ID | 3844871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 1217844 |
End bp | 1219586 |
Gene Length | 1743 bp |
Protein Length | 580 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637838347 |
Product | phage terminase, large subunit, putative |
Protein accession | YP_439241 |
Protein GI | 83717767 |
COG category | [R] General function prediction only |
COG ID | [COG4626] Phage terminase-like protein, large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.176224 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATGGA CTACGGCATG CCCTGATTGG GAAACGCGGC TGATCGAGCG CCGATCAATC ATTCCACCGC CAATCTTCCG TGATCCGGCC GAGCATGCTC TGGCGATCTT CAAGCAACTG AAGGTGGTGG ATCTCCCGAA GGTATGGGAC GCTGAGATCG AGGAATGGCG GCCGCCCACG TTTGGCGAAT GCTCCGAAGA GTGGGTCTTC GACTTCGTGC GCGCCATCTT CGGCGGTTGC GATCCCGAGA CAGGTAAGCA GCTGATCCGT GAGTATGGGC TGCTGATCTC GAAAAAGAAC ACGAAATCGA CTATCGCCGC CGGCATCATG CTGACGGCGC TCATTCTGTG CTGGCGCGAG GAAGAAGAAC ATCTGATCTT GGCACCGACA AAGGAGGTCG CAGATAACAG CTTTAAGCCG GCAGCGGGGA TGATTCGGAC TGACGAAGAG CTGTCCGCGT TGTTCCACAT CCAAGATCAC ATTCGCACGA TCACGCACCG CGTGAACCGA AATAGCCTGA AGGTCGTCGC TGCCGACACG GACACCGTTT CCGGTAAGAA GTCAGGCCGT ATTCTGGTCG ACGAGCTTTG GCTGTTTGGC AAGCGCGCGA ACGCTGCGGC AATGTTTCTC GAAGCGCTGG GCGGTCAGGT GTCGCGAGAC GAGGGTTGGG TCATTTTCCT GACCACTCAG TCCGACGACC CGCCGGCCGG TGTGTTCCGA CAGAAGTTGA ATTACTGGCG TGACGTGCGG GACGGGAAGA TCAACGACCC GAAGACGCTA GGCATCCTGT ACGAGTTTCC TGCGGCGATG GTGATGGCGA AAGCCTACAT GCGGCCGGAG AACTTCTACA TCACGAACCC GAATCTCGGG CGGTCCGTCA GTGCTGAATG GCTGCAGGAC CAACTGCGAC TGTATGAGGG GGAGCGCGAT GGCACCTTCC AGAAGTTCCT CGCGAAGCAC CTGAATATCG AAATCGGCAT AAACCTCCGG ACTGATCGGT GGGCCGGCGC CGATTTCTGG ATAAGCGCCG GACTTCCGGA GCGTGTGGAG CTGTTCGACC TACTAGAACA ATGCGAGGTG ATCGCAGCCG GAATCGACGG TGGCGGGCTT GATGACTTGC TCGGGCTGGG GGCCGTCGGG CGCATGTGCG GATCGCGAAA CTGGCTCGCG TGGGCGCACG CTTGGGCGCA TCCGTCTGTA CTGGAACGCC GGAAAGAGAT TGCTCCGGCA CTTCACGATT TTGAGAAGGC GGGTGATTTG ACGATCGTCT CCCGGATCGG TGATGACGTT GTGCAGGCTG CCGAGTATGT CGCGCGTATC GAGCGCGCGG GGCTTCTGTA CAAAGCCGGT GTCGACCCGG CTGGCATCGG CGCCGTTCTC GATGCGCTGG CGGCTGCAAA GGTGCCGGAA GACAAAGTGA TCGGCATTTC GCAGGGCTGG AAGCTCTCCG GGGCTATCAA AACGACGGAA CGGCGCATCG CGGCGGCATC GGGACAGCGA ATCGATGGCG ATGAATCCCC GGATGGCGCG CTATATCACG GCGGCCAGCC TCTGTTGACG TGGGCCGTCG GAAACGCGCG TGTCGTGCCG GTCGGTAACG CCGTGAATAT CACCAAGCAG GTGAGCGGGA CGGCCAAGAT CGACCCGCTG ATGGCACTGT TTAATGCGGT TTCGCTTATG GGGCTCAATC CGCCAGCGCA AGGCCAATCG GTCTACGAAA CGCGCGGCAT TCGTTTTCTC TGA
|
Protein sequence | MEWTTACPDW ETRLIERRSI IPPPIFRDPA EHALAIFKQL KVVDLPKVWD AEIEEWRPPT FGECSEEWVF DFVRAIFGGC DPETGKQLIR EYGLLISKKN TKSTIAAGIM LTALILCWRE EEEHLILAPT KEVADNSFKP AAGMIRTDEE LSALFHIQDH IRTITHRVNR NSLKVVAADT DTVSGKKSGR ILVDELWLFG KRANAAAMFL EALGGQVSRD EGWVIFLTTQ SDDPPAGVFR QKLNYWRDVR DGKINDPKTL GILYEFPAAM VMAKAYMRPE NFYITNPNLG RSVSAEWLQD QLRLYEGERD GTFQKFLAKH LNIEIGINLR TDRWAGADFW ISAGLPERVE LFDLLEQCEV IAAGIDGGGL DDLLGLGAVG RMCGSRNWLA WAHAWAHPSV LERRKEIAPA LHDFEKAGDL TIVSRIGDDV VQAAEYVARI ERAGLLYKAG VDPAGIGAVL DALAAAKVPE DKVIGISQGW KLSGAIKTTE RRIAAASGQR IDGDESPDGA LYHGGQPLLT WAVGNARVVP VGNAVNITKQ VSGTAKIDPL MALFNAVSLM GLNPPAQGQS VYETRGIRFL
|
| |