Gene BTH_II1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II1044 
Symbol 
ID3844871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp1217844 
End bp1219586 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content60% 
IMG OID637838347 
Productphage terminase, large subunit, putative 
Protein accessionYP_439241 
Protein GI83717767 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.176224 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATGGA CTACGGCATG CCCTGATTGG GAAACGCGGC TGATCGAGCG CCGATCAATC 
ATTCCACCGC CAATCTTCCG TGATCCGGCC GAGCATGCTC TGGCGATCTT CAAGCAACTG
AAGGTGGTGG ATCTCCCGAA GGTATGGGAC GCTGAGATCG AGGAATGGCG GCCGCCCACG
TTTGGCGAAT GCTCCGAAGA GTGGGTCTTC GACTTCGTGC GCGCCATCTT CGGCGGTTGC
GATCCCGAGA CAGGTAAGCA GCTGATCCGT GAGTATGGGC TGCTGATCTC GAAAAAGAAC
ACGAAATCGA CTATCGCCGC CGGCATCATG CTGACGGCGC TCATTCTGTG CTGGCGCGAG
GAAGAAGAAC ATCTGATCTT GGCACCGACA AAGGAGGTCG CAGATAACAG CTTTAAGCCG
GCAGCGGGGA TGATTCGGAC TGACGAAGAG CTGTCCGCGT TGTTCCACAT CCAAGATCAC
ATTCGCACGA TCACGCACCG CGTGAACCGA AATAGCCTGA AGGTCGTCGC TGCCGACACG
GACACCGTTT CCGGTAAGAA GTCAGGCCGT ATTCTGGTCG ACGAGCTTTG GCTGTTTGGC
AAGCGCGCGA ACGCTGCGGC AATGTTTCTC GAAGCGCTGG GCGGTCAGGT GTCGCGAGAC
GAGGGTTGGG TCATTTTCCT GACCACTCAG TCCGACGACC CGCCGGCCGG TGTGTTCCGA
CAGAAGTTGA ATTACTGGCG TGACGTGCGG GACGGGAAGA TCAACGACCC GAAGACGCTA
GGCATCCTGT ACGAGTTTCC TGCGGCGATG GTGATGGCGA AAGCCTACAT GCGGCCGGAG
AACTTCTACA TCACGAACCC GAATCTCGGG CGGTCCGTCA GTGCTGAATG GCTGCAGGAC
CAACTGCGAC TGTATGAGGG GGAGCGCGAT GGCACCTTCC AGAAGTTCCT CGCGAAGCAC
CTGAATATCG AAATCGGCAT AAACCTCCGG ACTGATCGGT GGGCCGGCGC CGATTTCTGG
ATAAGCGCCG GACTTCCGGA GCGTGTGGAG CTGTTCGACC TACTAGAACA ATGCGAGGTG
ATCGCAGCCG GAATCGACGG TGGCGGGCTT GATGACTTGC TCGGGCTGGG GGCCGTCGGG
CGCATGTGCG GATCGCGAAA CTGGCTCGCG TGGGCGCACG CTTGGGCGCA TCCGTCTGTA
CTGGAACGCC GGAAAGAGAT TGCTCCGGCA CTTCACGATT TTGAGAAGGC GGGTGATTTG
ACGATCGTCT CCCGGATCGG TGATGACGTT GTGCAGGCTG CCGAGTATGT CGCGCGTATC
GAGCGCGCGG GGCTTCTGTA CAAAGCCGGT GTCGACCCGG CTGGCATCGG CGCCGTTCTC
GATGCGCTGG CGGCTGCAAA GGTGCCGGAA GACAAAGTGA TCGGCATTTC GCAGGGCTGG
AAGCTCTCCG GGGCTATCAA AACGACGGAA CGGCGCATCG CGGCGGCATC GGGACAGCGA
ATCGATGGCG ATGAATCCCC GGATGGCGCG CTATATCACG GCGGCCAGCC TCTGTTGACG
TGGGCCGTCG GAAACGCGCG TGTCGTGCCG GTCGGTAACG CCGTGAATAT CACCAAGCAG
GTGAGCGGGA CGGCCAAGAT CGACCCGCTG ATGGCACTGT TTAATGCGGT TTCGCTTATG
GGGCTCAATC CGCCAGCGCA AGGCCAATCG GTCTACGAAA CGCGCGGCAT TCGTTTTCTC
TGA
 
Protein sequence
MEWTTACPDW ETRLIERRSI IPPPIFRDPA EHALAIFKQL KVVDLPKVWD AEIEEWRPPT 
FGECSEEWVF DFVRAIFGGC DPETGKQLIR EYGLLISKKN TKSTIAAGIM LTALILCWRE
EEEHLILAPT KEVADNSFKP AAGMIRTDEE LSALFHIQDH IRTITHRVNR NSLKVVAADT
DTVSGKKSGR ILVDELWLFG KRANAAAMFL EALGGQVSRD EGWVIFLTTQ SDDPPAGVFR
QKLNYWRDVR DGKINDPKTL GILYEFPAAM VMAKAYMRPE NFYITNPNLG RSVSAEWLQD
QLRLYEGERD GTFQKFLAKH LNIEIGINLR TDRWAGADFW ISAGLPERVE LFDLLEQCEV
IAAGIDGGGL DDLLGLGAVG RMCGSRNWLA WAHAWAHPSV LERRKEIAPA LHDFEKAGDL
TIVSRIGDDV VQAAEYVARI ERAGLLYKAG VDPAGIGAVL DALAAAKVPE DKVIGISQGW
KLSGAIKTTE RRIAAASGQR IDGDESPDGA LYHGGQPLLT WAVGNARVVP VGNAVNITKQ
VSGTAKIDPL MALFNAVSLM GLNPPAQGQS VYETRGIRFL