Gene BTH_II1047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II1047 
Symbol 
ID3845039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp1221666 
End bp1222991 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content61% 
IMG OID637838350 
ProductHK97 family phage major capsid protein 
Protein accessionYP_439244 
Protein GI83716436 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.144894 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCAGCTTC GCTGCATACC CTCATCAACG CTCTTCAAGG ATGACAAGTT GAGCAAAAAA 
CTTCTAATTA CCGCGCTCGC GGCAGCCCTG TCGGGCACGG CTGGTGCGGT GCCGCGCGGC
ATCATGTCCG TGCGCGCCGA GACTCCGGGC GAAATCAAAG CCCTGATCGA CAATTTGCAG
AAAGCGTTTC ACGATTTCAA GGCTGAGCAT ACGAGGCAAC TCGATGCGGT GAGAGCCGGT
CTGCCGATGT CGGACGCTAT GGCGAAGGTC GACAAGGTCA GTGCCGATCT CGAAGCACTT
CAGGCGGCCG TCGACGAGAC CAATATCAAG CTCGCTGCAG CGCAGATGGG TGCGAACGGC
GCGAAGCCGC TGCGCGACGC CGAATATACC GACGCGTTCA ACGCTCATTT CAAGCGTGGC
GACATCAACG CCGCACTTCA CAAGGGCGAG GACGGCGAGG GCGGATACCT GACGCCGATC
GAGTGGGACC GCTCGATCAC GAAGAAGCTC GTGCAGATCT CGCCGATGCG TCAACTGTGC
CGCGTTCAGT CGGTTTCGAA GGCCGGCTTT TCGAAGCTGT TCAACATGGG AGGCACGGCA
AGTGGGTGGG TCGGTGAAGC CGACGATCGC CCGCAAACGG GCACGGCCGC GTTTGCGTCG
CTCGCGTTCG GGCACGGCGA GATTTACGCG AATCCCGCCG CGACGCAGGG CATTCTCGAC
GACAGCGAAA TCGATCTCGA ATCGTGGCTC GCCGAAGAGG TGCAAACTGA ATTCGCGAAG
CAGGAGGGAC GGGCATTCCT CGCCGGTGAT GGTACGAAGA AGCCGACCGG CATTCTTACG
TACGTGGACG GCGGCGCAAA CGCGAAGAAG CATCCATTCG GGGCGATCGG GGTGGTGAAC
AGCGGCGCTG CCGCGGGCAT CACGTCGGAC GGCATCATCG ATCTGATCTA CGATCTGCCG
AGCGCGTTCA CGGGCAACGC GCGCTTCACG ATGAACCGCA ATACGCAACG AGCGGTACGC
AAGCTGAAAG ACGGCCAAGG CAATTACCTG TGGCAACCGT CGTACGTTGC TGGCCAGCCG
GCGACGCTCG CGGGCTACCC CGTGACGGAA GTGCCCGACA TGCCGGATGT CGCGGCGAAC
TCAACGCCGA TTCTCTTCGG CGACTTCATG CAGACGTATC TGATCATCGA TCGCATCGGC
GTGCGCGTGC TGCGCGATCC GTATACGGCG AAACCGTACG TCCTGTTCTA TACGACGAAG
CGCGTCGGTG GCGGCCTGCT TAATCCGCAG CCGATGCGCG CGCTGAAGGT GGCGGTGAGC
GCGTAA
 
Protein sequence
MQLRCIPSST LFKDDKLSKK LLITALAAAL SGTAGAVPRG IMSVRAETPG EIKALIDNLQ 
KAFHDFKAEH TRQLDAVRAG LPMSDAMAKV DKVSADLEAL QAAVDETNIK LAAAQMGANG
AKPLRDAEYT DAFNAHFKRG DINAALHKGE DGEGGYLTPI EWDRSITKKL VQISPMRQLC
RVQSVSKAGF SKLFNMGGTA SGWVGEADDR PQTGTAAFAS LAFGHGEIYA NPAATQGILD
DSEIDLESWL AEEVQTEFAK QEGRAFLAGD GTKKPTGILT YVDGGANAKK HPFGAIGVVN
SGAAAGITSD GIIDLIYDLP SAFTGNARFT MNRNTQRAVR KLKDGQGNYL WQPSYVAGQP
ATLAGYPVTE VPDMPDVAAN STPILFGDFM QTYLIIDRIG VRVLRDPYTA KPYVLFYTTK
RVGGGLLNPQ PMRALKVAVS A