Gene BURPS668_2021 details

Gene Information

Locus tag: BURPS668_2021
Symbol:
ID: 4884918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 2014643
End bp: 2017891
Gene Length: 3249 bp
Protein Length: 1082 aa
Translation table: 11
GC content: 67%
IMG OID: 640127949
Product: Hpt sensor hybrid histidine kinase
Protein accession: YP_001059056
Protein GI: 126440276
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
        [COG0784] FOG: CheY-like receiver
TIGRFAM ID:
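
The length fields in the Gene Information table can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the original record. It assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2014643
end_bp = 2017891
gene_length_bp = 3249
protein_length_aa = 1082

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is counted in the gene
# length but not in the protein length, hence the "- 1".
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)        # True
print(computed_protein_length == protein_length_aa)  # True
```

Under these assumptions both reported lengths agree with the start and end positions.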


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGGAC TTCTGCAAGA GCTCGACGGA TCGCCGCTGA GGAAGTTCTA TTCGCTCGAG 
TCGAATCTGA AGCGCGAGCG GCGGGTCTTC ACGATCGTCA TCGTGCTGCT CGTCTGCGCG
GCCCTCAGCA TCGCGGCCAT GACCGTCACC GGCTTGTTCC AGACCGCTTT CCGGCAGGAG
GAGCAATCCG CGCGCATCCA CGAAAAGGAA GTGGTCGACG TGTTCCTGCA GCGCCGCATG
ATGTTGACGA CGGCAAGCCT CGTGCTGCAA CTGCGGATGA ACGGCGCGCC TTCGGCGCTG
AACGTGCCGG CGCCGAACGC GTGCACGCCG ATGGCCCACA ATGTGCGCGA CGATGCGATC
CTGCGCGAGA GCTGCGATTA CACGGTGCAG TTGCTGGCCA ACTCGGGGCA GACGCCGAGC
GTCGAAATGG TGACGGCCGA CGGTTCGGTC GGCTATGGAT ATCTGTTTCC GACGGGCGAC
CTGAGCGCGC TACGCTCCAG CACGCCGTCC GAACTCGTGT CGGCCGTGCT CGAGCGCTAC
GGCAAGCGCG GCCTGGACCC GCTGGAGGCC GCGCGCAAGA AGCGGATTCT CTGGTTCGCG
GTGGGCCGCG GCGGGCGCGG CGAGGAGCTG CATATGATCG GCGCGTCAGT CGTGTTCAAG
GACGAGCGGC TCTACGCGCT CGTCTTGACG AGCGTGGATC TTCACAGCCT CGTTTCGCCG
ATCGAGCGCG CCGGCCGCGT GCAGCAGCCG GTCGTCGTGG ATTCCGAGGG CGTGCCGCTC
GTGAACGCGG ACGACGCGGA AACGGTCCGG AAGGTCGACG GGCGGCTCGC CGGACAACAG
GATGGCCTGT ATCACTGGAT TCCCGGCTTC GGGTGGGCCC TACGCCGTCC CGCGCCGTTT
TCCGGTTTCG GGCACATGAC GTATCTGCTT CCGCTCGATC TGCAGTTGCG CTCGATGCGC
TACGAGTTGA GCCTCGTCGG CGGCGCGACG CTCGTGCTGA TCGTGTTGCT GTTCGTCGCG
TTCCGGTACT GGAATTACCG GTTCTTGACG CGCATCTACG AGGAAGCGTC GCGCGCGCTC
GAGAGCGAAA TGCTCAACCA TCTGCTGGTT CATGCGACGC CGGTCGGGTT GTGCATCGTG
CGGCGGGCGA CACTGGAGAT CGTCGTCGCC AACCCGATCG CGCGCACGAT GCTCGGCTTG
CGGCTGTCGG ATCGGCACCT GCCGCAGGAA TTGCTGAGCG CGTTCGAATC GTCGCTGGCC
GAGCAGGACA CCCAGTCGGA CGACGCGCGC ATTTTCCAGT TCCCGTTCAC GCTGTCGCGC
GCCGGGCATG CGGCGGTCCA TATCGAAATC ACGTACGCGC CCGCGATGCT GAACGCGCGG
GAGGTGTTCT TTTGCGCGAT CACGGACATG ACGGCGCACC ACCAGGCGGA GATCCTGCTG
CGCGAGGCGA AGCTGACGAG CGACGCGGCG GCCAAGGCGA AGGTGGCGTT CTTCGCATCG
ATGAGCCATG AAATCCGCAC GCCGCTGTCG TCGCTCGTGG GCAACATCGA GCTGATCGCG
CGCGGGCCGC TCGCGCCCGA GCAGCAGGCG CGCGTGAAGG CGATGGAGAC GTCGGCGCGC
GGCTTGATGC AGATCGTCAA CGATGTGCTC GATTTCTCGA AGATCGACGT GGGCGAGCTG
AGCCTCATGG AGGAGTGGTC GAACATCGCC GAGCTGCTCG ACCGGCTCGC GCTCTCGCAC
GCGCCGCTCG CGACGCAGCA GGGTTTGAAG TTCTACATGG TGTTCGATCG CAGCCTGCCC
GCGCGGCTCT ACTTCGATCC GATCCGGGTC TCGCAGATCG TGAACAATCT GCTGAGCAAC
GCGCTGAAGT TCACGCCGTC CGGCAAGATC GTGCTGCGCG CCGGCTGGCG TGCCGGCGCG
CTCGAAATCA GCGTGACGGA TTCCGGCATC GGCATCCCCG ATGACCTGAA GCACCGCCTC
TTCCTGCCTT TCACGCAGGG CGACAGCAAC CGGCTGCGGC AGGCACGCGG CACCGGCCTC
GGATTGTCGA TCTGCGCGCG TCTGTGCGAG CTGATGAAAG GGCGCATCGA TCTGGAAAGC
ACTGTCGGCG TGGGAACCCG GATCGCGGTG ACACTGCCGC TCGGCGTGTC GGAGGCCGAT
TCGAGCGATG CGTACTGGAC GCTTCCGTAT CGGCGCGTGG CCGTGCTCGG TCGCGCACAG
GAAAATCTCG AGTGGCTGGC CAACCTGTTC GACCCGGGCG TCACGGCCGT GACGGCTTTC
TCGCGGCCGG CCGAGCCGAT CGATGCGCAC GCGCACGATT TCCTGATGGT CACCGACGAA
TTCTCGCCGG CCGAGGTGCT GCCGTGGTGG AGGCGGCCGG ACTCGATCGT GTGGGTCGGG
CAGGCCGGCC CGCTCGTGCC GAGACGGCGC GACGACGGCG GAGTGGAAAT CAGCATGTAT
AGCCTCGCGG GGCTGAAATC CGCGACTCAC ATGCTCGCGG CCGGCCGCAC GGCGCTCGCC
GAAGCGGGGC ACGAGCCGCC GGGAGCCGAG GCGGGAATGA CGGTGCTGAT CGCCGAGGAC
AATCTGCTCA ACCGCAGCCT GCTGCTCGAT CAGCTGACGA CGCTGGGCGT GCGGGTCATC
GAGGCGAAGA ACGGCGAGGA GGCGCTCGCG TTGCTGTTGA AGGAGCCGGT GGACGTCGTG
ATGACCGACA TCGACATGCC GATGATGGAC GGTTTCCAGT TGCTCGCCGA GATGAGGCGG
CTCGGCATGA CGATGCCGGT GTACGCGGTG AGTGCGAGCG CGCGGCCGGA AGATGTGGCG
GAAGGCCGGG CGCGCGGCTT TACCGACTAT CTCGCGAAGC CGGTTTCGCT CGAGCGGCTC
GAGACGGTGG TACGCGCATG TTGCAGCGCG CCGGCGGGCG CGCGCGCCGA CGAAGACGCG
CAGGACGAAC TGCCGGGCCT ACCCGACGTG CCGCCCGCCT ATGCGAGCGC GTTCGTCGCG
CAGGCCGGCA GCGAAATCGC GGAATTCGAC GCGATCCTGC GCGAACGCGC GCTGCCGAAA
CTGCGGCGGT GGCTGCACGG CGTATCGGGC GGCATCGCGG TCCTCGGGCC TTCCGCGCTG
CATGAGCAAT GCCAGGAGCT TCGAGCCTAC GCGCGCGAAT CCGGCGAATG GAATCGCGAA
ATCGAACTGC AGGCGCTGGC CATTCGGAAC GCGCTGGAGC GAATGGTCGC GGCGCTGACG
AGCGCGTGA
 
Protein sequence
MQGLLQELDG SPLRKFYSLE SNLKRERRVF TIVIVLLVCA ALSIAAMTVT GLFQTAFRQE 
EQSARIHEKE VVDVFLQRRM MLTTASLVLQ LRMNGAPSAL NVPAPNACTP MAHNVRDDAI
LRESCDYTVQ LLANSGQTPS VEMVTADGSV GYGYLFPTGD LSALRSSTPS ELVSAVLERY
GKRGLDPLEA ARKKRILWFA VGRGGRGEEL HMIGASVVFK DERLYALVLT SVDLHSLVSP
IERAGRVQQP VVVDSEGVPL VNADDAETVR KVDGRLAGQQ DGLYHWIPGF GWALRRPAPF
SGFGHMTYLL PLDLQLRSMR YELSLVGGAT LVLIVLLFVA FRYWNYRFLT RIYEEASRAL
ESEMLNHLLV HATPVGLCIV RRATLEIVVA NPIARTMLGL RLSDRHLPQE LLSAFESSLA
EQDTQSDDAR IFQFPFTLSR AGHAAVHIEI TYAPAMLNAR EVFFCAITDM TAHHQAEILL
REAKLTSDAA AKAKVAFFAS MSHEIRTPLS SLVGNIELIA RGPLAPEQQA RVKAMETSAR
GLMQIVNDVL DFSKIDVGEL SLMEEWSNIA ELLDRLALSH APLATQQGLK FYMVFDRSLP
ARLYFDPIRV SQIVNNLLSN ALKFTPSGKI VLRAGWRAGA LEISVTDSGI GIPDDLKHRL
FLPFTQGDSN RLRQARGTGL GLSICARLCE LMKGRIDLES TVGVGTRIAV TLPLGVSEAD
SSDAYWTLPY RRVAVLGRAQ ENLEWLANLF DPGVTAVTAF SRPAEPIDAH AHDFLMVTDE
FSPAEVLPWW RRPDSIVWVG QAGPLVPRRR DDGGVEISMY SLAGLKSATH MLAAGRTALA
EAGHEPPGAE AGMTVLIAED NLLNRSLLLD QLTTLGVRVI EAKNGEEALA LLLKEPVDVV
MTDIDMPMMD GFQLLAEMRR LGMTMPVYAV SASARPEDVA EGRARGFTDY LAKPVSLERL
ETVVRACCSA PAGARADEDA QDELPGLPDV PPAYASAFVA QAGSEIAEFD AILRERALPK
LRRWLHGVSG GIAVLGPSAL HEQCQELRAY ARESGEWNRE IELQALAIRN ALERMVAALT
SA
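
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is a hedged example, not the database's own pipeline. The codon table is built from the standard NCBI amino-acid string, which translation table 11 shares with the standard code for sense and stop codons (the two differ only in permitted start codons). The function name `translate` and the variable names are illustrative. Only the first three and the last 10-base blocks of the gene sequence are used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final block of the gene sequence above.
first_blocks = "ATGCAAGGAC TTCTGCAAGA GCTCGACGGA"
last_block = "AGCGCGTGA"

print(translate(first_blocks))  # MQGLLQELDG
print(translate(last_block))    # SA
```

The two results match the first ten residues and the last two residues of the protein sequence, with the final TGA codon acting as the stop.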