Gene BURPS668_0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_0041 
Symbol 
ID4884536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp38076 
End bp39455 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content72% 
IMG OID640125969 
Productsensor histidine kinase 
Protein accessionYP_001057096 
Protein GI126439832 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCACA GCCTGCGCGG CCGACTGCTT TGGTGGCTGC TGCTGCCGCT CGCCGTGTTC 
GTCGCGATCG CGGGCGCGAT GTCGTACGAC ACCGCGCGCA AGACGGCCGA TCTCGTGCAG
GACGGCGCGC TCGTCGCGTC CGCGCGCGTG ATCGCCGAGG ACGTCGATTG GGAAGGCGGC
GCGCTCGTCG CGAACGTGCC GCCCGCCGCG CTCGAGCTGT TCGCATCGCC CGCGCAGGAT
CACGTGTACT ACAAGGTGCG CACGGGCGGC GGGCGGCTGC TCGCCGGCAA TCCCGATCTC
GACGGCCCGG CCGCGCCGGC CGCGTCCGGC GCGCAGCCGG TGCTGTTCGA CACGGCGCTC
GGCGGGCTCG CGATTCGCGC GGTGGCATAC ACGCGCGAGC TGTACAACGC GGGCAACACG
GAAACGGTGA CGGTTGTCGT CGGCAAGACG CAGACCTCGC GGCAGATGAT GATCGCGGCG
ATCTGGCATC CGCAGCTCTG GCGGCTCGCG CTGATGCTCG CGCTCGCGAT GGCGCTCGTC
TATCTCGGGC TCACGTTCGA GCTGCGGCCG TTGATGAAGC TGAAAGAAGA CGTCGCGGAC
CGCGGGCCGA TGGAGCTCGA GCCGATCCGC ACCGAGCGGC TGCATTTCGA GCTGCGGCCG
ATCGTCGACG CGATCAACCA GTGCATCGCG CGGCTGAACC TGCACGCGGC GACGCAGCGA
CGCTTCATCG CCGACGCCGC GCACCAGCTA CGCACGCCGA TCGCGGTGAT CGACACGCAG
ATCCAGTGCG CGCGGCAGCG CGAGAACGGC GACGCGGCGC TCGCCGCGCT GCTCGCGTCG
ATGCAGCGCA GCAGCCGCCG GATGGCGGAC GTCACCGACA AGCTGCTGCT GCTCGCGCAC
GCGGAAGCCG CGTCGCCCGC GCGGCTCGCC GCGCGCGTCG ACATCGCGGC CGTCGTGTCG
GGCGTGCTCG AGGAGGCGAT CGTGCTCGCC GAGCGGCGGC GCATCGATCT CGGCGCGGAG
CTCGACGACG ATCTGCAGGT GGCCGGCAGC GAAAGCCTGC TGTCGGCGCT GCTGATGAAT
CTCGTCGACA ACGCGGTGCG CTATGCGCAC GAAGGCGGAC GCGTGACGGT GAGCGCGCGG
CGCGACGGCG ACGCGGTGGT GCTCGAGGTC GTCGACGACG GCCCGGGCAT CCCGGCCGAG
GCGCGGCCGC ACGTGTTCAA GCGCTTCTAT CGCGTCGCGA GGGACGAGGA AGGCACGGGC
CTCGGGCTCG CGATCGTCGA GGAGATCGCG CAGTCGCACG GCGGCGCGGT GTCGCTCGCC
ACAGGCCCCG GCAACCGGGG CGTGAGGATG ACCGTGCGGC TGCCCGCCTA TCGCAATTGA
 
Protein sequence
MSHSLRGRLL WWLLLPLAVF VAIAGAMSYD TARKTADLVQ DGALVASARV IAEDVDWEGG 
ALVANVPPAA LELFASPAQD HVYYKVRTGG GRLLAGNPDL DGPAAPAASG AQPVLFDTAL
GGLAIRAVAY TRELYNAGNT ETVTVVVGKT QTSRQMMIAA IWHPQLWRLA LMLALAMALV
YLGLTFELRP LMKLKEDVAD RGPMELEPIR TERLHFELRP IVDAINQCIA RLNLHAATQR
RFIADAAHQL RTPIAVIDTQ IQCARQRENG DAALAALLAS MQRSSRRMAD VTDKLLLLAH
AEAASPARLA ARVDIAAVVS GVLEEAIVLA ERRRIDLGAE LDDDLQVAGS ESLLSALLMN
LVDNAVRYAH EGGRVTVSAR RDGDAVVLEV VDDGPGIPAE ARPHVFKRFY RVARDEEGTG
LGLAIVEEIA QSHGGAVSLA TGPGNRGVRM TVRLPAYRN