Gene BURPS1106A_A2229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2229 
Symbol 
ID4904812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2215341 
End bp2216510 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content65% 
IMG OID640145334 
Productsensor histidine kinase 
Protein accessionYP_001076262 
Protein GI126457101 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCTGAAT CGAGTCTCGT CGACGACGCG TCGGCCGACA TCCCCTCCCT GAAGAAGGAG 
ATCGTGCGGT TGAACAAGAT CGTGCGCTCG CTGATGGATC GCGCGGAGCG CAGCACGATC
GTCTGCGGAT CGGATTTCAG CCTGTTTCAG ATGGCCGTCA CGCTCGAGGA TCAGGTGCGG
CATCGCACGC GCGAGCTGGA GGCGGCGCTG CACGAGAACC AGAAGATCAT GCACGCGCTG
CAGCGCACGC AGGCGCTGAT GGCGCAGGAG ATCGAGGAGC GCAAGAGGAC GCAGGCGGAG
CTCGAGACCG AGCGCGAGGC GCAGCGCCAT TTGATCGAGC AGCTCGCGCA GGCGCACGGG
CAACTGCTGC AATCAGAGAA GCTCGCGTCG ATCGGCCAGC TCGCGGCGGG CGTCGCGCAC
GAGATCAACA ATCCGATCGG CTTCGTCGAT TCGAACCTGC GCACGCTGAA GACATGGGTG
CGGCAATTGC TCGACGTGAT GGCGATCGAG GACGCGCTGA TCGCCGATTG CGGCGACGCC
GCGCTCGCGC GCCTGCGTGC CGCGCACGCT GAGGTCGATC TCGACTATCT GCGCGGCGAC
ATCGGAACGC TGATCGACGA ATCGATCGAA GGCGCGTCGC GCGTGCGGCG GATCGTGCAG
GACCTGCGCG ACTTCTCGCG GGCGGGCAGC GAGGAATGGA ACTTCGCCGA CGTCCACGAG
GGGCTGGAGG CGACGTTGAA CGTGTTGCGC AACGAACTGA AGTACAAGGC GGAGGTCGTC
AAGGATTACG GCGAGCTGCC GGCCGTCGAA TGCATGCCGT CGCAGTTGAA CCAGGTCGTG
ATGAATCTGC TGATGAACGC CGCGCAGGCG ATCGTCGAGC ACGGCACCAT CACGATCCGC
ACGCGCCGCG AAGGCGACGG CGTGACGATC GCGATCGAGG ATACGGGCGT CGGCATTCCG
GCGGACCGGC TCGCGAAGAT CTTCGATCCG TTCTACACGA CGAAGCCGGT CGGCAAGGGC
ACCGGGCTCG GGCTATCGGT TTCGTACGGC ATCGTCGAAA AGCACGGCGG CCGGATCACG
GTCGACAGCG AGCCGGGCAA CGGATCGCGC TTCACGATCT GGCTGCCGAT CGTCCGGCAG
CGCTCGTTGC AGGACGTGGC GGCCGGCTAA
 
Protein sequence
MAESSLVDDA SADIPSLKKE IVRLNKIVRS LMDRAERSTI VCGSDFSLFQ MAVTLEDQVR 
HRTRELEAAL HENQKIMHAL QRTQALMAQE IEERKRTQAE LETEREAQRH LIEQLAQAHG
QLLQSEKLAS IGQLAAGVAH EINNPIGFVD SNLRTLKTWV RQLLDVMAIE DALIADCGDA
ALARLRAAHA EVDLDYLRGD IGTLIDESIE GASRVRRIVQ DLRDFSRAGS EEWNFADVHE
GLEATLNVLR NELKYKAEVV KDYGELPAVE CMPSQLNQVV MNLLMNAAQA IVEHGTITIR
TRREGDGVTI AIEDTGVGIP ADRLAKIFDP FYTTKPVGKG TGLGLSVSYG IVEKHGGRIT
VDSEPGNGSR FTIWLPIVRQ RSLQDVAAG