Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_0743 |
Symbol | |
ID | 4881880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 721745 |
End bp | 723628 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640126672 |
Product | sensor histidine kinase |
Protein accession | YP_001057796 |
Protein GI | 126439646 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3300] MHYT domain (predicted integral membrane sensor domain) [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.573337 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGGAA CCTTCAATCT GCCGCTCGCC GCGCTGTCGC TCGCGATCGC AACGCTTGCG TCATACACGG CGCTCGACCT GAGCGCGCTC ATCACGCTCC TCGACCAGCC CCGGATGCGC CGCGCGTGGC TCGCGGGCGG CGCGGCCGCG ATGGGCACGG GCATCTGGGC GATGCACTTC GTCGGCATGC TCGCGTTCTC GCTGCCGATC CCGCTCGGCT ACGATTTCGG CCACACGTTC GCGTCGCTCG CGATCGCGGT GGTCGTGTCG TACTTCGCGC TGAACGCGGT CACGCGCGCC GCGCTCACGC GCGAGCGGCT CGCGATCGGC GGCGTGCTGA TGGGGCTCGG CATCGCGGGC ATGCATTACA CCGGGATGAG CGCGCTGCGG ATGCAGCCCG CGATCGCCTA TGACTTCACG CTGTTCGTCG CATCGATCGC GATCGCGATC GGCGCGTCGA CGACCGCGCT GTGGATCGCG CACCGCTTGA GCAACGAGAA CGAGCCGCGC GTGCTCATGA AGCGGATCGC CGCGGCGGGC GTGATGGGGC TCGCGATCAC CGGCATGCAC TACACGGGCA TGGCGGCCGC GCATTTCGGG GCGAACGCCG TATGCGGCGC GGCGGGCGAG ATCAGCGGCG CGTGGCTCGG CGCGACGATC GCCCTCTTTA CCGTGACGAT CCTGAGCGCG ACGCTCGTCG TCTCGCGCTT CGATGCGCGC ACCGCGTTCC TGCGCGGGAT GACCGATGCG CTCGAGGAAC TCGTCGCGAA GCGCACGAGC GAGCTCGAGG GCGCGCTGCG GCAATACGAG CGCACGACGC ACGTGCTGCA GCGCACGCGC CGGAAAATGG AGCAGGAGAT CGACGAGCGC AAGGCCGCGC AGGCGCGCCT CGAGCACGAG AAGGACGAGC AGCGCCGCCT GATCCGGCGG CTCGAGGAAA CGCACGTGCA GTTGCTGCAA TCGGAGAAGC TCGCGTCGAT CGGCCAGCTC GCGGCGGGCG TCGCGCACGA GATCAACAAC CCGATCGGCT TCGTCAACGC GAATCTGAAC ACGCTGAGGA GCTGGGTCCA GGGCCTGCTC GACGTGATCG CCGTGCAGGA GGCGCTGACG GGCACGCTCG CGGCCGACGC GCGCGCGCCG CTCGCCGCGG TGGCGCGCGA CATCGATCTC GATTACGTGC GCGGCGACAT CCTCGCGCTC ATCGACGAAT CGATCGACGG CGCGATGCGC GTGCGGCGCA TCGTCTGCGA CCTGCGCGAC TTCTCGCGGC CGAGCGGCGA CGCCTGGGCG TTCGCCGATC TGCACGCGAG CCTCGAGAGC ACGCTGAACG TCGTCCACAA CGAGCTCAAG TACAAGGCGG ACATCGTGCG CGAGTACGGC GTGCTGCCGC TCGTCGAATG CAACGCCGCG CAGTTGAGCC AGGTGTTCAT GAACCTGCTC GTCAACGCCG CGCAGTCGAT CGGCACGCAC GGCACGATCA CGATTCGCAC GACGCACGAC GGCGACACCG TGTCGATCTC GATCGCCGAC ACGGGCGCCG GCATACCGGA GGACGCGATC GGCCGGATCT TCGATCCGTT CTTCACGACG AAACCGGTCG GCCATGGCAC GGGGCTCGGG CTGTCGATCT CGCACGGCAT CGTCGAGCAT CATGGCGGGC GCATCGACGT CGAGAGCCAC GTCGGCCGCG GCTCGACGTT GACGGTCACG CTGCCGGTCC GGCGCAAGCC CGAGCCGGCC GCGCGAACGA TCGCGGGCGA CGGCGCGAAC AACGTGAACG ACGTGCACGC CACGAACGGC GCGAGCGCCG CGCGCGACGC GGCCCGCGCG ATCGAGCGCG CCGCCGCGTC GGCCGCGCCG CTTACGCGCG GCGCGGCTTG CTGA
|
Protein sequence | MHGTFNLPLA ALSLAIATLA SYTALDLSAL ITLLDQPRMR RAWLAGGAAA MGTGIWAMHF VGMLAFSLPI PLGYDFGHTF ASLAIAVVVS YFALNAVTRA ALTRERLAIG GVLMGLGIAG MHYTGMSALR MQPAIAYDFT LFVASIAIAI GASTTALWIA HRLSNENEPR VLMKRIAAAG VMGLAITGMH YTGMAAAHFG ANAVCGAAGE ISGAWLGATI ALFTVTILSA TLVVSRFDAR TAFLRGMTDA LEELVAKRTS ELEGALRQYE RTTHVLQRTR RKMEQEIDER KAAQARLEHE KDEQRRLIRR LEETHVQLLQ SEKLASIGQL AAGVAHEINN PIGFVNANLN TLRSWVQGLL DVIAVQEALT GTLAADARAP LAAVARDIDL DYVRGDILAL IDESIDGAMR VRRIVCDLRD FSRPSGDAWA FADLHASLES TLNVVHNELK YKADIVREYG VLPLVECNAA QLSQVFMNLL VNAAQSIGTH GTITIRTTHD GDTVSISIAD TGAGIPEDAI GRIFDPFFTT KPVGHGTGLG LSISHGIVEH HGGRIDVESH VGRGSTLTVT LPVRRKPEPA ARTIAGDGAN NVNDVHATNG ASAARDAARA IERAAASAAP LTRGAAC
|
| |