Gene Information |
Locus tag | BMASAVP1_A2231 |
Symbol | |
ID | 4681298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei SAVP1 |
Kingdom | Bacteria |
Replicon accession | NC_008785 |
Strand | + |
Start bp | 2215519 |
End bp | 2218176 |
Gene Length | 2658 bp |
Protein Length | 885 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639846495 |
Product | sensory box histidine kinase |
Protein accession | YP_993543 |
Protein GI | 121598766 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2202] FOG: PAS/PAC domain [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
| |
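The length fields above follow from the coordinates. The sketch below is a minimal check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2215519
end_bp = 2218176

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")
print(f"Whole codons: {codons} (leftover bases: {remainder})")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (2658 bp) and Protein Length (885 aa) rows.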
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATCAT TTCTCGCAAT GCGGAATATT GCTGCGCCGC ACCGACTCGA CCCGCGTTTG CCATGCGTCG TGTATCGACA ATTCCGCTTT CGGTTCAACC TTTCTTGTAC GCCTTTTGTG CAGGACGCGC TGAGCTACAA TCCTGCCATG TTGACCGATC GGCTTTTCGC TCGCTCGGCG CGACCGTCGG GCTCGCCTGC GGAGTCGCAG CCGTCCCGCT GGCACCACGG ACCGTGGTGG TCCAACTCTT ACCTGCTCAC CCCCCTGCTG TCGATCCTCG TCTTTCTCGT GGTGATGAGT CTCATTCTGT GGAGCCTCAA TCGCCGCGAG GAGCAGCAGC AGGAAGACAC GCTGTTTCGC AACGTCGCGT GGGCGCAGCA GCAGATCCGC CTGTCGATGA CGAGCGCGCA GGAACAGCTC CAGGCGTTCT CGCGCGACAT CGCCGCGGGC CGCATCGACG AGCATGCGTT CCAGGCGACG GTGGGCGACG TGATGCAGGC GCACCCCGAG ATCCTCTATC TGAACTGGTA CACGTCGCCC GGCACGAAAC GCTGGCCGAC GATGCAATTG CCGCTCCTCG GCCAGCGGCT CGCGAAGCCG AACGACGCGC AGATGGACGA AGTCGTGCGC GGCGCGTACG CGCAGGCGCG CGGCACGCGC CGCCAGTCGT ACTCGCCGCT CGTCTACGAC GACTTCGGCA ACGGCTACCT GACGCTGCAA ACGCCCGTGA TCCGCGAGCG CGAGTATCTC GGCTCGATCG CCGCGGTGTT CTCGGTCGAA GGCATCCTGA AGCACGACAT CCCGCCCGAG CTGTCCGCGA AATACAAGAT CTCGATCACC GACGCGAACA ACCGCGAGCT CGCATCGACG TCGTCGCGCC CGCGGCTGCC GCGCGACGCG CATTACGACC TGCCGCTCGA CCCGCCCGGC CAGGGCCTCA CGGTACGCGT CTACGCGTAC CCGCAAACGA CGAACCTGAC CAACAACACG CTCGTATGGC TCGTCGCGGG CCTGTCGTGC TTCGTGCTGT GGAGCCTCTG GAGCTTGTGG AAGCACACGC GGCAGCGCTT CGAGGCGCAG CAGGCGCTGT ACGCCGAGGC GTTCTTCCGC CGCGCGATGG AGAATTCGGT GCTGATCGGC ATGCGCGTGC TCGACATGCA CGGCCGGATC ACGCACGTGA ACCCGGCGTT CTGCCGGATG ACGGGCTGGG ACGAAAGCGA CCTCGTCGGC AAGACCGCGC CGTTCCCGTA CTGGCCGCGC GACGCTTACC CGGAAATGCA GCGCCAGCTC GACATGACGC TGCGCGGCAA GGCGCCTTCG TCCGGCTTCG AGCTGCGCGT GCGCCGCAAG GACGGCTCGC TCTTTCACGC ACGCCTGTAC GTATCGCCGC TCATCGACAG CGCCGGCCGG CAGACGGGCT GGATGTCGTC GATGACCGAC ATCACCGAGC CCAAGCGCGC GCGCGAGGAG CTCGCGGCCG CGCACGAGCG CTTCACGACA GTGCTCGAGA GCCTCGACGC CGCGGTGTCG GTGCTCGCCG CGGACGAAGC CGAGCTGCTG TTCGCGAACC GCTACTACCG GCACCTGTTC GGCATCCGCC CGGACGGCCA CCTCGAACTG TCGGGCGGCG GCTTCGACAC CGCGCAGGCG TCGTCCGATT CGATCGACAT GGTCGACGCC TACGCCGGCC TGCCCGCCGC GGCGCTCACC GAGAGCACGG CGGACGCGCA GGAGGTGTAC GTCGAGAGCA TCCAAAAGTG GTTCGAGGTG CGCCGCCAGT ACATCCAGTG GGTCGACGGC 
CACCTCGCGC AGATGCAGAT CGCGACCGAC ATCACGACGC GCAAGAAGGC GCAGGAGCTC GCGCACCAGC AGGAAGAAAA GCTGCAGTTC ACGAGCCGGC TGATGACGAT GGGTGAAATG GCGTCGTCGA TTGCACACGA ACTGAACCAG CCGCTCGCGG CGATCAACAA CTACTGCTCG GGCACGCTCG CGCTCGTGAA GAGCGGCCGC GCGTCGCCGG AGACGCTCGC GCCCGCGCTC GAGAAGACCG CGCAGCAGGC GCTGCGCGCC GGGATGATCG TCAAGCGGAT CCGCGAGTTC GTCAAGCGCA GCGAGCCGAA GCGGCAGCCG TCGCGGGTCG CCGACATCGT CGCCGACGCG GTCGGGCTCG CCGAAATCGA GGCGAGAAAG CGCCGGATTC GGATCGTCAC CGAAATCCGC GCAAGAATGC CTATTATTTA TGTCGACCCC GTGTTGATCG AGCAGGTGCT CGTGAACCTG ATGAAGAACG CGGCCGAGGC GATGCAGGAG GCGCGGCCGC AGGCGGAGAA CGGCGTGATC CGCGTCGTCG CCGACCTCGA GGCGGGTTTC GTCGACATTC GCGTGATCGA CCAGGGTCCG GGCGTCGACG AGGCGACGGC CGAACGCCTG TTCGAACCGT TCTACAGCAC CAAGTCGGAC GGGATGGGCA TGGGGCTCAA TATCTGCCGC TCGATCATCG AATCGCATCG CGGGCGTCTG TGGGTGGTCA ACAACGTCGA GCCGGACGGC CTCGTGTCGG GCGCGACGTT TCACTGCAGC CTGCCCATTG GGGAACCGGA GGATCTCGGT CGCGGATCCG AGACATCGCC ATCACAAACC GTAACGGGAG AGATATGA
|
Protein sequence | MRSFLAMRNI AAPHRLDPRL PCVVYRQFRF RFNLSCTPFV QDALSYNPAM LTDRLFARSA RPSGSPAESQ PSRWHHGPWW SNSYLLTPLL SILVFLVVMS LILWSLNRRE EQQQEDTLFR NVAWAQQQIR LSMTSAQEQL QAFSRDIAAG RIDEHAFQAT VGDVMQAHPE ILYLNWYTSP GTKRWPTMQL PLLGQRLAKP NDAQMDEVVR GAYAQARGTR RQSYSPLVYD DFGNGYLTLQ TPVIREREYL GSIAAVFSVE GILKHDIPPE LSAKYKISIT DANNRELAST SSRPRLPRDA HYDLPLDPPG QGLTVRVYAY PQTTNLTNNT LVWLVAGLSC FVLWSLWSLW KHTRQRFEAQ QALYAEAFFR RAMENSVLIG MRVLDMHGRI THVNPAFCRM TGWDESDLVG KTAPFPYWPR DAYPEMQRQL DMTLRGKAPS SGFELRVRRK DGSLFHARLY VSPLIDSAGR QTGWMSSMTD ITEPKRAREE LAAAHERFTT VLESLDAAVS VLAADEAELL FANRYYRHLF GIRPDGHLEL SGGGFDTAQA SSDSIDMVDA YAGLPAAALT ESTADAQEVY VESIQKWFEV RRQYIQWVDG HLAQMQIATD ITTRKKAQEL AHQQEEKLQF TSRLMTMGEM ASSIAHELNQ PLAAINNYCS GTLALVKSGR ASPETLAPAL EKTAQQALRA GMIVKRIREF VKRSEPKRQP SRVADIVADA VGLAEIEARK RRIRIVTEIR ARMPIIYVDP VLIEQVLVNL MKNAAEAMQE ARPQAENGVI RVVADLEAGF VDIRVIDQGP GVDEATAERL FEPFYSTKSD GMGMGLNICR SIIESHRGRL WVVNNVEPDG LVSGATFHCS LPIGEPEDLG RGSETSPSQT VTGEI
|
| |
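The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The sketch below is one possible approach, not the database's own tooling: it strips the spacing, computes a GC fraction, and translates codons. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first three blocks of the gene sequence are used for brevity. The codon table is the standard amino-acid assignment, which translation table 11 shares; table 11 differs in its permitted start codons, which this sketch does not model.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate whole codons, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
fragment = clean("ATGAGATCAT TTCTCGCAAT GCGGAATATT")
print(len(fragment), "nt")
print("GC fraction of fragment:", round(gc_fraction(fragment), 3))
print("Translation:", translate(fragment))
```

To process the full record, pass the complete gene sequence string to `clean` in place of the three-block fragment; the translated fragment corresponds to the first block of the protein sequence.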