Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_5000 |
Symbol | |
ID | 5150248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 5228830 |
End bp | 5231511 |
Gene Length | 2682 bp |
Protein Length | 893 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640559781 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_001240910 |
Protein GI | 148256325 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.546026 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAAAC CAGTTGATTT CGGCACCGCC TGCAGTTCGC TCCGATCGAT GACGATCGGT CGGATGATTT TTGGCAGCTT TCTGCTGCTG CTCGCCGTGA TCACCGCCAC CAGCATGGCG AGCGTCATCG CCATTCGCCA CATCGATTCC ACCTTTGCCG AGCTGCAGCG GCTGCAGGGC GTCGGCGATC TCGCCGAGGA GATCGATCAG CGCATGAACG AGCTGCGGCT CGCGGCGCGC GATTTCGTCA CGGATCCAGC CTCGCGCTCC GACCGCGTCA TGGAGGCGGC GACCGAGCTC AGCGCCTTGT TGAAGAAGAC GCGCCTGCAG CTCGCGCCGG AGCAGCAGGC GACGATCGAT GGCGTGGCGC AGCGGCTCAG CAATTATCGC GAGGGTATCG ACCGGATCAG CGCGCTGATC GCACAGCGCG CCGAGTTGCT GTCCGGCCTG CCGGCGGCGC GCGAAGGCTT CGAGGAGGCG ATCGCGGGCG TGACCGAGCG CGACACCGCG CGGGCGCTGT TCAAGGTGCA GAACCAGATC GGCGCCGCGC TGCTGGCGCG CGATCCGGCC GGCGCCGAGC AGGCGGCGCG CAGTATCCGC GCGACACCGA TCCAGGACAA GGCGCTGCGC GCGGCAGCGG ATCGTTACGC CGAGGCGATC ATCGCCATCG CCGGCACCGA GGACGAAATC GCGCGGCTCG ACAAGGAGGT GCTGGGCATC GAAGGCCGGC TGATCGGCCG CGTCACCGAG CTGCTGCGGG CGCTGAGCGC GAGCCAGGGC AAGGTCCTGG CGCGCGACTT CGCCCGCACC CTGGCGGAGA CCAAATGGCA GAGCATCATT TTCGGCACGG CCGGGGTTCT GATCGGCATT TTCGCCGCCT CCTTCGTGGT CCGCCGCACG GTGCGGCCGC TGGCCTCGAT CGCTCGCTCG ATCCGGGCGC TGGCCGCAGG CGAGAAGTCG ACGGCGATTC CGCAGACCGA CGTGCAGAAC GAGATCGGCG ACATCGCCCG CGCCGCGGAG GTGTTCCGAC AGACCTTGCA GGACGCCGAC GCCGCGCGCG AGGCGGCGGT GCGCGCGCTG GCCGAGCAGC GGCTGGCCGA GGAGAGCTAT CGCAAGCTGT TCGAGGCTTC GATCGACGGC ATCTATGTGA CCACGCCGAG CGGGCTCGTG CTCAATGCCA ATCCGGCGCT GGCGCGGATC ATGGGGTACG ATTCGGCGAG CGATCTGATC CAGAGCGTGA GCGACATCGC CGACACCATC TATGTGCACC CGCTGGCGCG CAAGCGCTAT CAGAAGCTGA TGCAGCGCGA TGGCATGGTG CGTGAGTTCG AGTACCAGGT GCGGCGGCGC AATGGCGAGA TCCGCTGGCT GTCGGACAGC GCCACCGTCG TGCGCGACGA GAGCGGCGAG GTGCTGCGCT ACGAAGGCGT CGTTCGCGAC ATCACCGATC AGAAGCGCGC CGAGACGGCC ATCGCCGAAG GCCGCCGATT GCTGCAGCAG GTCATCGACA CGGTGCCGGC GGTGATCAAC GTCAAGGATC GCGAGCTGCG CTACGTCTTG ATGAACCGCT ACATGGCCGG CATCTTCAAT GTCGAGCCGG AGGATGCGCT CGGCCGCACC ACCGGCGAAT TGATGTCGCG CTACGGCGCC AGCAAGACCG ACTCGAACGA CAAGCGCGTG CTGGCGACCA AGGCCGGCCT CGGATTCTAC GAAGAGGAGT ACCTCGATTC CTGCTGCGTG ATGCGGCAAT GGCTGGTCAA CAAGCTGCCG CTGCTGGATG CCGATGGCGA GGTCGACAAG ATCGTCACGG TGGCGCTGGA CATCGGCGAA CGCAAGAAAA GCGAGCTGGA AATGCGCAAG GCGAAGGAGG CTGCCGAAAC GGCGCTCCGG AACCTGCGCG AGACGCAGGC CTCGCTGATC GAAGCCGAGA AGCTCGCCGC TCTCGGGCGC ATGGTTGCGG GCGTCGCGCA TGAGGTCAAT AATCCCGTCG GCATCAGCCT GACGGTGGCC TCCGCGCTCG AGCGCAAGAC CGAGCGCTTC AGCGAGGCCG TGAGCCGCGG TGAACTGCGC CGCTCGAGCC TCAACGAGTT CATCGAGACC AGTCGCAACG CGGCCGGGCA ACTGGTCGCC AATCTCAATC GTGCGGCCGA GCTGATCCAG TCGTTCAAAC AGGTCGCGGC CGACCGCAAC TATTCGGACC AGCGGACCTT CGATCTCGCC GATCTCACCG AGCAGGTGAT GCTGAGCCTG CGGCCAGGCC TGCGCAAGCA GAACCTGACG CTGAACGTGA ACTGCCAGCC CGATCTGGTG ATGAACAGCT ATCCCGGCCC TTACGGCCAG GTCCTGACCA ACCTTTTCCT CAACGCCGTC GCGCATGCGT TCCCGGATGG CCGGGGAGGC ACGATCGAGA TCCAGGCCCG CGAGTCCGGA CGCGACAATG TCGAGATCAT CTTCTCCGAC AATGGCTGCG GCATGAGCCT GGACGTCCGG CGCCGCGCCT TCGATCCGTT CTTCACGACG CGGCGCGATC AGGGCGGCAC CGGCCTCGGC CTGCACATCG TCTACAACAT CGTCACCAAT CGTCTCGGCG GACGGCTCGA TCTCGAGTCC GAGCCCGGGG CGGGGACGCG CATCCAGATC GTCCTGCCGC GTGTCGCGCC GCTGGAGCAG GCGGCCGAGT AG
|
Protein sequence | MSKPVDFGTA CSSLRSMTIG RMIFGSFLLL LAVITATSMA SVIAIRHIDS TFAELQRLQG VGDLAEEIDQ RMNELRLAAR DFVTDPASRS DRVMEAATEL SALLKKTRLQ LAPEQQATID GVAQRLSNYR EGIDRISALI AQRAELLSGL PAAREGFEEA IAGVTERDTA RALFKVQNQI GAALLARDPA GAEQAARSIR ATPIQDKALR AAADRYAEAI IAIAGTEDEI ARLDKEVLGI EGRLIGRVTE LLRALSASQG KVLARDFART LAETKWQSII FGTAGVLIGI FAASFVVRRT VRPLASIARS IRALAAGEKS TAIPQTDVQN EIGDIARAAE VFRQTLQDAD AAREAAVRAL AEQRLAEESY RKLFEASIDG IYVTTPSGLV LNANPALARI MGYDSASDLI QSVSDIADTI YVHPLARKRY QKLMQRDGMV REFEYQVRRR NGEIRWLSDS ATVVRDESGE VLRYEGVVRD ITDQKRAETA IAEGRRLLQQ VIDTVPAVIN VKDRELRYVL MNRYMAGIFN VEPEDALGRT TGELMSRYGA SKTDSNDKRV LATKAGLGFY EEEYLDSCCV MRQWLVNKLP LLDADGEVDK IVTVALDIGE RKKSELEMRK AKEAAETALR NLRETQASLI EAEKLAALGR MVAGVAHEVN NPVGISLTVA SALERKTERF SEAVSRGELR RSSLNEFIET SRNAAGQLVA NLNRAAELIQ SFKQVAADRN YSDQRTFDLA DLTEQVMLSL RPGLRKQNLT LNVNCQPDLV MNSYPGPYGQ VLTNLFLNAV AHAFPDGRGG TIEIQARESG RDNVEIIFSD NGCGMSLDVR RRAFDPFFTT RRDQGGTGLG LHIVYNIVTN RLGGRLDLES EPGAGTRIQI VLPRVAPLEQ AAE
|
| |