Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_6820 |
Symbol | |
ID | 5152855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 7141607 |
End bp | 7144552 |
Gene Length | 2946 bp |
Protein Length | 981 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640561502 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_001242613 |
Protein GI | 148258028 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0915227 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCACG CCGATTTCGA GTTGCAGGCC TTGAGCGACC CCCGGCTGGC CGTCCATGCC GCGGGCAGCC TGCCGGCATG GCTGTGGTCG GCGGACGGCA GTCGCGTATT GTGGGCCAAC GCCATGGCTG CGCGGGCTCT CGGTGCAGCC AACAGTTGGG CCCTGGTCGA CAAGAGGTTC GGACCGGCCG ATCAGCACCG CCGTCAAGTG GCTCAGCTGG CGCGCCGGCT GCCGGAAAAT GGCGCCATCC GACTCGAGCG CCTGCGCGGG TTCGGCGCGC CGCTCGGCGC GCTTGCGACG TGCGGTTGCG CAATGATCGA CCTGCCGGAC GGCATGCGCG CCGTCCTGAT CACCGGCACG GACAACAGCG GCCGCAGCTG GCCGCTGACG GAGCGCCTGA GGCGGCTTGT CGAGAACAGC GGTCTCGCGC TGATCGCCTT CACCGCGCAT GGCGAGTTCG CCGGCATCAG TACAGCCGGA CGGCGCCTGT TCGGCATCCA CGACCTCAGC GCGCCGCTGT TTGCGGATGT CCGCAACGAG GCGCTGACCG CAGGCTTCGC CGAGATACAG TTCGGCGACG AGCGCGCCGT GCTCCACCGG GTCGGCGCTG GCAGCCAAAT CGGGCTGGTC GCAGTCCCGG CGCCCGCCCC GGCCGTCAAG CAAGCGGAGC CGAACGCCGA CGTGCCCGTT GACGCCGTCC CTGTCGCCGA GACGGCTGCA ACACCTCCCC ACCCCGCCAA TCCTGCCCCC GAGGTCGCGG CCGAGTGTCC GCCTCCTCCC TCGATGGATG CAATGACTGA ACGCCCGTTG GACGACGAGA CATCAGCGCA ATCGCGCGAA GCGGCTGCCG CATTCGCTCC GCTCGACGAA CCCGCCGTGC AGGCGCCGCC GGTCGTCGGC ATCGACCGCC CGGAGCCAAC GTCTTCCATC GCGGCCGAGA CGGCTGACGC TGACGCGACG GCTTTTGCGC AATCCGACCT TGCGCCGGCC GCAGATGCGC CGCCACAGCA GGACAACATC ACCGTGTCGA CCCGTGACGC GTCCGTCGCG TCGGACGCGC CTCCCATCGA CAATGGTGGG CTCGAAGCGG CGGGCAATGT CGTGCCTTTC CGCCCAGCCG GCGAAACCAA ACATCCGGCG CTGACGCCGG TGGAGAACAA TGCCTTCCAT GAGCTCGCGC GCCAGCTGTC GGCAAGGCTT GAAGGCGACA ACCGCAATGC GCCGGCGGAC GCGGCAGCGG CGACGCATGA GCCGGCGGCG CAAGTGACCG AACCATTCGA CGCGCCGCAT GTCCCCACGT CATTGCTCGG CAGCGATGCG AGCGATCAGC CCACAGCCGG GCTGCAGGCC GAGTGGCTCA CGCCCGAGCA ACCGCCCGCC CAAGGCGACA GCCAGCGTGA CCGCATCCTG CTGGATCTGC TGCCGACGGG AATGCTGATC TATCGGCTTG ACCGCCTGCT CTATGCCAAT CCGGCATTTT TCAAGCAGAT GGGCTACACC GATCTCGCGA CTCTGGAGCA GGCCGGGGGC CTCGATGCGC TCTATGTCGA ACCCGGCGCG CCCGCCGGCT CTGGCAAGCC CGAGGTCGGC ACGCCGGTGA CGATCTCGGC GGCCGACGGC TCCTCCAGCG AGGAGATCAA AGCCGAGGCG CGTCTGTTCA CGATCACCTG GGACGGCGAC AGCGCGCATG CGCTGATCTT CGCGCCCGCA CCTGGCCCGG CGGCCACCGC CGTGTCCGCT CCGGCCGAGA CCGGATCGGC CCCCGCTCCT GCACCGGCCG CACCGGCGCC GCTGGTCCTC GCGCCGCCGC CCGTCGCCGG TCATGCCGAC GCGGAAGATC TCGCCGCCAT CCTTGACGTC ACCGCGGAGG GCATCCTGAT GTTCGATGCT GGCGGCAATA TCCACGCCTG CAATCGCAGT GCCGAGGCCA TGTTCGGCTA TGACGGAACG GAGCTCGTCC GCCGCAACAT CATCGAATTG TTCGCGCCGG AAAGCGCAAG GCTGGTGCTG GATCATCTCG ACCGTGTGCA GCAGGCGGGG ACCGCCAATG TGCCGGAGCC GAGCCGCGAA GTCTTGGGCA AGGTCGCCCA GGGCGGCGCC ATCCCGCTGT CGATGACGAT CGGCCGGACC AGACAAGACG GTCCGAATTT CTTTGCCGTG TTCCGCGACC TGTCGCAGCT TCGTCGCGGT GAAAGCGAGT TGCTGCAGGC GCGGCGCGTC GCCGATCGCG CCGCCAGCGC CAAGGCCGAC ATGCTCGCGC GGATCAGCCA CGAAGTGCGG GCCCCGCTCA ACGCCATCAT CGGATTTGCC GAGGTGATGA TCGGCGAACG CTTTGGAACG CTCGGCAACA CGCGCTACGC CGAGTACATG AAGGATATCC GCGCGTCCGG CGAGCGCGTG GTGTCGCTGA TCGACGATCT GCTCGAATTG TCGCGCATCG AGACCGGCAG ACTGGAGCTT GCCTTCACCA GCCAGAATCT GAATGAGCTC GTCGAGGGCT GTGTGGCGGT GATGCAGCCG CACGCCAACC GCGCCCGCAT CATCATCCGC ACCTCGCTGT CGCAGAGCCT TCCGCCGGTC ATCGCCGATG CCAGGGCGCT GCGCCAGATC GCCATGAACC TGATCTCGAC CTCGATCTCG CTCGCCAATG CCGGCGGCCA AGTCATCGTC TCCACGGCGA CGTCGGATTT TGGCGAGGTG GTGCTGCGCG TGCGCGACAC TGGCCACGGT CTCAACGACC AGGCGGTGGC CGCCGCGCTC GAACCGTTCC GCGCGACGGC GGGGCCCGAG CAAGGCCATG ACGGCTCGGG TCTCAGCCTC TCTCTCACCA AGGCGCTGGT CGAGGCCAAT CGCGCGCAGT TTCACATCAA GGCGGCGCCG AAATCCGGCA CCTTGCTCGA GGTGGTGTTC TCGCGCGCGG TGGCGCGCAA TCAGAGCACC GCCTGA
|
Protein sequence | MNHADFELQA LSDPRLAVHA AGSLPAWLWS ADGSRVLWAN AMAARALGAA NSWALVDKRF GPADQHRRQV AQLARRLPEN GAIRLERLRG FGAPLGALAT CGCAMIDLPD GMRAVLITGT DNSGRSWPLT ERLRRLVENS GLALIAFTAH GEFAGISTAG RRLFGIHDLS APLFADVRNE ALTAGFAEIQ FGDERAVLHR VGAGSQIGLV AVPAPAPAVK QAEPNADVPV DAVPVAETAA TPPHPANPAP EVAAECPPPP SMDAMTERPL DDETSAQSRE AAAAFAPLDE PAVQAPPVVG IDRPEPTSSI AAETADADAT AFAQSDLAPA ADAPPQQDNI TVSTRDASVA SDAPPIDNGG LEAAGNVVPF RPAGETKHPA LTPVENNAFH ELARQLSARL EGDNRNAPAD AAAATHEPAA QVTEPFDAPH VPTSLLGSDA SDQPTAGLQA EWLTPEQPPA QGDSQRDRIL LDLLPTGMLI YRLDRLLYAN PAFFKQMGYT DLATLEQAGG LDALYVEPGA PAGSGKPEVG TPVTISAADG SSSEEIKAEA RLFTITWDGD SAHALIFAPA PGPAATAVSA PAETGSAPAP APAAPAPLVL APPPVAGHAD AEDLAAILDV TAEGILMFDA GGNIHACNRS AEAMFGYDGT ELVRRNIIEL FAPESARLVL DHLDRVQQAG TANVPEPSRE VLGKVAQGGA IPLSMTIGRT RQDGPNFFAV FRDLSQLRRG ESELLQARRV ADRAASAKAD MLARISHEVR APLNAIIGFA EVMIGERFGT LGNTRYAEYM KDIRASGERV VSLIDDLLEL SRIETGRLEL AFTSQNLNEL VEGCVAVMQP HANRARIIIR TSLSQSLPPV IADARALRQI AMNLISTSIS LANAGGQVIV STATSDFGEV VLRVRDTGHG LNDQAVAAAL EPFRATAGPE QGHDGSGLSL SLTKALVEAN RAQFHIKAAP KSGTLLEVVF SRAVARNQST A
|
| |