Gene BBta_4234 details

Gene Information

Locus tag: BBta_4234
Symbol:
ID: 5149088
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 4445725
End bp: 4448010
Gene Length: 2286 bp
Protein Length: 761 aa
Translation table: 11
GC content: 65%
IMG OID: 640559058
Product: sensor histidine kinase
Protein accession: YP_001240195
Protein GI: 148255610
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
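
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Illustrative consistency check; function names are hypothetical.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4445725, 4448010)
print(length, protein_length_aa(length))  # prints: 2286 761
```

Both values agree with the Gene Length and Protein Length fields in the record.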


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGGCC ATCGCCTCGC ATTGGGGCTC GGGCTGGGAA CTCTGCTTGC GATCGGAGCG 
GCATCGACGG CGCTCGACAT CAAGTCGCGC CATGACGCAG CCTGGGTCGA TTCATCGCTC
AACGTTCTGC AGAAGGCATC CGATCTGCGG TTGCTCCTGC GCGAGGTGGA GGCGTCCTCA
CGCGGCTTCG GGCTCACCGG CGCGGAGCAG TTCAACACCG AGTTCAACGA CGTCAGGGCC
CAAATTCCCA CGGCTTTGTC CGATCTGCAG CGAAGCGCGG CGGACAGTCC CTCCGCGAGG
AACCTGTTTG CCGAGACGGC CGCGCTGATC GACCGGAGGA TTGATCTCGC CGCCGAATTG
ATCCGTCTGC GCGCGGCATC GGATCGCTCC GGGCTCGACG CGCTTGGCAG TCGCGCCGAA
GGCCCGGCCA TCATGGCCAG GATCACGGCC AATCTCGATC GCTTCGTTGC CGAACAGCGT
CAGCTCCTGG CGCAGCGGAC GGCGACCTCC AAAGCGACCG GCTCCCTGCT GCTGGCGATC
GACCTGTGCG GGCTTGCGGT GCTCCTGCTC ATCGCTGGAT ACCTCATCCG ACGCACCTAT
CAGGACGACC GCACGCTCCG CGCCTCCCTG CTCCGTTCCG AGGCCGAAGC CGTCTCGCTG
GCCGAGAAGG CCGAAGAACA GCACCGCGAT CTGATGGCGG CGCATGAGAA GCTGCGCCAT
TCCGCCGAGA TCCTGCAAAG CACCTTCAAC AGCATGGCCG AGGGCGTGCT GGTGATCGAT
TCCGCCGACA CGGTCGTATT CTCGAACCCC GCCGTGGAGC GTCTGCTCGG CTACCGTCCC
GGCAGGAAGC TGTCCGAAGT TCGAGCCCGG CTTCGCGTCT ATGAAAGCGA CGGCGTGACG
CAAATCGCAG GTCATGACCT GCCGACGCCG CGCGCCCTGC GCGGCGAGGC ATTTGACCGT
CGCGAACTGA TCGTCCATCT GCAAGAGCGC GACGAGTTCT TGCGGTTGAT GGTGAGCGGC
CGGCCGTTGC GTGACGCCGC CGGCGCCATC ACCGGGGCGG CGATGGTCTA TCACGACATC
ACGATGGCGC ATGAGACCGA GCGATTGCTG TATCAGGCGC AGAAGCTCGA CGCGATCGGC
AAGCTCACGG GCGGTGTCGC GCATGATTTC AACAACATGC TCACGATCAT CACCGGCACG
ATCGAAACCC TCGCCGAAGA GGTTCAAGAC CGCCCAGCTG CCGCCGCCAC CGCAGCCCTG
ATCGGCCAAG CCGCAGACCG TTGCACCGAG CTGATCCGGC ATCTCCTCGC CTTCGCACGC
AAGCAGCCGC TGCATCCGCG CGACAGCGAC GTGAACAGCA CGATCCTCGA CATCGCCAAG
CTGCTGCGAC CGACCCTCGG TGAGCAGATC GAAATCAACT CGATCCTCGA TGAGGAAGAG
CTGATCGCTC ATATCGACTC GGCGCAGCTT GCCAACTCAC TCGTCAACAT GGCCATCAAC
GCCCGCGATG CGATGCCCAA TGGCGGCAAG CTGCTGCTGG AATCGCGCCG GGTCGTGCTC
GACGCCGCCT ATGCCGCCGC CAATGCCGGC GTGGAGCCCG GCGCCTATGT CATGGTCGCC
GTCAGCGATA CCGGCACCGG CATGTCGACA GAGGTCCGAG ATCGCGCGTT CGAGCCGTTC
TTCACGACCA AGGAGGCCGG CAAGGGCTCA GGCCTCGGGC TCAGCATGGT CTATGGCTTC
GTCAAGCAAT CCGGTGGGCA CATCAAGATC TATAGCGAGG AAGGCAAGGG CACCACGATC
CGGCTCTATA TTCCCGCACC CAAGGGGCAA CCCGCAGCCC CGACGGCGCC TGATCCGCCG
CCGCCCCGCG GCACCGAAAC GATTTTCATC GTCGAGGACG ATCCGCTGGT GCAGGATTTC
GTGGTGGCGC AACTGCAGAG CCTGGGCTAC CGGACCATCA CGGCATCTAC CGGAGTCGAA
GCCCTGGCGA AGATCGAGGC CGGGCAGACC TTCGACCTTC TCTTCACCGA CGTCATCATG
CCCGGTGGCG TCAACGGCAA GGAGCTCGCC GAGGAGACGC TGCGGCAACG TCCCGGCATG
AAGGTTCTCT ACACGTCGGG CTACACGGAT AACGCCATGA TCGAGCATGG CGGCCTCGAT
CAGGACGCGC TGCTTCTCAC CAAGCCCTAT CGCAAATCCG AGCTCGCGCA GATGGTCCGC
CTGGCGCTGC TGTCGGGCGC GGATGCCGGA CCGCCCGCCG CCGTCGGCGG CGCTCTCGCC
GGCTGA
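
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before computing anything from it. The following is a minimal sketch of how a GC fraction such as the "GC content" field could be computed; the helper names are hypothetical, and the example input is only the first two blocks of the sequence above, not the whole gene.

```python
# Minimal sketch; helper names are hypothetical.

def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, for illustration only.
first_blocks = "ATGATCGGCC ATCGCCTCGC"
seq = clean_sequence(first_blocks)
print(len(seq), round(gc_fraction(seq), 2))  # prints: 20 0.65
```

Passing the full block-formatted gene sequence to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the reported GC content.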
 
Protein sequence
MIGHRLALGL GLGTLLAIGA ASTALDIKSR HDAAWVDSSL NVLQKASDLR LLLREVEASS 
RGFGLTGAEQ FNTEFNDVRA QIPTALSDLQ RSAADSPSAR NLFAETAALI DRRIDLAAEL
IRLRAASDRS GLDALGSRAE GPAIMARITA NLDRFVAEQR QLLAQRTATS KATGSLLLAI
DLCGLAVLLL IAGYLIRRTY QDDRTLRASL LRSEAEAVSL AEKAEEQHRD LMAAHEKLRH
SAEILQSTFN SMAEGVLVID SADTVVFSNP AVERLLGYRP GRKLSEVRAR LRVYESDGVT
QIAGHDLPTP RALRGEAFDR RELIVHLQER DEFLRLMVSG RPLRDAAGAI TGAAMVYHDI
TMAHETERLL YQAQKLDAIG KLTGGVAHDF NNMLTIITGT IETLAEEVQD RPAAAATAAL
IGQAADRCTE LIRHLLAFAR KQPLHPRDSD VNSTILDIAK LLRPTLGEQI EINSILDEEE
LIAHIDSAQL ANSLVNMAIN ARDAMPNGGK LLLESRRVVL DAAYAAANAG VEPGAYVMVA
VSDTGTGMST EVRDRAFEPF FTTKEAGKGS GLGLSMVYGF VKQSGGHIKI YSEEGKGTTI
RLYIPAPKGQ PAAPTAPDPP PPRGTETIFI VEDDPLVQDF VVAQLQSLGY RTITASTGVE
ALAKIEAGQT FDLLFTDVIM PGGVNGKELA EETLRQRPGM KVLYTSGYTD NAMIEHGGLD
QDALLLTKPY RKSELAQMVR LALLSGADAG PPAAVGGALA G
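
The protein sequence can be cross-checked against the gene sequence by translating codon by codon. The sketch below is an illustration under stated assumptions, not the database's own method: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons (table 11 differs in the set of permitted start codons, which does not matter here because this gene begins with ATG). The `translate` helper is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base T, C, A, G; then
# second base; then third base). Assumed to apply to internal codons of
# translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a forward-strand CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence.
print(translate("ATGATCGGCC ATCGCCTCGC ATTGGGGCTC"))  # prints: MIGHRLALGL
```

The output matches the first 10-residue block of the protein sequence. The final six bases of the gene, GGCTGA, translate to G followed by a stop, which agrees with the terminal G of the protein.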