Gene BBta_4271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_4271 
Symbol 
ID5150486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp4480664 
End bp4482742 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content67% 
IMG OID640559091 
Producthistidine kinase 
Protein accessionYP_001240228 
Protein GI148255643 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.305016 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0625064 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGAGC AGCGTGACGG TGACTTGGCG TTTGAACAGG AGGTGGTCCA GCGCCTTGGC 
CTGATGCCCA ATTTATTCCG TCCCCGGCCA TCGGCGCCGG AGACCGCCCG CACGCTGTGG
CAATTCGCGA AGAGCGCCTA TCTCGATGCG CCATTGCCTG CGCTGTTCAA GGAACGTCTC
GCCGTCCACC TGTCGCGCCT CGGCTGCGCC CGCTATTGCG TGGCCCGCCA TTTCGGCTTT
CTGGTCGGGC TGGGTCATCC CGGCGGCGAT CCCACGGTCC GGCCGGAGAC GGTCGACCGC
GCCCGCCAGT TGCTCATGCG TCCCATTGTA GATTCGGGTC GCGTCGCAGC CGCGCTGGAG
CGGATGGGGA CGGTGCGCGA GCTGCCGGAG CTTCCGGCCC CCTTCAGCGC GGCAGAGGCT
GATCTCTTCG ATGCCGCCGG GGCGCTGTTG CTGCAGACCG AACTGGCCGG GCGGGCGCGG
CTCGCGCTCG GACAGGCGCT GGGCGATCGC AGGCTGGAAC TGCTTGCCGC CTTTCTCACC
TATCTGCGCA TGGCACATTT CTGGGCCGAC GCCCACCCCG ACATCGCGCT GGACCCGGAC
GTGCTGAGCT TGATGCAATC CGATCCCGAT CTGGCCACGC AGCTGAGCTG CGCCGACGCA
AGACAGGGTC GTGATCCGGA CCTGGCTGAG CAGCACGGGC CGGCGGAGCA CATCGTCTCC
AGCGCGCACC GGCTGCAGCG GCTGCAGGAG ATCGACTCGG TCGGCGTGCT GTTCTTCGAT
CATGCCGAGG GCAGGCTGAT CGACGCCAAT GACGCCTTTC TCGCCATGAC CGGCTATTCG
CGCGAGGAGG TGCGCGGCGG ACGGCTGACC TGGCAGAGCA TGACGCCACC GGAATGGCAG
GACAGCTCCG CGACCCAGAT CGATCTTCTG CAGCGGACCG GCCGGATCGG TCCCTACGAG
AAGGAATATT TCCGCAAGAA CGGCGAACGC ACCTGGATGA TGTTCGCGGG CCGCGACCTC
GGCGACGGTA CCCTGGTCGA GTTGGCGATG AATATCGACG ATCGCAAGCG CACCGAGGCC
GCGCTGCGCG AGAGCGAGGC GCGCTTCCGC CAGTTCGGCG CGGCGTCGTC CGATGTGGTG
TGGATCCGCG ATGCCGCGAC CATGCGCTGG GAATATCTCA GTCCCGCGTT CGAGACGATG
TATGGCCTGC CGCGCGAAGC CGTGCTGCAA GGCGACCATC TCGCGGCGTG GACCGAGCTG
ATGCTGCCGG AGGACCGACC GCGCGTCCTC GAGGGGCTCG CCCGCGCCCG CCAGGGCGAG
CGCGACAGCT TCGATTTTCG CATCATCAGG CGCAGCGATG GCAGCCTGCG CTGGCTCCGC
ATCCGCAGCT TTCCGATGAT CGACGAGACC GGGCGGGTGC AGCGAATCGG CGGCATTAGC
CAGGACATCA CCGCGCTCGT GTCAGCGACC GAGCACCAGA AATTGCTGCT GGCCGAGCTG
CAGCATCGGG TTCGCAACAC GCTGGCGGTG ATCCGCTCGA TCATCCGGCG CACCGGCGAC
AGCAGCGACA GCATCGAGGA TTTTGCCAGC CATCTCGAGG GCCGCATCAC GGCGCTCTCG
CGCGTTCAGA GCGCCATCAC CCGCGATCCG TTTGCAGGTT TCGACCTCGC TCAACTGATC
GCCGACGAGC TGCGCGCCGG CGCAGCGCGT GAAGACGAGC AGTTTTCGCT GGCCGGCCCG
CCCTTGCGAA TCAGGGCCAA GGCTGCCGAA AGCATCGGCC TTGCCGTGCA TGAGCTCGTG
ACCAACGCGC TGAAATTCGG CGCCCTGACC AGGCCGCGCG GCTTCATCAG CATCGGATGG
CGGCTCGACG GCAGCGAGGA CAGCGGCTGG ATCGTGCTGG ACTGGAGCGA GACCGGCATG
TCGGGTCACC CGATCGCCGC GCATCGCCAG GGCTTTGGAA CGGTCCTGCT CGAACAGATG
CTGCCCTATG ACGTCGGGGC TCGCGTGACC AGGCGATTCG AGCCGAGCGG CCTGCGTTGC
GAGATCCGGC TGCCGGCGCG GGACATCCTC AAGCGTTGA
 
Protein sequence
MVEQRDGDLA FEQEVVQRLG LMPNLFRPRP SAPETARTLW QFAKSAYLDA PLPALFKERL 
AVHLSRLGCA RYCVARHFGF LVGLGHPGGD PTVRPETVDR ARQLLMRPIV DSGRVAAALE
RMGTVRELPE LPAPFSAAEA DLFDAAGALL LQTELAGRAR LALGQALGDR RLELLAAFLT
YLRMAHFWAD AHPDIALDPD VLSLMQSDPD LATQLSCADA RQGRDPDLAE QHGPAEHIVS
SAHRLQRLQE IDSVGVLFFD HAEGRLIDAN DAFLAMTGYS REEVRGGRLT WQSMTPPEWQ
DSSATQIDLL QRTGRIGPYE KEYFRKNGER TWMMFAGRDL GDGTLVELAM NIDDRKRTEA
ALRESEARFR QFGAASSDVV WIRDAATMRW EYLSPAFETM YGLPREAVLQ GDHLAAWTEL
MLPEDRPRVL EGLARARQGE RDSFDFRIIR RSDGSLRWLR IRSFPMIDET GRVQRIGGIS
QDITALVSAT EHQKLLLAEL QHRVRNTLAV IRSIIRRTGD SSDSIEDFAS HLEGRITALS
RVQSAITRDP FAGFDLAQLI ADELRAGAAR EDEQFSLAGP PLRIRAKAAE SIGLAVHELV
TNALKFGALT RPRGFISIGW RLDGSEDSGW IVLDWSETGM SGHPIAAHRQ GFGTVLLEQM
LPYDVGARVT RRFEPSGLRC EIRLPARDIL KR