Gene Rleg2_0224 details


Gene Information

Locus tag: Rleg2_0224
Symbol:
ID: 6978936
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 227757
End bp: 231260
Gene Length: 3504 bp
Protein Length: 1167 aa
Translation table: 11
GC content: 64%
IMG OID: 643394935
Product: multi-sensor hybrid histidine kinase
Protein accession: YP_002279750
Protein GI: 209547833
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG0591] Na+/proline symporter
        [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
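
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the reported protein length excludes the stop codon. The record states neither convention, but the numbers are consistent with both.

```python
# Values copied from the Gene Information fields above.
START_BP = 227757
END_BP = 231260
REPORTED_GENE_LENGTH = 3504     # bp
REPORTED_PROTEIN_LENGTH = 1167  # aa

# Assumption: coordinates are 1-based and inclusive, so the span
# covers END_BP - START_BP + 1 bases.
gene_length = END_BP - START_BP + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length == REPORTED_GENE_LENGTH)        # coordinate span vs. Gene Length
print(gene_length % 3 == 0)                       # CDS should be a whole number of codons
print(protein_length == REPORTED_PROTEIN_LENGTH)  # codons minus stop vs. Protein Length
```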


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTCCAG GCTGGGTCAT ATTCGCGTCT GCCTTCGGCT ATCTGCTCCT GCTTTTCGCC 
GTGGCAAGCT ATGGCGACCA CAAGAAGCGC GGCCCGGGGG GGCTCGAAGG CGGATGGCCG
GTGGTCTATG CGCTGAGCCT GGCGATCTAC TGCACCTCCT GGACCTATTT CGGCAGCGTC
GGGCTTGCCG CGCAGCGCGG CCTAGAATTT GCCGGCATCT ATATCGGCCC CATTCTGGTC
TTCACGCTCG GCATGCCGCT GCTTCGCCGG ATCATCGAAC TCGCCAAGGC GGAGAAACTC
ACCTCGGTTG CCGATTTCGT CGCCGCGCGC TACGGCAAGA ACCCGACGGT CGCCACCATC
GTCGCGCTGA TCTCGCTGAT CGGCACCATT CCCTATATCG CGCTGCAGCT GAAGGCGATC
TCCAGCACCG TCAGCGCCAT GGTCAACCCG TCCGACTACG GCATCGGCAG CGGCAATCTC
TACTTCCTCG ACCTGCCGCT GGTTGCGACG CTGGTGCTTG CCTGCTTCGC CATCATGTTC
GGCACGCGGC ATACCGATGC GACTGAGCAT CAGGACGGCC TGATCCTCGC CGTCTCGATG
GAATCCGTGG TCAAGCTCGT CGCCTTCCTG ACGGCCGGCA TTTGCGTCAT CTGGTTTCTC
TTCGACGGCC CGACCGATCT CTGGCAGAAA ACGGTCGACA ACGGCCTTGT GATGTCGGCG
CTGAGTTACC ACACGCCGAT CAGCCGCTGG ATGACGCTGA TCCTGTTGTC GGCCTTCGCG
ATCATCCTGC TGCCGCGGCA ATTCCATGTG ACGGTCGTCG AAAACCGGAC GCCGAAACAG
CTGAAACTCG CAGGCTTCCT GTTTCCCACC TATCTGATCG CGATCAATCT CTTCGTGCTG
CCGGTGGCGA TCGGCGGGCT GCTGACCTTC GGCGGTGCCG GCAATGCCGA TTTCTACATG
CTGTCGCTGC CGCTTGCCGG TGAGATGCCT GTGGTGTCGC TGATCACCTT CATCGGCGGC
TTCTCCGCCG CAACGGCGAT GGTGATCGTC GATTCCGTGG CGCTGTCGAT CATGGTGTCG
AACGACATCA TCATGCCCAT CTTCCTCAGG CGCAAACTTG CCGGCCGCGC CAGCCAGCGC
GACAATTTCG CCAAGACGCT GCTCAACATC CGCCGCAGCG CCATTTTTGC CGTGCTGCTG
TTCGGCTACG CTTATTACCG CTCGACCGAC AGCACCGCCG GCCTTGCCTC GATCGGCCTT
CTCTCCTTTG CCGCCATCGC CCAGATCGCC CCGGCGCTGT TCGGTGGGCT GATCTGGCGG
AGGGCGAATG CACGCGGCGC CATCCTCGGG CTTTCCTCCG GCTTCGTCAT CTGGATCTAC
CTGCTGTTCC TGCCCTCGCT CGGCGGTCCC GATTATTCCT ATGTGGCAAG CGCCGTGCTC
GGCTTCATCT TTCCGGGCAC GACGCTGTTT GCCGCGCCCG ACGCCGATCC GCTGGTCAAT
GCGACGGTGA TGAGCCTGCT CGTCAACACC GCTTTCTTCA TCGTCGGCTC GCTCACTCGC
AATGCCAGGC CGCTCGAACG CATCCAGGCC GGCATCTTCG TCAAACGGCA TTCACGCTCG
CAATTTGCCA CGCGCGGCTG GAAGACCCGC ATCAGCGTCG GCGATCTCAA GGCGGCGATC
TCACGTTATC TCGGCGAAGA GCGCATGCAG CGCTCGTTGA CAACCTACGA ACAAAGCTCC
GGCCGCAAGC TGGAGGACGA TCAGCCGGCC GACATGGCGC TGATCCATTT CAGCGAACAG
CTGCTCGGCA GCGCCATCGG CTCGTCCTCC GCCCGGTTGG TGCTGTCGCT GATCCTGCAG
AAGATCGAGG ATGCCTCGTC CGACACCGCC TGGCTGCTCG ACCAGGCGAG CGAGGCGCTG
CAATATAACC AGGACATGCT GCAGACCGCA CTTTCGCAGA TGGACCAGGG CATTGCGGTG
TTCGACAGCT CCAACCGGCT GACCATCTGG AACCGGCGCT TCCGGCAATT GCTGGATCTG
CCGGAAAGTG CCGGTCAGGT CGGCTTTCCC CTCGCCGACA TCGTCACCAC GCTGAGTGAG
CGCGGCGACA TCGCGCCCGG CGATCTCAAT CAGACGGTGC GGCATTTCCT GACGCTCGAC
AAACCCTTCG CGCTGGTGCT CAGCGGCGGC GAGCGGATCA TCGAGGTGCG CTCCAACGCC
ATGCCCGACA AGGGCATCGT CGCCACCTTC ACCGATATCA CCCAGCGCGT CAGCGCCGAC
CAGGCGCTGA AACAGGCAAA TGAGACGCTG GAGCAGCGCG TTGCCGAACG CACGGCCGAG
CTGACCCGCG TCAATCACGA ACTCGCCGAG GCGCGGGCCG CCGCCGACGA GGCGAATATC
GGCAAGACCC GCTTTTTCGC CGCCGCCGGC CACGATATCC TGCAGCCGCT GAACGCCGCC
AGGCTCTATT CCTCGGCACT GGTCGAGCGC ATGGCGCAAT CCGACAACAG CCCGATCGTG
CGCAATATCG ATTCGGCGCT GGAATCGGTC GAAACCATTC TCGGCGCCGT GCTCGATATC
TCCAGGCTCG ATACCGGCGC CATGCGGCCG CGGCTCGCGG CCGTTGCACT TTCCGACCTG
TTGGAGCGGA TCGAGACCGA TTTCGCGCCG ATCGCCCGGG AAAAACAGCT GAAGCTGGTG
GTCATGCCGA CGTCGCTCAG GGTGCGTTCC GACCCCAATC TTCTGCGCCG GCTGGTGCAG
AACCTCGTTT CCAACGCGAT CAAATATACG ATCACCGGCA AGGTGCTGGT CGGCGCGCGG
CGGCGCGGCA ACCAGGTGAT CATCCAGGTG ATCGATTCCG GCATCGGCAT TCCGCCGTCG
AAATTCCGCA CGGTGTTCAA GGAATTCGCC CGGCTGGACG AAGGGGCAAA AACCGCCTCC
GGCCTCGGGC TCGGCCTGTC GATCGTCGAC CGCATCGCCC GGGTGCTCAA CCATCCTGTG
GAGCTGCACT CGACGCATGG CAGGGGCACG GAATTCCGCA TCGCCATGCC GCTCGACATC
TCGCGCCCGG CCGAGGCGGC CGCCGCCGTC ACACCCACTG AGCGGCCAGG GCAGCCGCTC
AAGGGGCTGA AGATCCTCTG CATCGACAAC GAGCCGAAGA TCCTCGAAGG CATGCGGCTG
CTGCTCAGCG GCTGGGGCTG CGAGGTCAAG GCGCTCGATA GCCTCGCCGA CGTGATATCA
GGTGACGGCC GTGACGGGCC GCCGGATCTC GCCATCGCCG ACTATCATCT CGACGACGGC
ACCGGCATCG CCGCGATCCT GCATCTGCGC CGGCAGTTCG GCGCCGATAT CCCGGCTTTG
CTGGTTACCG CCGACCGTAC GCCCGAAGTG CGCAGCGAGG CCGAGCGATA CGGCATCGCC
GTCCAGCACA AGCCGGTGCG GCCGGCGGCG CTCCGCGCCT ATATCACCCA GGTTTCCGGC
CTTAAGCGCG CGGCTGCCGA GTAA
 
Protein sequence
MLPGWVIFAS AFGYLLLLFA VASYGDHKKR GPGGLEGGWP VVYALSLAIY CTSWTYFGSV 
GLAAQRGLEF AGIYIGPILV FTLGMPLLRR IIELAKAEKL TSVADFVAAR YGKNPTVATI
VALISLIGTI PYIALQLKAI SSTVSAMVNP SDYGIGSGNL YFLDLPLVAT LVLACFAIMF
GTRHTDATEH QDGLILAVSM ESVVKLVAFL TAGICVIWFL FDGPTDLWQK TVDNGLVMSA
LSYHTPISRW MTLILLSAFA IILLPRQFHV TVVENRTPKQ LKLAGFLFPT YLIAINLFVL
PVAIGGLLTF GGAGNADFYM LSLPLAGEMP VVSLITFIGG FSAATAMVIV DSVALSIMVS
NDIIMPIFLR RKLAGRASQR DNFAKTLLNI RRSAIFAVLL FGYAYYRSTD STAGLASIGL
LSFAAIAQIA PALFGGLIWR RANARGAILG LSSGFVIWIY LLFLPSLGGP DYSYVASAVL
GFIFPGTTLF AAPDADPLVN ATVMSLLVNT AFFIVGSLTR NARPLERIQA GIFVKRHSRS
QFATRGWKTR ISVGDLKAAI SRYLGEERMQ RSLTTYEQSS GRKLEDDQPA DMALIHFSEQ
LLGSAIGSSS ARLVLSLILQ KIEDASSDTA WLLDQASEAL QYNQDMLQTA LSQMDQGIAV
FDSSNRLTIW NRRFRQLLDL PESAGQVGFP LADIVTTLSE RGDIAPGDLN QTVRHFLTLD
KPFALVLSGG ERIIEVRSNA MPDKGIVATF TDITQRVSAD QALKQANETL EQRVAERTAE
LTRVNHELAE ARAAADEANI GKTRFFAAAG HDILQPLNAA RLYSSALVER MAQSDNSPIV
RNIDSALESV ETILGAVLDI SRLDTGAMRP RLAAVALSDL LERIETDFAP IAREKQLKLV
VMPTSLRVRS DPNLLRRLVQ NLVSNAIKYT ITGKVLVGAR RRGNQVIIQV IDSGIGIPPS
KFRTVFKEFA RLDEGAKTAS GLGLGLSIVD RIARVLNHPV ELHSTHGRGT EFRIAMPLDI
SRPAEAAAAV TPTERPGQPL KGLKILCIDN EPKILEGMRL LLSGWGCEVK ALDSLADVIS
GDGRDGPPDL AIADYHLDDG TGIAAILHLR RQFGADIPAL LVTADRTPEV RSEAERYGIA
VQHKPVRPAA LRAYITQVSG LKRAAAE
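
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The sketch below is illustrative and not part of the original record. It shows one way to strip the block formatting, compute a GC fraction, and translate the gene sequence. The helper names (clean_sequence, gc_fraction, translate) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons; the special handling of alternative start codons in table 11 is not modeled here, which does not matter for this gene because it starts with ATG.

```python
# Standard codon assignments in TCAG order (same amino acids as table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in an already cleaned sequence."""
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X';
    trailing bases that do not fill a codon are ignored.
    """
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_blocks = clean_sequence("ATGCTTCCAG GCTGGGTCAT ATTCGCGTCT")
print(translate(first_blocks))  # matches the first block of the protein sequence
```

Applying clean_sequence to the whole gene sequence and passing the result to gc_fraction and translate gives values to compare against the GC content, Protein Length, and Protein sequence fields of this record.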