Gene Rleg_3320 details

Gene Information

Locus tag: Rleg_3320
Symbol:
ID: 8014204
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3322985
End bp: 3326371
Gene Length: 3387 bp
Protein Length: 1128 aa
Translation table: 11
GC content: 63%
IMG OID: 644825879
Product: histidine kinase
Protein accession: YP_002977106
Protein GI: 241206010
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
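
The coordinates and lengths in this record are mutually consistent, which can be confirmed with a short calculation. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the page does not state this) and that the CDS ends in a single stop codon; the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3322985, 3326371)
print(length, protein_length(length))  # prints: 3387 1128
```

Both values match the Gene Length (3387 bp) and Protein Length (1128 aa) fields.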


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.515444
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCGC GCCAACGCAT CATTCCGGTA AGACGCGAAT ATAATCGCTG GGTCGCCAAC 
CAGACGCTGG AAGATTATGC GCTGCGCTTC ACTGCCAAGA GCGCCCGCCA TTTTTCCTCA
CAGCGCATCT CACAGACGGC GATCGGCGCA ATCTCCTTCC TGGCGCTGGA GGCGATCGGC
GGCGCGATCA CGCTTTCCTA CGGCACCACC AACGCCTTTT ACGCCATCAT CGTCGCCTCG
ATCGCCATGC TGGCGATCGG CCTGCCGATA AGCCGCTACG CCATCCGCCA CGGCGTCGAC
ATCGATCTTC TGACGCGCGG CGCGGGCTTC GGCTATATCG GCTCGACCAT CACCTCACTG
ATCTATGCAA GCTTCACCTT CATGCTGTTT GCGATCGAGG CTTCGATCAT GTCCGGGGCG
CTGGAGCTCA CCCTCGGCAT CCCGCTCTGG ATCGGCTACA TCATCAGTGC CGTCATGGTG
ATCCCCCTGG TGACGCACGG CGTGCGGCTG ATCAGCAAGT TCCAGCTGAT GACCCAGCCC
TTCTGGATCG TGCTGAATAT CCTGCCTTTC ATCTTCATCG CCTTGCTGGA TTGGGAAAAA
TTCGATCTCT GGCGCGCCTT TGCCGGCATC CGTCATGCCT CCGGCCCGCC CGGCACCGTC
GCCGAGTTCG ATCTCGTGGA GTTCGGCGCC GCCTCCGCCG TCATCCTGGC GCTGATGTCG
CAGATCGGCG AACAGGCCGA CTTCCTGCGC TTCCTGCCGC CGGACCCGCA GCGCAAGTGG
CGCCACCGCC TGGCGATCTT TCTCGCCGGC CCCGGCTGGG TCATCATCGG CGCGCCCAAG
TTGCTTGCCG GTTCATTTCT GGTCGTGCTG ACCTTCACCT CGGGCGTCCC CCTCGATCGC
GCCGCCGACC CGGCACAGAT GTATCTGACC GCCTTCGGCT ACATGGTCCC CTGGCACAAT
GCCGCGTTGC TGTTGATGGC GGCCTTCGTC GTCGTCTCAC AGCTAAAGAT CAATGTGATG
AACGCCTATG CAGGTTCGCT CGCCTGGTCG AACTTCTTCT CGCGGCTGAC CCACAGCCAT
CCCGGCCGCG TCATCTGGCT GGTGTTCAAC GTCGCGATTG CCCTGCTTTT GATGGAACTC
GGCATCTACC GGCTGCTGGA GGAGACGCTC GGCATCTTCT CGATCATCGC CATGGCCTGG
CTCTGCACGA TCTCCGCCGA TCTCTTCATC AACAAGCCGC TGGGGCTGGC TCCCCCCGGC
ATCGAGTTCA AGCGCGCGCA TCTCTACGAC ATCAACCCGG TCGGCCTCGG CGCCATGACG
CTATCGGCGA CGGTGTCGCT GATTGCCCAT TTCGGTGCCT TCGGCGAGAT CGCCGCCTCG
CTCGCTCCCT ATATTACCCT CGTCGTCGCG CTGGTCGCCT CGCCTGTCAT CGCCTGGGCA
ACGAAAGGCA AGTTCTATCT CGCCCGCAAG CCGCGCCAGA GCTGGAAGAA CCTGACGAAC
ATCACCTGCT CGGTCTGCGA GCATCCCTTC GAGCCGGAGG ATATGGCCTG GTGCCCGGCC
TATGCCGCGC CGATCTGCTC GCTCTGCTGC TCGCTCGACA GCCGCTGCCA CGACATGTGC
AAGCCGGCCG CCCGTTTCAA TGCGCAAGTC GGCACCGTCG CCAAGGTGCT GCTGTCTGAA
ACCATCATCG AGAAGCTGAC GACGCGTCTC GGCCGCTACG GCATCGCCGT CGTGCTGGCA
TTGACCGCCA TCGGCGCGAT CCTGGCGATG ATCGCCCATC AGGTCGCCTC CGCCTCCCCT
GAGACGGCCG AGGTCGTCAA CCGCACCATC TTCATCGTCT TCTTCGTCTT CTCGGTGATC
GCAGGCGTCG TCTGCTGGTT CTATGTGCTC GCCCATGACA GCCGCGTCGT TGCCGAGGAG
GAATCCTCAC GCCAGAACAC GCTGCTGCTC AAGGAAATCG CCGCCCATAA GAAGACCGAC
GCTGCCCTGC AGAACGCCAA GGAAACGGCC GAGGCCGCCA ACCGCGCCAA GAGCCGCTAT
GTCGTCGGGC TCAGCCATGA ATTACGCACG CCGCTGAACG CCGTGCTCGG CTACGCCCAG
ATCCTCGAAC GCGACGAGAC CATTCCGGCG CCGCGCCAGT CCTCGATCAA GGTCATCCGC
CGCAGCGCCG AACACCTCTC CGGGCTGATC GACGGCCTGC TCGATATTTC CAAGATCGAA
GCCGGCCGCC TGCAGGTCTA TTCCAACGAG ATCAACATCC AGGATTTCCT CGACCAGATC
GTCGATATGT TCCGGCCACA GGCGCAGGCA AAGGGGCTCG CCTTCATCCA TGAGCGCGCG
CCGGCCTTGC CGCAATTCGT CCGCACCGAC GAAAAGCGCC TGCGCCAGAT TCTCGTCAAC
CTGCTCTCCA ATGCCATCAA GTTCACTGAC GAGGGCAGCG TCACCTTCGA TGTCGGCTAT
CGCAGCCAGG TCGCGACCTT TACCGTCGCC GATACCGGCC GAGGCATCAC CGAGAAAGAC
CTGCCCCGCA TCTACGAACC CTTCCAGCGC GGCGAGGCCG AAAGCGTGCG GCCAATGCCG
GGGCTCGGCC TCGGCCTCAC CATCACCCGG CTTCTGACCA ACACGCTCGG CGGCGAGATC
TCGGTTTCAA GCGTGAAGGA GGAAGGCTCG ACCTTCCGCG TCCGGCTGAT GCTGTCCGCC
GTCATGCGCG CAGTGGCCGC AGCGCCGCAG GAGAAGCGCA TCGTCGGTTA TGACGGCCCG
CGCCGCACGA TCGTCGTCGT CGATGATAAC GAGGACCACC GCGAGATGAT GCGCGAGATC
CTGGCGCCGC TCGATTTCAT CGTGCTGACG GCGGCAGGCG GCGGCGAATG CCTGACGCTG
ATCGAGGGCA TCATGCCGGA CCTTTTCCTC GTCGATATCC TGATGCCCGG CATGAACGGC
TGGCAGCTCG TCTCGCGTCT GCGCGAGGCC GGACAGACAG CGCCAGTGCT GATGCTGTCG
GCCAATATCG GTGATGCGGC CGTTCTCAGC GACAGTGACG ACAGCCACAA TGATGCGATC
GGCAAGCCGG TCGACATCCG CCAGCTGCGC GACAAGCTCG CCCTGCATCT CGGCCTGACA
TGGATCTATG CCGATGCCAT GCCAACCGTC CCTGTCAAGA TCGAAGCGCC GATGCTGAGC
CCAGGTGCTG CCCATGTGCA GGAATTGCTG CGGCTCGGCG AGATCGGCTA TATCAGAGGC
ATCGAAGCCA AGCTTTCGGA CCTTGCCAAG GTGGAGGCAA ATCAGCCATT CACGGAGGAA
CTTCGCGCCT ATGTCGCCGC CTTCGATCTC GCCGGCTTCA TGACCTTCCT GCACGACTTC
GACGAAAAGG TGGAATCCAT TGGCTGA
 
Protein sequence
MAARQRIIPV RREYNRWVAN QTLEDYALRF TAKSARHFSS QRISQTAIGA ISFLALEAIG 
GAITLSYGTT NAFYAIIVAS IAMLAIGLPI SRYAIRHGVD IDLLTRGAGF GYIGSTITSL
IYASFTFMLF AIEASIMSGA LELTLGIPLW IGYIISAVMV IPLVTHGVRL ISKFQLMTQP
FWIVLNILPF IFIALLDWEK FDLWRAFAGI RHASGPPGTV AEFDLVEFGA ASAVILALMS
QIGEQADFLR FLPPDPQRKW RHRLAIFLAG PGWVIIGAPK LLAGSFLVVL TFTSGVPLDR
AADPAQMYLT AFGYMVPWHN AALLLMAAFV VVSQLKINVM NAYAGSLAWS NFFSRLTHSH
PGRVIWLVFN VAIALLLMEL GIYRLLEETL GIFSIIAMAW LCTISADLFI NKPLGLAPPG
IEFKRAHLYD INPVGLGAMT LSATVSLIAH FGAFGEIAAS LAPYITLVVA LVASPVIAWA
TKGKFYLARK PRQSWKNLTN ITCSVCEHPF EPEDMAWCPA YAAPICSLCC SLDSRCHDMC
KPAARFNAQV GTVAKVLLSE TIIEKLTTRL GRYGIAVVLA LTAIGAILAM IAHQVASASP
ETAEVVNRTI FIVFFVFSVI AGVVCWFYVL AHDSRVVAEE ESSRQNTLLL KEIAAHKKTD
AALQNAKETA EAANRAKSRY VVGLSHELRT PLNAVLGYAQ ILERDETIPA PRQSSIKVIR
RSAEHLSGLI DGLLDISKIE AGRLQVYSNE INIQDFLDQI VDMFRPQAQA KGLAFIHERA
PALPQFVRTD EKRLRQILVN LLSNAIKFTD EGSVTFDVGY RSQVATFTVA DTGRGITEKD
LPRIYEPFQR GEAESVRPMP GLGLGLTITR LLTNTLGGEI SVSSVKEEGS TFRVRLMLSA
VMRAVAAAPQ EKRIVGYDGP RRTIVVVDDN EDHREMMREI LAPLDFIVLT AAGGGECLTL
IEGIMPDLFL VDILMPGMNG WQLVSRLREA GQTAPVLMLS ANIGDAAVLS DSDDSHNDAI
GKPVDIRQLR DKLALHLGLT WIYADAMPTV PVKIEAPMLS PGAAHVQELL RLGEIGYIRG
IEAKLSDLAK VEANQPFTEE LRAYVAAFDL AGFMTFLHDF DEKVESIG
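
Both sequences are printed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes GC content, and translates the DNA using the standard codon assignments. Translation table 11 shares those assignments and differs only in the set of permitted start codons, which does not matter here because the gene begins with ATG. The helper names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI's TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used on the page."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as shown on this page.
first_line = "ATGGCAGCGC GCCAACGCAT CATTCCGGTA AGACGCGAAT ATAATCGCTG GGTCGCCAAC"
print(translate(first_line))  # prints: MAARQRIIPVRREYNRWVAN
```

The translated fragment matches the first 20 residues of the protein sequence. Running `gc_content` on the full gene sequence should give a value close to the 63% listed in the record.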