Gene Rleg_6856 details


Gene Information

Locus tag: Rleg_6856
Symbol:
ID: 8022439
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012858
Strand:
Start bp: 302468
End bp: 305470
Gene Length: 3003 bp
Protein Length: 1000 aa
Translation table: 11
GC content: 58%
IMG OID: 644833722
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_002984856
Protein GI: 241666772
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
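
The start, end, gene length and protein length values in this record agree with one another, and the relationship can be checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption that matches the listed gene length) and that the final codon of the gene is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(302468, 305470)
print(length)                  # 3003
print(protein_length(length))  # 1000
```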


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGATAA ACGTCTTTCA ATTCTTCGAT GCGCTGCCCG GCATGGCATG GACCATGCTG 
GAGGATGGCC AAGTCGATTT CGTCAACCGA AGCTGGGTAG AGTTCACGGG GCTGCGACCG
GCGTCCGGAG AGTCTTGGCG CTGGGAGGAT GCGGTGCACC CGGCCGACGT TGGCGCGGTC
ACCGCCCATC TGCAGAGCGT TAGGCACGCA GAGACACAGG GAACGATCGA AGCGCGGCTT
CGGAACGCTG CAGGCGAATT TCACCATTTC CTTATCCAGT GGGCACCTCT GGAAGACGCT
CCCCACGGCG TAGTGAATTG GTGCGCGGTC GCGACCAATA TTGAACCCGT CGTTCAAAAA
CGGGAAAGGT CACAGGCCGC ACTCGATTTC CAGCTCGTCG TCGATAGCAT CCCGATACCC
GTGGCCGTGA CAACACCTGC TGGGGAAGTC GAAGGCCTCA ACCAACTGAC CCTCTCCTAT
TTTGGGCTAA GCCTCTCGGA TCTAAAGGAC TGGAAGGCTT CGGAGGTCGT GCACCCGGAT
GACTTGAAGG AAACCATCGA AGCCCAGATC GCCGCTCACA TGGCGGGCAC GTCTTATAAC
GTTGAGAGCC GCCATCTTCG CGCCGACGGA ATTTATCGGT GGCACAATGT ACTTGGTCTA
CCGCTGCGGG ACCAGTCGGG TGCGATTCAG CGGTGGCTGC ATCTTTTGAT AGACATCGAT
GACCGAAAAC GTGCCGAGGT AGCTCTTGCA AACAGTGAAC GAGAGTCTCG CTTGATTGTC
GGCACTATCG CAGGAATGGT CGCGCTCTTT ACGCCCGAAG GTCAGTTGAA CGGCGCAAAT
CAACAGCTTC TCGACTACTT CCAACTGCCC CTGGAGGAGG TTGTGAACTG GGCGACCAAC
GGTATCACGC ACCCCGACGA CCTGCAGCAC TGTGTCGAGA CATTTACGGC GTCGCTCAAA
ACGGGAGAGC CGTATGACTT CGAGACGCGC TTCCGTCGCC ATGATGGCGA ATTTCGCTGG
TTCCAGGTCC GCGGCCACCC GGTCAAAGAC GATAATGGCG GGATCGTCCG CTGGTATGGT
CTGCTGACTG ATATCGATGA TGGGCGCAGG GCGGTGGAGG CGTTGCGCGA GCGCGAGATC
GAGCTGCAAT TGATCGTAAA CTCGATCCCA GGCCTCATCA TTGTGCTCCG GCCCGACGGG
GCTGTGGAGA GCGTAAACGA TCAGTCCTTG CGATATTTCG GCTATGATTT CAACGAGCAT
CAAAAGTGGA AGACCAACGA TATCATTCAT CCTGACGATC GTGACCGGGG CGTGGCCAGG
TTCGCCGAGG CAGTCGCCGC TGGTCAATCG TACGAGGTAG TGGAAAGGCT CCGTCGGCAC
GACGGCGTCT ATCGTTGGTT CCAGGTGCGG GGAACTCCCG TGCGCGACTA CGAGGGCCTC
GTTGTGCGAT GGTACTTCCT GCTGAATGAT ATCGACGACC GGAAGCACGC CGAAGTGGCG
CTTGCTAACA GCGAGAGGGA ATTCCGCCAT ATCGTCAACA TGGTTCCGGG CATGATCATA
CTTTCGCAGC CTGATGGAAC GCTCGACGGG AGCAACCAAC AGCTACTGGA CTACTTCGGA
ATCTCTCTCG ACGAGGTGCA AGACTGGTCA ACGAACGGCA TAACTCACCC CGATGACGTG
CAAGTCAATA TCGATACCTT CCTGGGCGCA CTCAAGAGCG GCAATCCTTA CGACTACCAA
AGCCGTTATC GAAGACACGA CGGCGTGTTC CGATGGTTCC AGGTGCGAGG ACAGCCACTT
CGTGACGCAG AGGGCAAGAT CGTCCGATGG TATGGACTAC TGACTGACAT CGACGATCGC
AAGCAGGCCG AGGATGAACT GCGTAGAAGT CAGGCCCTTC TCGTTGCAGG CCAGCGCTTG
ATCAGGACGG GTACGTTCTC CTGGCATGTC GAAACCGACG AGCTGATCTT GTCCGACGAA
TGGCTTCGAA TCCTCGAATT CGAAAAGGAC GAAGTCGTGA CGTTTGACCG GATAACGGAG
CGGATACACC CGGACGACGT TGCGCTCTTT GCCGGGAAGA TCGGCGCCGT ACGCGAAGGC
GACGAGGACT CGGAATACGA GGTCCGCGTT CTCGCCCGCA ACGGCGACAT AAAGTACGTT
CGCGTCATCG GCGAGGTTAT AATTCACCGG AATGGAAATC GGGAATGCCT TGGAGCGATC
CAGGACGTGA CACAAAGGCG CTTGACGGAG GAAGCGCGCG ATCGGTTGCG CACAGAGCTT
GCGCGAGTGA CCAGCATCCT CAGCCTCGGC CAGATGTCGG CGGCGATCGC GCACGAGGTG
AACCAGCCAC TGTCAGGCAT CATCACGAAC GCGAACACCT GTCTGCGGAT GCTTGCAGCC
ACACCGCCCG ACATAGAAAC GGCCCTTGAG ACGGCGCGGC GCACGATCAG GGACGGCAAT
CGTGCCACCG AGGTAATCGC CAGACTGCGG GCGTTATTCA GTAAGCGCAA CATCGAATTC
GAAGACGTCG ATATAAACGA GGCGGTGAGT GAGGTAGTCG CCTTGTCTGC AGGTGACCAG
AGACGCAACG GCGTCGCTAT TCGAACGCAC TTCGCCACCT CCCTTCCCCC CGTCAACGGC
GACCGAGTCC AACTTCAACA AGTGATCAAC AACTTGCTTC GCAACGCCAT TGACGCGGTA
TCTGGCGTGA AGGATCGGCT GAGATTAGTC GAGATCCGAA CCCAGCTCGG CGGTGATGGG
CAGATAAGCG TCGCGGTTAG CGACAATGGA ATTGGTCTCG ACCCAGACGG AGGGACGCGG
ATCTTCGAAG CCTTCTACAC GACGAAGAAC AATGGGATGG GGATCGGTCT TTCAGTCTGT
CGTTCCATCA TCGAGAGTCA TGGTGGTCGC CTTTGGGCTG AGCCTAACCA GGGGCCGGGC
GTCACCATGC ACTTCTCCGT ACCGTCGGCC GAAGAGGCAA GCATCACGGC AGCCTCGCAC
TGA
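
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage like the one reported in the record. The helper names are hypothetical, and only the first line of the sequence is used here for brevity, so the printed value is not expected to equal the 58% given for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste the full block to check the whole gene.
first_line = "ATGGGGATAA ACGTCTTTCA ATTCTTCGAT GCGCTGCCCG GCATGGCATG GACCATGCTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```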
 
Protein sequence
MGINVFQFFD ALPGMAWTML EDGQVDFVNR SWVEFTGLRP ASGESWRWED AVHPADVGAV 
TAHLQSVRHA ETQGTIEARL RNAAGEFHHF LIQWAPLEDA PHGVVNWCAV ATNIEPVVQK
RERSQAALDF QLVVDSIPIP VAVTTPAGEV EGLNQLTLSY FGLSLSDLKD WKASEVVHPD
DLKETIEAQI AAHMAGTSYN VESRHLRADG IYRWHNVLGL PLRDQSGAIQ RWLHLLIDID
DRKRAEVALA NSERESRLIV GTIAGMVALF TPEGQLNGAN QQLLDYFQLP LEEVVNWATN
GITHPDDLQH CVETFTASLK TGEPYDFETR FRRHDGEFRW FQVRGHPVKD DNGGIVRWYG
LLTDIDDGRR AVEALREREI ELQLIVNSIP GLIIVLRPDG AVESVNDQSL RYFGYDFNEH
QKWKTNDIIH PDDRDRGVAR FAEAVAAGQS YEVVERLRRH DGVYRWFQVR GTPVRDYEGL
VVRWYFLLND IDDRKHAEVA LANSEREFRH IVNMVPGMII LSQPDGTLDG SNQQLLDYFG
ISLDEVQDWS TNGITHPDDV QVNIDTFLGA LKSGNPYDYQ SRYRRHDGVF RWFQVRGQPL
RDAEGKIVRW YGLLTDIDDR KQAEDELRRS QALLVAGQRL IRTGTFSWHV ETDELILSDE
WLRILEFEKD EVVTFDRITE RIHPDDVALF AGKIGAVREG DEDSEYEVRV LARNGDIKYV
RVIGEVIIHR NGNRECLGAI QDVTQRRLTE EARDRLRTEL ARVTSILSLG QMSAAIAHEV
NQPLSGIITN ANTCLRMLAA TPPDIETALE TARRTIRDGN RATEVIARLR ALFSKRNIEF
EDVDINEAVS EVVALSAGDQ RRNGVAIRTH FATSLPPVNG DRVQLQQVIN NLLRNAIDAV
SGVKDRLRLV EIRTQLGGDG QISVAVSDNG IGLDPDGGTR IFEAFYTTKN NGMGIGLSVC
RSIIESHGGR LWAEPNQGPG VTMHFSVPSA EEASITAASH
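
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is a small pure-Python translator rather than a library call. It assumes the codon-to-amino-acid assignments of table 11, which match the standard code (table 11 differs in which codons may act as start codons). It does not handle alternative start codons, which does not matter here because the gene begins with ATG. All names are illustrative.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene give the first 10 residues of the protein.
print(translate("ATGGGGATAA ACGTCTTTCA ATTCTTCGAT"))  # MGINVFQFFD
```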